News

July 31st, 2025
FAZ article: "Der Boxkampf der KI-Tiger" with assessments from TNG
A recent article in the Frankfurter Allgemeine Zeitung covers the World Artificial Intelligence Conference in Shanghai, providing insights into the new and significant Chinese...

July 8th, 2025
Article: "Event Modeling II - When it actually works"
Our colleague Nils-Oliver Linden created a three-part article series on the topic of Event Modeling, covering the concept and its application in everyday Software Development. Par...

July 3rd, 2025
Release of DeepSeek-TNG R1T2 Chimera
Today, we are releasing the new DeepSeek-TNG R1T2 Chimera, a Tri-Mind Assembly-of-Experts large language model with three parent models: DeepSeek R1-0528, R1, and V3-0324.

June 25th, 2025
New research paper on the "Assembly of Experts" for TNG's Chimera model
Our new paper “Assembly of Experts: Linear-time construction of the Chimera LLM variants with emergent and adaptable behaviors” has been published. It explains how we constructed our 671B R1T Chimera daughter model in less than an hour of CPU time from known base models.

June 24th, 2025
Article: "Event Modeling I - What it is and how it works"
Our colleague Nils-Oliver Linden created a three-part article series on the topic of Event Modeling, covering the concept and its application in everyday Software Development. As...

June 12th, 2025
Article: "How Long Prompts Block Other Requests - Optimizing LLM Performance"
Serving LLMs for over 50 applications, thereby consuming more than 100 million tokens while generating over ten million tokens per day, requires us to carefully tune our request...

June 11th, 2025
Article: "AI-assisted Java Migration"
Our new article shows you how to modernize outdated legacy code with artificial intelligence and migrate it to a new Java version.

May 2nd, 2025
Release of DeepSeek-R1T-Chimera
On the weekend, we released DeepSeek-R1T-Chimera, an open-weights model that adds R1's reasoning capabilities to DeepSeek AI V3-0324. In benchmarks, it appears to be as smart as R1 but much faster...

April 23rd, 2025
Article: "Finetuning olmOCR to be a faithful OCR-Engine"
We recently created a fine-tune of an Optical Character Recognition (OCR) AI model based on olmOCR to help us automate our internal document processing workflows.

April 17th, 2025
Article: "Rapid Prototyping of Collaborative Applications with CRDTs"
Collaborative editing of documents has become an essential requirement for successful remote work. But setting up these collaborative features and maintaining a shared state in...

April 16th, 2025
Article: "Prefill and Decode for Concurrent Requests - Optimizing LLM Performance"
At TNG, we are self-hosting numerous Large Language Models on our cluster of 24 H100 GPUs. It supports 50 different applications, handles over 5,000 inferences per hour, and...

Previous news can be found in the archive.