News May 29th, 2025 DeepSeek-R1T-Chimera: Happy 1B tokens/day! Our R1T Chimera 671B research prototype, constructed from DeepSeek R1 and V3-0324, has now been in the wild for one month.Usage reached up to 1.35 billion processed tokens per day... Our R1T Chimera 671B research prototype, constructed from DeepSeek R1 and V3-0324, has now been in the wild for one month.Usage reached up to 1.35 billion processed tokens per day... May 19th, 2025 Hello and Thank you, USA! TNG USA is now incorporated with its office in Austin, Texas. Thanks to our existing clients in Silicon Valley and New York for the multi-year cooperation, as well as our US... TNG USA is now incorporated with its office in Austin, Texas. Thanks to our existing clients in Silicon Valley and New York for the multi-year cooperation, as well as our US... May 14th, 2025 Congratulations to TUM Boring for winning the Not A Boring Competition 2025 We congratulate TUM Boring on winning the “Not a Boring Competition” in Texas. With its research and development project in the field of tunnel boring machines, the student... We congratulate TUM Boring on winning the “Not a Boring Competition” in Texas. With its research and development project in the field of tunnel boring machines, the student... May 13th, 2025 TNG joins the DevDays Europe 2025 with five talks TNG will be represented with five talks and keynotes at the DevDays Europe, one of the leading software development conferences with more than 700 participants and 100 speakers... TNG will be represented with five talks and keynotes at the DevDays Europe, one of the leading software development conferences with more than 700 participants and 100 speakers... May 8th, 2025 Upgrade of our computing infrastructure Eight new AMD MI325X GPUs joined our compute cluster of 24 H100. The new Supermicro server is AI beast of a machine having 2 Terabytes GPU memory. It gives us more capacity and... Eight new AMD MI325X GPUs joined our compute cluster of 24 H100. The new Supermicro server is AI beast of a machine having 2 Terabytes GPU memory. It gives us more capacity and... May 2nd, 2025 Release of DeepSeek-R1T-Chimera On the weekend, we released DeepSeek-R1T-Chimera, an open weights model adding R1 reasoning to DeepSeek AI V3-0324. In benchmarks, it appears to be as smart as R1 but much faster... On the weekend, we released DeepSeek-R1T-Chimera, an open weights model adding R1 reasoning to DeepSeek AI V3-0324. In benchmarks, it appears to be as smart as R1 but much faster... April 23rd, 2025 Article "Finetuning olmOCR to be a faithful OCR-Engine" We recently created a fine-tune of an Optical Character Recognition (OCR) AI model based on olmOCR to help us automate our internal document processing workflows. In our new... We recently created a fine-tune of an Optical Character Recognition (OCR) AI model based on olmOCR to help us automate our internal document processing workflows. In our new... April 22nd, 2025 Recap of our third AI & Prompt Engineering Meetup At our recent third AI & Prompt Engineering Meetup, we welcomed 60 guests to our Munich office for an evening full of experiments with Generative AI. In a special edition of... At our recent third AI & Prompt Engineering Meetup, we welcomed 60 guests to our Munich office for an evening full of experiments with Generative AI. In a special edition of... April 17th, 2025 Article "Rapid Prototyping of Collaborative Applications with CRDTs" Collaborative editing of documents has become an essential requirement for successful remote work. But setting up these collaborative features and maintaining a shared state in... Collaborative editing of documents has become an essential requirement for successful remote work. But setting up these collaborative features and maintaining a shared state in... April 16th, 2025 Article "Prefill and Decode for Concurrent Requests - Optimizing LLM Performance" At TNG, we are self-hosting numerous Large Language Models on our cluster of 24 H100 GPUs. It supports 50 different applications, handles over 5,000 inferences per hour, and... At TNG, we are self-hosting numerous Large Language Models on our cluster of 24 H100 GPUs. It supports 50 different applications, handles over 5,000 inferences per hour, and... April 4th, 2025 Article "Efficient Request Queueing – Optimizing LLM Performance" Serving Large Language Models to multiple applications and users in parallel is challenging because they compete for limited GPU resources. In the first of three articles, our... Serving Large Language Models to multiple applications and users in parallel is challenging because they compete for limited GPU resources. In the first of three articles, our... April 4th, 2025 Unitree G1 Roboter "G1PO" Hello from the newest member of the TNG team 🤖 This is our Unitree G1 Robot “G1PO”. Powered by 43 joint motors, it navigates around the office at a speed of 2 m/s using LiDAR... Hello from the newest member of the TNG team 🤖 This is our Unitree G1 Robot “G1PO”. Powered by 43 joint motors, it navigates around the office at a speed of 2 m/s using LiDAR... Previous 1 … 4 5 6 … 8 Next Previous news can be found in the archive.