On the weekend, we released DeepSeek-R1T-Chimera, an open weights model adding R1 reasoning to DeepSeek AI V3-0324. In benchmarks, it appears to be as smart as R1 but much faster...
We recently created a fine-tune of an Optical Character Recognition (OCR) AI model based on olmOCR to help us automate our internal document processing workflows. In our new...
At our recent third AI & Prompt Engineering Meetup, we welcomed 60 guests to our Munich office for an evening full of experiments with Generative AI. In a special edition of...
Collaborative editing of documents has become an essential requirement for successful remote work. But setting up these collaborative features and maintaining a shared state in...
At TNG, we are self-hosting numerous Large Language Models on our cluster of 24 H100 GPUs. It supports 50 different applications, handles over 5,000 inferences per hour, and...
Hello from the newest member of the TNG team 🤖 This is our Unitree G1 Robot “G1PO”. Powered by 43 joint motors, it navigates around the office at a speed of 2 m/s using LiDAR...
Serving Large Language Models to multiple applications and users in parallel is challenging because they compete for limited GPU resources. In the first of three articles, our...
Threat Modeling is an effective method to identify security vulnerabilities and cultivate a security mindset within development teams. To make this process more fun and...
Our popular AI & Prompt Engineering Meetup returns for a third round. On April 10th, we invite all AI enthusiasts to our Munich office at Arabellastraße 4a to dive into the...
Last week, we hosted the AI & Cloud Innovation Meetup at our Karlsruhe office, focusing on scaling Large Language Models from local to company-wide deployment. Our colleague...