Li Auto unveils next-gen autonomous driving architecture MindVLA

Beijing (Gasgoo)- On March 18, 2025, Li Auto's head of autonomous driving technology R&D, Mr. Jia Peng, delivered a keynote speech at NVIDIA GTC 2025, sharing insights into the company's latest advancements in its next-generation autonomous driving technology, MindVLA.

微信图片_20250318160903.png

Photo credit: Li Auto

MindVLA is an innovative autonomous driving model based on a dual-system architecture integrating end-to-end learning and Vision-Language Models (VLM). As a new paradigm in large-scale robotic models, MindVLA endows autonomous vehicles with enhanced 3D spatial comprehension, logical reasoning, and behavior generation capabilities, allowing them to perceive, think, and adapt to dynamic environments.

微信图片_20250318160907.png

Photo credit: Li Auto

Unlike a simple combination of end-to-end and VLM models, MindVLA features an entirely new design. Its 3D spatial encoder integrates language models and logical reasoning to generate driving decisions, outputting action tokens—a representation of environmental and driving behaviors. These tokens undergo further optimization via a diffusion model to determine the optimal driving trajectory in real time, all processed on-vehicle.

Leveraging a self-developed unified cloud-based world model, MindVLA integrates 3D scenario reconstruction, generative view completion, and unseen perspective prediction to create a highly realistic simulation environment. This enables large-scale closed-loop reinforcement learning, allowing the model to continuously improve through experience. Li Auto said it has significantly optimized its world model over the past year, increasing 3D GS training speeds by over sevenfold.

微信图片_20250318160919.png

Photo credit: Li Auto

MindVLA redefines the autonomous driving experience, enabling vehicles to understand and respond to voice commands in real-time. Users can issue natural language instructions, such as "Find me a supermarket" in an unfamiliar area, without predefined navigation. The vehicle will autonomously explore and locate the destination. Additionally, drivers can make real-time adjustments, such as "Slow down" or "Take the left lane," with the system understanding and executing the commands seamlessly.

Li Auto unveils next-gen autonomous driving architecture MindVLA

Related Articles

NIO Battery R&D Base Settles in Shanghai's Jiading District

Voyah Taishan Ultra and Black Knight Editions to Launch on March 17

Seeds | Laying "Foundation" for Embodied AI, RealMan Completes Nearly 500 Million Yuan Financing

XPENG AEROHT "Land Aircraft Carrier" Rolls Off Batch Trial Production Line