End-to-end Autonomous Driving Research: status quo of End-to-end (E2E) autonomous driving
1. Status quo of end-to-end solutions in China
An end-to-end autonomous driving system refers to direct mapping from sensor data inputs (camera images, LiDAR, etc.) to control command outputs (steering, acceleration/deceleration, etc.). It first appeared in the ALVINN project in 1988. It uses cameras and laser rangefinders as input and a simple neural network to generate steering as output.
In early 2024, Tesla rolled out FSD V12.3, featuring an amazing intelligent driving level. The end-to-end autonomous driving solution garners widespread attention from OEMs and autonomous driving solution companies in China.?
Compared with conventional multi-module solutions, the end-to-end autonomous driving solution integrates perception, prediction and planning into a single model, simplifying the solution structure. It can simulate human drivers making driving decisions directly according to visual inputs, effectively cope with long tail scenarios of modular solutions and improve the training efficiency and performance of models.



Li Auto's end-to-end solution
Li Auto believes that a complete end-to-end model should cover the whole process of perception, tracking, prediction, decision and planning, and it is the optimal solution to achieve L3 autonomous driving. In 2023, Li Auto pushed AD Max3.0, with overall framework reflecting the end-to-end concept but still a gap with a complete end-to-end solution. In 2024, Li Auto is expected to promote the system to become a complete end-to-end solution.?
Li Auto's autonomous driving framework is shown below, consisting of two systems:
Fast system: System 1, Li Auto’s existing end-to-end solution which is directly executed after perceiving the surroundings.
Slow system: System 2, a multimodal large language model that logically thinks and explores unknown environments to solve problems in unknown L4 scenarios.

In the process of promoting the end-to-end solution, Li Auto plans to unify the planning/forecast model and the perception model, and accomplish the end-to-end Temporal Planner on the original basis to integrate parking with driving.
2. Data becomes the key to the implementation of end-to-end solutions.
The implementation of an end-to-end solution requires processes covering R&D team building, hardware facilities, data collection and processing, algorithm training and strategy customization, verification and evaluation, promotion and mass production. Some of the sore points in scenarios are as shown in the table:

The integrated training in end-to-end autonomous driving solutions requires massive data, so one of the difficulties it faces lies in data collection and processing.
First of all, it needs a long time and may channels to collect data, including driving data and scenario data such as roads, weather and traffic conditions. In actual driving, the data within the driver's front view is relatively easy to collect, but the surrounding information is hard to say.
During data processing, it is necessary to design data extraction dimensions, extract effective features from massive video clips, make statistics of data distribution, etc. to support large-scale data training.
DeepRoute
As of March 2024, DeepRoute.ai's end-to-end autonomous driving solution has been designated by Great Wall Motor and involved in the cooperation with NVIDIA. It is expected to adapt to NVIDIA Thor in 2025. In the planning of DeepRoute.ai, the transition from the conventional solution to the "end-to-end" autonomous driving solution will go through sensor pre-fusion, HD map removal, and integration of perception, decision and control.

GigaStudio
DriveDreamer, an autonomous driving model of GigaStudio, is capable of scenario generation, data generation, driving action prediction and so forth. In the scenario/data generation, it has two steps:
When involving single-frame structural conditions, guide DriveDreamer to generate driving scenario images, so that it can understand structural traffic constraints easily.
Extend its understanding to video generation. Using continuous traffic structure conditions, DriveDreamer outputs driving scene videos to further enhance its understanding of motion transformation.

3. End-to-end solutions accelerate the application of embodied robots.
In addition to autonomous vehicles, embodied robots are another mainstream scenario of end-to-end solutions. From end-to-end autonomous driving to robots, it is necessary to build a more universal world model to adapt to more complex and diverse real application scenarios. The development framework of mainstream AGI (General Artificial Intelligence) is divided into two stages:
Stage 1: the understanding and generation of basic foundation models are unified, and further combined with embodied artificial intelligence (embodied AI) to form a unified world model;
Stage 2: capabilities of world model + complex task planning and control, and abstract concept induction gradually evolve into the era of the interactive AGI 1.0.
In the landing process of the world model, the construction of an end-to-end VLA (Vision-Language-Action) autonomous system has become a crucial link. VLA, as the basic foundation model of embodied AI, can seamlessly link 3D perception, reasoning and action to form a generative world model, which is built on the 3D-based large language model (LLM) and introduces a set of interactive markers to interact with the environment.

As of April 2024, some manufacturers of humanoid robots adopting end-to-end solutions are as follows:

For example, Udeer·AI's Large Physical Language Model (LPLM) is an end-to-end embodied AI solution that uses a self-labeling mechanism to improve the learning efficiency and quality of the model from unlabeled data, thereby deepening the understanding of the world and enhancing the robot's generalization capabilities and environmental adaptability in cross-modal, cross-scene, and cross-industry scenarios.

LPLM abstracts the physical world and ensures that this kind of information is aligned with the abstract level of features in LLM. It explicitly models each entity in the physical world as a token, and encodes geometric, semantic, kinematic and intentional information.
In addition, LPLM adds 3D grounding to the encoding of natural language instructions, improving the accuracy of natural language to some extent. Its decoder can learn by constantly predicting the future, thus strengthening the ability of the model to learn from massive unlabeled data.
Global and China Passenger Car T-Box Market Report, 2026
Based on 2025 market data and the latest business layouts of OEMs and suppliers from 2025 to 2026, this report analyzes the development status quo and future trends of China’s passenger car T-Box mark...
Global and China Range Extended Electric Vehicle (REEV) and Plug-in Hybrid Electric Vehicle (PHEV) Research Report, 2026
Research on REEVs and PHEVs: Foreign OEMs are considering extended-range technology as an important strategic option and will launch a series of new vehicles
Global PHEVs & REEVs tend to be domin...
Automotive Voice Industry Report, 2026
Automotive Voice Research: Explosive Growth in Features Like "See and Speak", 35-Fold Increase in External Voice Interaction in Two Years
ResearchInChina has released the Automotive Voice Industry R...
China Passenger Car Digital Chassis Research Report, 2026
Research on Digital Chassis: Leading OEMs Have Completed Configuration of Version 2.0 1. Leading OEMs Have Completed Configuration of Digital Chassis 2.0
By the degree of wired control of each c...
Vehicle Functional Safety and Safety Of The Intended Functionality (SOTIF) Research Report, 2026
Multiple Mandatory Standards for Intelligent Vehicles in China Upgrade Functional Safety Requirements from Recommended to Mandatory Access Criteria In 2026, China has intensively issued and promo...
Automotive 12V/48V Low-Voltage Lithium-ion Battery/Sodium-ion Battery Industry Research Report, 2026
Research on 12V/48V automotive low-voltage lithium-ion (sodium-ion) batteries: promoted by regulations and standardization, it is imperative to "replace lithium-ion (sodium-ion) batteries with lead-ac...
Next-Generation Automotive Wireless Communication Technologies (6G/5G-A, NearLink, Satellite Communication, UWB, etc.) and Automotive Communication Module Industry Report, 2026
Research on Next-Generation Communication and Modules: Accelerated Deployment of 5G-A, Satellite Communication, NearLink, UWB and Other Technologies in Automobiles
Automotive wireless communication t...
Research on Zonal Architecture: Smart Actuators (Micro-motors) and Application Trends in Sub-scenarios, 2026
Smart Actuator and Micro-motor Research: Under Zonal Architecture, Actuators Are Developing towards Edge Computing, 48V, and Brushless Motors.
The core components of automotive zonal architecture mai...
China Passenger Car Navigate on Autopilot (NOA) Industry Report, 2025
In 2025, NOA standardization was popularized, refined and deepened in parallel. In 2026, core variables will be added to the competitive landscape.
The evolution of autonomous driving follows a clear...
Smart Car OTA Industry Report, 2025-2026
Automotive OTA Research: In the Era of Mandatory Standards, OTA Transforms from a "Function Channel" to a New Stage of "Full Lifecycle Management"
Driven by the development and promotion of AI and so...
Automotive AI Box Research Report, 2026
Automotive AI Box Research: A new path of edge AI accelerates
This report studies the current application status of automotive AI Box from the aspects of scenario demand, product configuration, and i...
Automotive Fragrance and Air Conditioning System Research Report, 2025
Automotive Fragrance and Air Purification Research: Intelligent Fragrance Equipment to Exceed 4 Million Units by 2030, "All-in-One" Integrated Purification Becomes Mainstream
The "Automotive Fragranc...
Intelligent Vehicle Cockpit-driving Integration (Cockpit-driving-parking) Industry Report, 2025
Cockpit-Driving Integration Research: 36% CAGR by 2030, Single-Chip Cockpit-driving integration Solutions Enter Mass Production
ResearchInChina releases the "Intelligent Vehicle Cockpit-driving Integ...
Research Report on Overseas Layout of Chinese Passenger Car OEMs and Supply Chain Companies, 2025
Automotive Overseas Expansion Research: Accelerated Release of OEM Overseas Production Capacity, Chinese Intelligent Supply Chain Goes Global
This report conducts an in-depth analysis of the current ...
Passenger Car Intelligent Steering Industry Research Report, 2025-2026
Intelligent steering research: Rear-wheel steering prices drop to RMB200,000-250,000
1. Rear-wheel steering installations increased by 36.5% year-on-year.
From January to October 2025, the number of...
Global Autonomous Driving Policies & Regulations and Automotive Market Access Research Report, 2025-2026
Research on Intelligent Driving Regulations and Market Access: New Energy Vehicle Exports Double, and "Region-Specific Policies" Adapt to Regulatory Requirements of Various Countries in A Refined Mann...
Two-wheeler Intelligence and Industry Chain Research Report, 2025-2026
Two-Wheeler Electric Vehicle Research: New National Standard Drives Intelligent Popularization, AI Agent Makes Its Way onto Vehicles
ResearchInChina releases the "Two-wheeler Intelligence and Industr...
China Smart Door and Electric Tailgate Market Research Report, 2025
Smart Door Research: Driven by Automatic Doors, Knock-Knock Door Opening, etc., the Market Will Be Worth Over RMB100 Billion in 2030.
This report analyzes and researches the installation, market size...