End-to-end Autonomous Driving Research: status quo of End-to-end (E2E) autonomous driving
1. Status quo of end-to-end solutions in China
An end-to-end autonomous driving system refers to direct mapping from sensor data inputs (camera images, LiDAR, etc.) to control command outputs (steering, acceleration/deceleration, etc.). It first appeared in the ALVINN project in 1988. It uses cameras and laser rangefinders as input and a simple neural network to generate steering as output.
In early 2024, Tesla rolled out FSD V12.3, featuring an amazing intelligent driving level. The end-to-end autonomous driving solution garners widespread attention from OEMs and autonomous driving solution companies in China.?
Compared with conventional multi-module solutions, the end-to-end autonomous driving solution integrates perception, prediction and planning into a single model, simplifying the solution structure. It can simulate human drivers making driving decisions directly according to visual inputs, effectively cope with long tail scenarios of modular solutions and improve the training efficiency and performance of models.



Li Auto's end-to-end solution
Li Auto believes that a complete end-to-end model should cover the whole process of perception, tracking, prediction, decision and planning, and it is the optimal solution to achieve L3 autonomous driving. In 2023, Li Auto pushed AD Max3.0, with overall framework reflecting the end-to-end concept but still a gap with a complete end-to-end solution. In 2024, Li Auto is expected to promote the system to become a complete end-to-end solution.?
Li Auto's autonomous driving framework is shown below, consisting of two systems:
Fast system: System 1, Li Auto’s existing end-to-end solution which is directly executed after perceiving the surroundings.
Slow system: System 2, a multimodal large language model that logically thinks and explores unknown environments to solve problems in unknown L4 scenarios.

In the process of promoting the end-to-end solution, Li Auto plans to unify the planning/forecast model and the perception model, and accomplish the end-to-end Temporal Planner on the original basis to integrate parking with driving.
2. Data becomes the key to the implementation of end-to-end solutions.
The implementation of an end-to-end solution requires processes covering R&D team building, hardware facilities, data collection and processing, algorithm training and strategy customization, verification and evaluation, promotion and mass production. Some of the sore points in scenarios are as shown in the table:

The integrated training in end-to-end autonomous driving solutions requires massive data, so one of the difficulties it faces lies in data collection and processing.
First of all, it needs a long time and may channels to collect data, including driving data and scenario data such as roads, weather and traffic conditions. In actual driving, the data within the driver's front view is relatively easy to collect, but the surrounding information is hard to say.
During data processing, it is necessary to design data extraction dimensions, extract effective features from massive video clips, make statistics of data distribution, etc. to support large-scale data training.
DeepRoute
As of March 2024, DeepRoute.ai's end-to-end autonomous driving solution has been designated by Great Wall Motor and involved in the cooperation with NVIDIA. It is expected to adapt to NVIDIA Thor in 2025. In the planning of DeepRoute.ai, the transition from the conventional solution to the "end-to-end" autonomous driving solution will go through sensor pre-fusion, HD map removal, and integration of perception, decision and control.

GigaStudio
DriveDreamer, an autonomous driving model of GigaStudio, is capable of scenario generation, data generation, driving action prediction and so forth. In the scenario/data generation, it has two steps:
When involving single-frame structural conditions, guide DriveDreamer to generate driving scenario images, so that it can understand structural traffic constraints easily.
Extend its understanding to video generation. Using continuous traffic structure conditions, DriveDreamer outputs driving scene videos to further enhance its understanding of motion transformation.

3. End-to-end solutions accelerate the application of embodied robots.
In addition to autonomous vehicles, embodied robots are another mainstream scenario of end-to-end solutions. From end-to-end autonomous driving to robots, it is necessary to build a more universal world model to adapt to more complex and diverse real application scenarios. The development framework of mainstream AGI (General Artificial Intelligence) is divided into two stages:
Stage 1: the understanding and generation of basic foundation models are unified, and further combined with embodied artificial intelligence (embodied AI) to form a unified world model;
Stage 2: capabilities of world model + complex task planning and control, and abstract concept induction gradually evolve into the era of the interactive AGI 1.0.
In the landing process of the world model, the construction of an end-to-end VLA (Vision-Language-Action) autonomous system has become a crucial link. VLA, as the basic foundation model of embodied AI, can seamlessly link 3D perception, reasoning and action to form a generative world model, which is built on the 3D-based large language model (LLM) and introduces a set of interactive markers to interact with the environment.

As of April 2024, some manufacturers of humanoid robots adopting end-to-end solutions are as follows:

For example, Udeer·AI's Large Physical Language Model (LPLM) is an end-to-end embodied AI solution that uses a self-labeling mechanism to improve the learning efficiency and quality of the model from unlabeled data, thereby deepening the understanding of the world and enhancing the robot's generalization capabilities and environmental adaptability in cross-modal, cross-scene, and cross-industry scenarios.

LPLM abstracts the physical world and ensures that this kind of information is aligned with the abstract level of features in LLM. It explicitly models each entity in the physical world as a token, and encodes geometric, semantic, kinematic and intentional information.
In addition, LPLM adds 3D grounding to the encoding of natural language instructions, improving the accuracy of natural language to some extent. Its decoder can learn by constantly predicting the future, thus strengthening the ability of the model to learn from massive unlabeled data.
OEMs and Tier 1 Suppliers' Cost Reduction and Efficiency Enhancement Strategy Analysis Report, 2025
ResearchInChina released the "OEMs and Tier 1 Suppliers' Cost Reduction and Efficiency Enhancement Strategy Analysis Report, 2025", summarizing hundreds of cost reduction strategies to provide referen...
Automotive Fixed Panoramic Sunroof and Smart Roof Research Report, 2025
With the intelligent application of car roofs as the core, this report systematically sorts out a series of new products such as fixed panoramic sunroof/openable sunroof, ceiling screen, roof ambient ...
Automotive-Grade Power Semiconductor and Module (SiC, GaN) Industry Research Report, 2025
SiC/GaN Research: Sales volume of 800V+ architecture-based vehicles will increase more than 10 times, and hybrid carbon (SiC+IGBT) power modules are rapidly being deployed in vehicles.
Sales volume o...
Cockpit Agent Engineering Research Report, 2025
Cockpit Agent Engineering Research: Breakthrough from Digital AI to Physical AI
Cockpit Agent Engineering Research Report, 2025 starts with the status quo of cockpit agents, summarizes the technical ...
Prospective Study on L3 Intelligent Driving Technology of OEMs and Tier 1 Suppliers, 2025
L3 Research: The Window of Opportunity Has Arrived - Eight Trends in L3 Layout of OEMs and Tier 1 Suppliers
Through in-depth research on 15 OEMs (including 8 Chinese and 7 foreign OEMs) and 9 Tier 1 ...
China Commercial Vehicle IoV and Intelligent Cockpit Industry Research Report 2025
Commercial Vehicle IoV and Cockpit Research: The Third Wave of Passenger Car/Commercial Vehicle Technology Integration Arrives, and T-Box Integrates e-Call and 15.6-inch for Vehicles
I. The third wav...
Intelligent Vehicle Electronic and Electrical Architecture (EEA) and Technology Supply Chain Construction Strategy Research Report, 2025
E/E Architecture Research: 24 OEMs Deploy Innovative Products from Platform Architectures to Technical Selling Points
According to statistics from ResearchInChina, 802,000 passenger cars with domain...
Research Report on Intelligent Vehicle Cross-Domain Integration Strategies and Innovative Function Scenarios, 2025
Cross-Domain Integration Strategy Research: Automakers' Competition Extends to Cross-Domain Innovative Function Scenarios such as Cockpit-Driving, Powertrain, and Chassis
Cross-domain integration of ...
China Autonomous Driving Data Closed Loop Research Report, 2025
Data Closed-Loop Research: Synthetic Data Accounts for Over 50%, Full-process Automated Toolchain Gradually Implemented
Key Points:From 2023 to 2025, the proportion of synthetic data increased from 2...
Automotive Glass and Smart Glass Research Report, 2025
Automotive Glass Report: Dimmable Glass Offers Active Mode, Penetration Rate Expected to Reach 10% by 2030
ResearchInChina releases the Automotive Glass and Smart Glass Research Report, 2025. This r...
Passenger Car Brake-by-Wire (BBW) Research Report, 2025
Brake-by-Wire: EHB to Be Installed in 12 Million Vehicles in 2025
1. EHB Have Been Installed in over 10 Million Vehicles, A Figure to Hit 12 Million in 2025.
In 2024, the brake-by-wire, Electro-Hydr...
Autonomous Driving Domain Controller and Central Computing Unit (CCU) Industry Report, 2025
Research on Autonomous Driving Domain Controllers: Monthly Penetration Rate Exceeded 30% for the First Time, and 700T+ Ultrahigh-compute Domain Controller Products Are Rapidly Installed in Vehicles
L...
China Automotive Lighting and Ambient Lighting System Research Report, 2025
Automotive Lighting System Research: In 2025H1, Autonomous Driving System (ADS) Marker Lamps Saw an 11-Fold Year-on-Year Growth and the Installation Rate of Automotive LED Lighting Approached 90...
Ecological Domain and Automotive Hardware Expansion Research Report, 2025
ResearchInChina has released the Ecological Domain and Automotive Hardware Expansion Research Report, 2025, which delves into the application of various automotive extended hardware, supplier ecologic...
Automotive Seating Innovation Technology Trend Research Report, 2025
Automotive Seating Research: With Popularization of Comfort Functions, How to Properly "Stack Functions" for Seating?
This report studies the status quo of seating technologies and functions in aspe...
Research Report on Chinese Suppliers’ Overseas Layout of Intelligent Driving, 2025
Research on Overseas Layout of Intelligent Driving: There Are Multiple Challenges in Overseas Layout, and Light-Asset Cooperation with Foreign Suppliers Emerges as the Optimal Solution at Present
20...
High-Voltage Power Supply in New Energy Vehicle (BMS, BDU, Relay, Integrated Battery Box) Research Report, 2025
The high-voltage power supply system is a core component of new energy vehicles. The battery pack serves as the central energy source, with the capacity of power battery affecting the vehicle's range,...
Automotive Radio Frequency System-on-Chip (RF SoC) and Module Research Report, 2025
Automotive RF SoC Research: The Pace of Introducing "Nerve Endings" such as UWB, NTN Satellite Communication, NearLink, and WIFI into Intelligent Vehicles Quickens
RF SoC (Radio Frequency Syst...