In August 2025, CRRC successfully passed the Level 4 evaluation—the highest rating—for trusted AI dataset quality by the China Academy of Information and Communications Technology (CAICT), becoming the first central state-owned enterprise (SOE) in the manufacturing industry to receive this recognition.
The evaluation was conducted in strict accordance with the industry standard General Requirements for Dataset Quality Assessment for Artificial Intelligence (2021-1303T-YD), led by CAICT. This milestone marks CRRC’s entry into the “top tier” of high-quality AI data development in the manufacturing sector.
For the rail transit equipment manufacturing industry, high-quality datasets cover the three core business areas of R&D and design, production and manufacturing, and operations and maintenance services. CRRC has built specialized datasets for high-value application scenarios in these areas, ensuring close alignment between data resources and industry needs.
During the assessment, expert teams carried out comprehensive quantitative reviews against 12 primary indicators—including completeness, standardization, accuracy, timeliness, consistency, density, diversity, balance, relevance, originality, adaptability, and accessibility—along with related secondary metrics. CRRC’s datasets met the highest standards across all criteria, demonstrating its professionalism and rigour in data quality management.
In the era of large models, high-quality AI datasets are essential for building accurate, efficient, and reliable systems. CRRC will continue to strengthen the construction of high-quality datasets in rail transit equipment manufacturing, enhance its data quality management system, and expand application value. CRRC will also further integrate AI with its rail transit businesses, driving industry-wide digital and intelligent transformation and contributing solid data foundations and technical support for the high-quality development of China’s rail transit equipment manufacturing sector.