① On February 22, the theme forum of the 2025 Global Developer Pioneer Conference, “Corpus Building the Foundation for Intelligent Students”, was held in Xuhui District, Shanghai. It focused on cutting-edge topics in large model corpora and discussed the opportunities and potential of corpus data.
② The forum launched the Corpus Data Intelligent Creative Competition of the Model Shanghai Corpus Inclusive Plan and issued the call for entries for the 2025 Corpus List. It also launched a special project for embodied intelligence corpus and established the Corpus Working Committee.
On February 22, the theme forum of the 2025 Global Developer Pioneer Conference, “Corpus Building the Foundation for Intelligent Students”, officially opened in Xuhui District, Shanghai. The forum was guided by the Organizing Committee of the Global Developer Pioneer Conference, hosted by Shanghai Copas Technology Co., Ltd., and co-organized by Caohejing Development Zone Corporation, Shanghai Artificial Intelligence Laboratory, SenseTime, StepFun, MiniMax and other organizations. Zhang Hongtao, Deputy Director of the Shanghai Municipal Commission of Economy and Informatization, and Yu Linwei, Member of the Standing Committee of the Xuhui District Committee and Deputy District Mayor, attended the forum and delivered speeches.
The forum focused on cutting-edge topics in large model corpora and brought together leading voices from industry, academia and research. Participants discussed the opportunities and potential of corpus data, with the aim of building a thriving large model corpus ecosystem in Shanghai and giving new momentum to the innovative development and application of large AI models.
Zhang Hongtao said that Shanghai has made comprehensive, strategic plans for the large AI model industry and has accelerated the “Model Shanghai” action plan, providing strong foundational support and a rich supply of application scenarios for large models and making the city fertile ground for innovation in the industry. In the future, Shanghai will continue to consolidate its base of high-quality comprehensive corpus, build a core hub for corpus data, speed up innovation in key corpus service technologies, accelerate the “5+6” vertical-field corpus project, improve the industry corpus supply system, and build a thriving, mutually beneficial corpus service ecosystem to better support the innovative development and application of large models.
Zhang Hongtao, Deputy Director of the Shanghai Municipal Commission of Economy and Informatization
Yu Linwei said that Xuhui, as a national-level artificial intelligence industry cluster, took the lead in the city in developing the large AI model industry. The city and the district have jointly launched the country’s first large model innovation ecosystem community, “Model Speed Space”, which has built five functional platforms, including computing power scheduling, open data, and financial services, and provides enterprises with “nanny-style” and “dedicated team” services. In the future, Xuhui will continue to secure the supply of key resources, continue to attract the world’s top talent, and continue to optimize the industry ecosystem for large models and corpus services, with the goal of building Model Speed Space into the “world’s largest artificial intelligence incubator” and building Xuhui into a national AI highland and a peak source of innovation.
Yu Linwei, Member of the Standing Committee of the Xuhui District Committee and Deputy District Mayor
At the meeting, Zhang Hongtao, Deputy Director of the Shanghai Municipal Commission of Economy and Informatization, Huang Weijun, Secretary of the Party Committee and Vice President of Shanghai Information Investment Corporation, Jin Yuchun, General Manager of the Shanghai Branch of People’s Daily Online, and Zhong Junhao, Secretary General of the Shanghai Artificial Intelligence Association, jointly launched the Corpus Data Intelligent Creative Competition of the Model Shanghai Corpus Inclusive Plan (referred to as “CICC”). Building on the Model Shanghai Corpus Inclusive Plan, the competition solicits “good corpus, good technology, and good scenarios” from across society, lays a solid corpus foundation for Shanghai’s “Model Shanghai” project, and opens up the full chain of high-quality corpus data collection, labeling, sharing, and application.
The Corpus Data Intelligent Creative Competition of the Model Shanghai Corpus Inclusive Plan was officially launched
The forum then issued the call for entries for the 2025 Corpus List. To gather the industry’s best thinking and build an open, cooperative ecosystem, Copas launched the first Corpus List at the 2024 World Artificial Intelligence Conference, where a number of strong companies and products stood out. The 2025 Corpus List will keep the basic framework of “good companies, good products, and good rules”. Collection and selection will be completed over the next four months, and the “2025 China Corpus Manufacturers Top 10” and “2025 China Corpus Service Providers Top 10” will be officially released at the 2025 World Artificial Intelligence Conference.
The call for entries for the 2025 Corpus List was officially launched
Embodied intelligence, a key direction at the forefront of large model development, has entered the fast lane, and high-quality data has become the top priority for exploring its application scenarios. At the forum, Copas, together with the National-Local Center, Caohejing Park, Qiongche, Zhiyuan, Songying, Fourier, Xinghaitu, China Power Science Institute 21, Large Model Ecological Development and other organizations, officially launched a special project for embodied intelligence corpus and a “production-accompanying” data collection project. The first phase of the project focuses on three corpus data collection models: “production-accompanying” collection, teleoperation at physical sites, and simulation synthesis. With a scale of up to 50 million items, it is expected to largely establish a world-class, domestically leading supply system and standards system for embodied intelligence corpus data.
The special project for embodied intelligence corpus and the “production-accompanying” data collection project were officially launched
To further promote the construction of high-quality corpus data, Copas, under the guidance of the Shanghai Municipal Commission of Economy and Informatization, joined a first group of 103 companies, research institutions, experts and scholars to initiate the establishment of the Corpus Data Working Committee in a spirit of inclusiveness, connection, and innovation. Focusing on high-quality corpus construction, the Corpus Working Committee will improve the mechanisms linking corpus platforms with vertical application fields and promote a three-way cooperation model among corpus, models, and application scenarios, in order to build a corpus ecosystem that combines high quality with practical application value.
The Corpus Working Committee was officially established
The forum invited industry experts, entrepreneurs, and young scientists to give keynote speeches. During the keynote session, Professor Liu Pengfei of Shanghai Jiao Tong University presented “Thinking and Exploration of the Next Generation of Large Model Training Corpus Data”; Shan Dongming, Chairman of Shanghai Copas Technology Co., Ltd., presented “Five Steps for Vertical Application of Large Models”; and Wang Yu, head of the Scenario and Data Alliance Cooperation Center of the Shanghai Artificial Intelligence Laboratory, presented “Interpretation and Open Source Cooperation Plan of the ‘Thousands of Volumes·Silk Road’ Multilingual Corpus”. The forum also invited Zhou Qi, Chairman of Yilijie (Shanghai) Information Technology Co., Ltd., to speak on “Dynamic Governance and Value Refining of Medical Corpus Data”, and Chen Qin, Chief Economist of Shanghai Maice Data Technology Co., Ltd., to share “Looking at the Impact of Large Models on the Labor Market Through Recruitment Corpus”.
Shan Dongming, Chairman of Shanghai Copas Technology Co., Ltd.
Wang Yu, head of the Shanghai Artificial Intelligence Laboratory Scenario and Data Alliance Cooperation Center
Zhou Qi, Chairman of Yilijie (Shanghai) Information Technology Co., Ltd.
Chen Qin, Chief Economist of Shanghai Maice Data Technology Co., Ltd.
The round-table session was moderated by Miao Guocheng, director of Beijing Yiou Netmeng Technology Co., Ltd. and general manager of its Shanghai company. He was joined by Xiao Jun, executive deputy director of the Shanghai Open Distance Education Engineering Technology Research Center, Tan Chang, executive deputy general manager of Anhui Feishu Information Technology Co., Ltd., Liu Yidong, CTO of Technology Co., Ltd., and Ni Qinyu, head of artificial intelligence data at Shanghai Xinzhi Software Co., Ltd., to discuss the “Tao” and “Technique” of innovative development in artificial intelligence corpus.
As the annual corpus event hosted by Copas, the theme forum of the 2025 Global Developer Pioneer Conference, “Corpus Building the Foundation for Intelligent Students”, offered an in-depth analysis of the development and future trends of the large model corpus industry and presented Copas’s forward-looking vision of a corpus “super factory” and hub platform, intended to lead and advance large model innovation driven by corpus data and industrial applications. With content ranging from high-level industry direction to authoritative releases of results, the forum offered new ideas, new methods, and new paths for the development of the industry. The organizers look forward to moving into the AI era together with all sectors and heading for a smart future.