💙 Gate Square #Gate Blue Challenge# 💙
Show your limitless creativity with Gate Blue!
📅 Event Period
August 11 – 20, 2025
🎯 How to Participate
1. Post your original creation (image / video / hand-drawn art / digital work, etc.) on Gate Square, incorporating Gate’s brand blue or the Gate logo.
2. Include the hashtag #Gate Blue Challenge# in your post title or content.
3. Add a short blessing or message for Gate in your content (e.g., “Wishing Gate Exchange continued success — may the blue shine forever!”).
4. Submissions must be original and comply with community guidelines. Plagiarism or re
It's been 7 months, and the Chinese AI model can't learn from ChatGPT
Source: Titanium Media
Author: Lin Zhijia
Similar to the artificial intelligence (AI)-related conferences in the previous few weeks, at the 2023 Global Digital Economy Conference Artificial Intelligence Summit Forum, entrepreneurs and academics discussed the impact of large-scale models and generative AI technologies on society and industry.
"In the past few months, everyone has been discussing when OpenAI and China will be able to make their own large-scale models. In the past few months, large-scale models have emerged one after another. I see that many investors are starting to be anxious." Zhou Hongyi, chairman of 360, said in a speech at the forum He said that the real opportunity for large-scale models lies in China, especially the enterprise-level market, including governments and cities, and China's large-scale model should seize the opportunity of industrial Internet development.
Tang Wenbin, co-founder of Megvii Technology, Zhang Peng, CEO of Zhipu AI, Zhou Bowen, founder of Lianyuan Technology and Huiyan Chair Professor of Tsinghua University, and other guests also agreed with this view. They generally believed that large models should no longer be consumed A lot of money is spent on general large-scale models with hundreds of billions of parameters such as ChatGPT. Instead, it is necessary to build large-scale industry models with a billion-level industry, go deep into To B industry solutions, and solve practical problems in vertical fields.
In fact, as ChatGPT set off an upsurge in the investment and industrial development of China's AI large models, companies have followed ChatGPT and strived to become the "Chinese version of OpenAI". Now more than 80 large models have been publicly tested.
**But the problem is that many investors and entrepreneurs found that the technical threshold of large-scale models is not high after seven months of "rolling". Commercial solutions, while ChatGPT’s “$20 per month” Plus membership service can only become “free” in China, and the continuous AI technology investment of OpenAI is not feasible in China. **
**In the end, it was discovered that only customers in several vertical fields such as government affairs, finance, and medical care can pay for the "big model". **
Enterprises no longer compete with AI large-scale model technology, but return to commercialization
Over the past six months or so, large-scale model technologies represented by ChatGPT and GPT-4 have led a new round of global AI innovation. Large-scale model research at home and abroad is iterative at a high speed, various models emerge in an endless stream, and the performance of the models continues to improve.
In China, from Baidu Wenxin Yiyan, Ali Tongyi Qianwen, Huawei Pangu large model, to 360 smart brain, Zhipu AI ChatGLM, etc., all walks of life are building large models. According to Jiang Guangzhi, director of the Beijing Municipal Bureau of Economy and Information Technology, more than 80 large-scale models have been released in China so far, of which Beijing accounts for about half (more than 40).
"We seem to have climbed a mountain for a long time, and finally saw a new peak and a new dawn." Jiang Guangzhi said in his speech that in the past two decades, we have experienced the Internet, smart phones, cloud computing, big data, The Internet of Things, a series of major changes in IT technology, has profoundly changed all aspects of our production and life, and now we have ushered in a new era of general AI.
However, compared with the payment model of consumer-level platforms such as ChatGPT, China's AI large models are generally free for internal testing, and users can obtain a license for use as long as they apply for internal testing on the platform. Not only that, under the "Hundred Models War", everyone makes large models, and it is difficult to form barriers to the enterprise itself and the industry. In addition, the cost of AI computing power is high, and the commercialization of large models is not as easy as imagined.
To put it simply, the domestic AI model is far from OpenAI's business model and technology cost. OpenAI can invest hundreds of millions of dollars in computing power training costs, which cannot be achieved by domestic small and medium-sized AI companies, especially most of which are concentrated on the application side.
"The amount of parameters is actually a dialectical issue. Hundreds of billions of trillions of parameters just represent your highest ability, but high parameters mean higher requirements for thinking ability and high computing power, and customer scenarios do not necessarily require such high computing power requirements. Because not all scenarios can accept the cost of 100 billion parameters.” Dai Wenyuan, the founder and CEO of 4Paradigm, told Titanium Media App in May this year that under various constraints and companies looking for profits, To B customers began to choose cost-effective Controlled vertical AI mockup.
**At the 2023 Global Digital Economy Conference Artificial Intelligence Summit Forum, Zhou Hongyi, Zhang Peng, Zhou Bowen and other guests generally mentioned the "industrialization" and "verticalization" of large models, and went deep into government affairs, finance, cultural and creative industries, medical care and other industries Expansion, and discussions around model security, credibility, and controllability. **
Zhou Hongyi bluntly stated that the large-scale model that the enterprise-level market really needs in the future must conform to the characteristics of industrialization, enterpriseization, verticalization, miniaturization, and proprietaryization. Not only that, but in his opinion, the construction of enterprise-level large models must adhere to the following three principles:
At the same time, Zhou Hongyi also suggested that enterprises should first enable the "assistant" and "co-pilot" modes when using the large model, so that the large model can remain relatively independent from the existing business system and maintain isolation, which is also safer and more controllable for the enterprise. In addition, the large enterprise model should be used by both leaders and employees, so that AI is inclusive.
"It is always difficult for large models to complete a job independently, and many employees are resistant to large models. It is still difficult for large models to complete a job independently. More positioning is human Good tools and knowledge assistants, so the development of large models should be people-oriented.” Zhou Hongyi mentioned that ease of use is the first principle of the development of large models.
Zhou Hongyi judged that the digital human will become an important entrance and carrying form of enterprise-level large models. The "soul" digital assistant released by 360 Smart Brain can solve the problem of ease of use of large models. At the scene, Zhou Hongyi also showed the customized "Beijing Customer Service Can't Ask" and "Beijing City Merchants", two large-scale digital humans trained for Beijing to solve some core needs in the government and enterprise fields.
"We have three versions, one is end-to-end model training service, and has helped you complete some model migration training on the self-built computing power platform; the other is to provide model building services and licenses; the third is to cooperate with cloud computing vendors, Proposed API calls and Model Instance services to help everyone quickly build powerful infrastructure capabilities." Zhang Peng said.
Titanium Media App learned that Zhipu AI is developing a new ChatGLM2 large-scale model product, which reduces the number of parameters but improves data quality. Ability has been greatly improved. In multiple evaluations, ChatGLM2 scores more than GPT-4 and ChatGPT.
Zhou Bowen, founder of Lianyuan Technology and Huiyan Chair Professor of Tsinghua University, said that whether Al can be fully integrated with business is the key factor that determines whether Al can realize economic value. Only with a business-oriented AI strategy design, a complete supporting structure, sufficient AI talents, and a sound internal training mechanism can AI be fully integrated with business development needs and maximize economic benefits.
Fang Han, CEO of Kunlun Wanwei, believes that at present, large-scale models are in a state of reducing costs at the B-side and increasing efficiency at the C-side.
He believes that China's B-end service companies can see that it is difficult for any company to monopolize the entire B-end service. Due to the large model’s demand for industry data, it is inevitable that every large model company can achieve first-mover success in one or two industries. No company can succeed in all industries, and it is difficult for any company to obtain The data of the whole industry; and the C-end is bound to be fragmented. Because the payment habits of the Chinese market are actually very different from overseas, the free habits of Chinese users are very obvious. All companies that provide services to C-end users must be based on The free mode is the main mode, and the VIP mode is supplemented.
"The big tide will rise, and landing is king. We hope that this wave of entrepreneurship and investment in general artificial intelligence will be different from the previous wave, and it can land faster and generate users and income." Fang Han's theory is still biased Commercialization of large models. He mentioned that the AI-generated music products developed by Kunlun Wanwei have already landed in the cultural and tourism scene, and signed an agreement with Beijing Dongcheng District in April this year.
This is the first time that China Telecom has announced its large-scale model products. Zhang Xin also said that compared with other large models, most of TeleChat products use domestic Chinese big data, 90% of which are mainly domestic, and based on China Telecom's Tianyi Cloud and cloud-network fusion base, TeleChat is used for model training parameters. At that time, the capacity can reach 47%, and the efficiency of model training and model algorithm capabilities are still improving.
In the application scenario, China Telecom's TeleChat model has begun to be intelligentized by manufacturers to solve the problems of talent shortage and insufficient coverage of some voice operators. Through the improvement of AI technology, the modernization of social governance capabilities can be realized and cost consumption can be reduced.
Guo Fan, Vice President of Unisound’s Innovation Division, once mentioned that the demonstration application of the outpatient medical record generation system based on the large model of mountains and seas jointly developed by Unisound and Beijing Friendship Hospital, in the field of smart medical care, based on Unisound’s 70 billion parameter-scale automatic Research the "Mountain and Sea" large model, combined with front-end sound signal processing, voiceprint recognition, speech recognition, speech synthesis and other full-stack intelligent voice interaction technologies, it is expected to improve the efficiency of doctors' electronic medical record entry by more than 400%, saving a single patient consultation time More than 40%, improving the efficiency of doctors' outpatient services by more than 66%.
In fact, large models are typically a winner-take-all area. More money, more computing power, and better talents are needed. Because better computing power means more people use it, more people use it means more data, and more data means better computing power results. The big model must be a battleground for the giants, who have money, technology, and more importantly, data.
But start-up companies have too many projects and their funds are too scattered, especially at the enterprise end, they end up consuming funds to buy Nvidia A100 cards and cloud services, without a quantified enterprise development process. Especially when AI companies and SaaS companies are generally difficult to make profits and make blood, start-up companies need to commercialize and make profits with large models.
Therefore, the current domestic AI large-scale model is taking shape based on the general-purpose large-scale model base and computing power center established by large companies such as Ali, Tencent, Baidu, Huawei, and SenseTime, as well as including Momo Zhixing, Tianyancha, Yunzhisheng, and China The large vertical or industry domain models established by companies such as Kewenge and Megvii only focus on one or two areas to solve core problems.
The government supports the landing of large-scale model scenes
At this forum, the Beijing Economic and Information Bureau once again announced the second batch of members of the Beijing General Artificial Intelligence Industry Innovation Partnership Program, and 63 companies were selected.
It is understood that as of June 30, a total of 416 large-scale model R&D and application companies inside and outside Beijing have applied to join the second batch of "Partnership Program". In the end, 63 companies, including Beijing Baidu Netcom Technology Co., Ltd., China Power Data Service Co., Ltd., and Beijing Jizhi Future Artificial Intelligence Industry Innovation Base Co., Ltd., were selected and announced. Among them, there are 10 computing power partners, 10 data partners, 10 model partners, 24 application partners, and 9 investment partners. In addition, 30 model observers were evaluated.
According to the estimates of Titanium Media App, as of now, more than 80 companies and institutions have been selected into the Beijing General Artificial Intelligence Industry Innovation Partnership Program.
**At present, Beijing is rapidly promoting the construction of AI large-scale models and industrial layout. **
On May 21, the Beijing Municipal People's Government issued a notice on the "Implementation Plan for Beijing to Accelerate the Construction of a Globally Influential Artificial Intelligence Innovation Source (2023-2025)". By 2025, Beijing's artificial intelligence technology innovation and industrial development will enter a new era. In the stage of development, basic theoretical research has made breakthroughs, the influence of original innovation achievements has continued to increase, and the scale of the artificial intelligence industry has continued to increase, forming an industrial cluster with international competitiveness and technological dominance.
On May 23, the General Office of the Beijing Municipal People's Government issued a notice of "Several Measures to Promote the Innovation and Development of General Artificial Intelligence in Beijing", requiring that the government's guiding role and the catalytic role of innovation platforms be fully utilized to integrate innovation resources, strengthen the allocation of elements, and create innovations. ecology, attach importance to risk prevention, and promote Beijing's general artificial intelligence to achieve innovation leadership and rational and healthy development.
A person in charge of the Beijing Municipal Administration Service Bureau mentioned in a live speech that in order to support the construction of the platform "Jingce", the government needs to implement general large-scale model technology in the field of scenarios, so as to improve policy management and precise service capabilities. "In-depth mining and analysis of massive data on citizens' appeals provides stronger support for leadership decision-making, grassroots governance, and urban governance."
The person in charge mentioned above mentioned that in terms of scene advancement, an open small interface model will be used in the early stage, and privatized in-depth applications will be carried out later. In the long run, they will open up high-quality and credible government data sets by establishing digital protection mechanisms and technical evaluations under the condition of data security and controllability.
Wang Zhangsheng, head of Zhongke Wenge Delivery Center, mentioned in an exchange with Titanium Media App that government customers have high demands for large AI models, and this field also requires data security and data governance in the application of large models. Conditions, especially high-quality data training, so as to better solve practical application problems in this field.
Midu CTO Liu Yidong told Titanium Media App that the company started using Huawei Cloud infrastructure products last year to launch business in government affairs, media and other fields, and now the company is about to release large-scale model products for vertical industries that focus on online public opinion.
"The government has begun to pay for the 'big model'. On the one hand, it is policy guidance and demands for practical application scenarios. On the other hand, enterprises themselves need to solve cost problems and commercialization problems through large models." An industry insider analyzed to Titanium Media App, The domestic large-scale model industry is returning to the period when AI companies and the government are combined to form commercialization and receive government subsidies.
Jiang Guangzhi said that at present, Beijing is accelerating the construction of the National New Generation Artificial Intelligence Innovation and Development Experimental Zone and the National Artificial Intelligence Innovation Application Pilot Zone to create a globally influential source of artificial intelligence innovation. The specific measures include the following three points:
Jiang Guangzhi emphasized that the Beijing Economic and Information Bureau will continue to strengthen the allocation of high-quality resource elements, effectively integrate innovative resources, actively create an innovative ecology, and lay a solid foundation for the development of the artificial intelligence industry.
"Large models cannot be made by one company. It is best for everyone to form several large ecosystems at the levels of computing power, models, and data. Especially under the guidance of government departments, companies can form partnerships in the ecosystem, so that Everyone is more about cooperation than competition. In the future, the large model may form a relationship with ecology, partners, and win-win cooperation.” Ji Haiquan, executive director of Legend Capital, said.