中文
English

JD's hidden move has been dormant for five years

2025-04-02

JD's hidden move has been dormant for five years

Miao Zhengqing

Miao Zhengqing

Tiger Sniff Official Team

follow with interest

Product | Tiger Sniff Commercial Consumer Group


Author | Miao Zhengqing


Title Image | Visual China



Article Summary

JD.com is focusing on the B-end market in the AI field, launching products such as Yanxi Big Model, Digital Human, and Intelligent Agent Platform, and integrating DeepSeek open-source models to reduce costs. In the face of industry competition, JD.com optimizes its models through technologies such as reinforcement learning and synthetic data, solves the illusion problem, and lays out a three-step strategy (language big model → multimodal → embodied intelligence) to explore the commercialization path of AI in customer service, marketing, and other scenarios.


•    Technological breakthrough: DeepSeek's deep reasoning and open-source model reshape the industry landscape, driving AI into the L2 stage


•    Strategic Focus: JD focuses on deploying B-end AI tools, covering 800000 merchants, and focusing on customer service, marketing, and digital human scenarios


•    Open Source Impact: DeepSeek breaks down closed source barriers and forces companies to explore differentiated business models


•    Three step path: language model → multimodal → embodied intelligence, JD anchors the evolution direction of AGI technology


•    Application breakthrough: JD focuses on solving the DeepSeek illusion problem and improving the reliability of serious applications in B-end scenarios


•    Future prediction: Synthetic data and reinforcement learning become key, AI will deeply integrate physical world task interaction

Faced with the impact of DeepSeek, Internet giants are readjusting their AI strategies.



In early March, Tencent Yuanbao fully embraced DeepSeek and leveraged DeepSeek to achieve a comeback in popularity. Almost simultaneously, ByteDance's Volcano Engine and Feishu, as well as Alibaba's International Station, Alibaba Cloud, and DingTalk, announced their integration into DeepSeek.



DeepSeek, a major Internet company, has become increasingly fierce, and behind this is the key game of each major company against AI.



In this wave of DeepSeek integration frenzy, JD.com is also among them. In early February, JD Cloud officially launched the DeepSeek-R1 and DeepSeek-V3 models, and took the lead in launching the DeepSeek all-in-one machine in the industry.



Unlike Tencent and ByteDance's efforts in the AI to C market through Yuanbao and Doubao, in 2024, JD.com will focus on developing B-end products such as the Yanxi big model, digital humans, intelligent agent platforms, and intelligent coding assistant JoyCoder on the AI side. As of the end of 2024, 800000 merchants on the JD platform have already used JD AI tools.



However, facing JD.com is also a more intense 2025: with ByteDance and Alibaba investing in the AI to B field, as well as AI unicorn companies such as Baichuan and Zhipu further shifting towards the B-end market, competition around AI to B will further intensify.



What underlying logic has DeepSeek changed in the AI community? What are the key opportunities in the B market? Is JD.com ready?



After JD Cloud announced its integration with DeepSeek, Tiger Sniff immediately communicated with He Xiaodong, President of JD Technology's Artificial Intelligence Business Unit and Dean of JD Exploration Research Institute, asking him to share a series of thoughts on the impact of DeepSeek, competition in the AItoB industry, and trends in AI technology. It is worth noting that the Chinese Association of Artificial Intelligence recently released the announcement of "Wu Wenjun Artificial Intelligence Science and Technology Award" in 2024, and the artificial intelligence team of JD Science and Technology won the special prize of Wu Wenjun Artificial Intelligence Science and Technology Award, the highest prize of China's intelligent science and technology, by virtue of the project of "key technologies and industrial applications of multimodal interactive digital people", which is also the only special prize of this year.



Attached is a transcript of the communication, with some deletions and modifications:



DeepSeek reshapes the business landscape



Tiger Sniff: What was the biggest impact of DeepSeek's wave of enthusiasm at the beginning of the year on you?


He Xiaodong: I think there are two interesting points, DeepSeek's deep reasoning technology and its open-source R1 model. These two points have profoundly reshaped the landscape of the AI industry.



Technically speaking, OpenAI has divided its API into five levels, namely L1 to L5. The first level focuses on understanding human-computer dialogue language, while the second level involves deep reasoning. If we look at it according to this classification, OpenAI's O1 and DeepSeek's R1 mean that we have officially moved from L1 to L2 stage.



In fact, by the end of 2023, there were rumors in the Silicon Valley tech community that OpenAI was developing a reinforcement learning algorithm project called Q-Star in preparation for GPT-5. At that time, we only heard the name, but as soon as we heard the name, we could think that Q-ability is the classic reinforcement learning. We associate it with a method similar to AlphaGo, which combines reinforcement learning with search for deep reasoning. At that time, we thought that Go was ultimately a fixed or closed domain, and if we could achieve open domain, that is, open intelligence on large models, there would actually be many technical challenges.



But it can be seen that by the end of 2023, everyone has already seen this technological direction, and it will start to be developed in July 2024. In April 2024, when I attended an AI forum at Tsinghua University, I expressed a viewpoint that there may be two directions in the future. One is deep reinforcement learning, which uses models to train models, because relying solely on human training models is already approaching the top of GPT 4. The second direction is "synthetic data", because ordinary data is almost used up, and the next step may require more efficient synthetic data to do this.



It can be said that technological progress is indeed quite fast. Although many people have seen this technological direction, not many have actually achieved it. But for the future, everyone should have more confidence, especially after the open source of models and papers.



Another impact or influence is that open source itself has reshaped the business landscape. For example, OpenAI was the first to be impacted, and its business model was greatly affected. But ultimately, open source is a very good thing. It reduces the cost of models, opens up space for applications, and allows companies of all sizes to use AI capabilities to build applications at a lower cost. This will bring about a diverse ecosystem.



JD.com is also considering how to take advantage of this opportunity, including quickly uploading DeepSeek to JD Cloud to provide this capability to our large and small customers. We also offer the ability to privately deploy DeepSeek, including industry-specific solutions.



Tiger Sniff: As you mentioned earlier, everyone has been discussing or seeing some trends since the end of 2023, but why is there only one company that has emerged?


He Xiaodong: In the first half of 2024, many people still focus on the "big" aspect, such as pursuing larger models, more cards, and achieving trillion or 10 trillion scale. People only see the so-called size scale and do not truly realize the importance of reasoning.



In the second half of 2024, everyone will see the importance of reasoning. DeepSeek is a very pure technology company, and based on this purity, they can better see the depth of the technology itself, so they will invest more resolutely in reasoning and other aspects.



Tiger Sniff: Another hot topic sparked by DeepSeek is open source. It seems that a year ago, even in the fourth quarter of 2024, the big modeling circle's judgment on open source and closed source is not as strong as it is today. Why is DeepSeek resolutely taking this path?


He Xiaodong: Going back to a year ago, only OpenAI's technology was leading and could stand out from others. It was not open source, so people would think that the significance of open source was not that great. Actually, over a year ago, Meta's Llama model was also open sourced, but its performance was indeed slightly inferior to OpenAI. And, people may think that closed source may be a viable business model.



Why has DeepSeek had such a significant impact on open source? I think it's because it challenged OpenAI's moat with a high-quality model, and people will think that the open source ecosystem can be built. So often, the influence of key companies and individuals can change people's fundamental views on certain issues.



Tiger Sniff: What impact does the open source imagination space have on what JD.com is doing?


He Xiaodong: The AI technology competition will continue, which means further investment is needed. We can't stop and we won't stop. From JD's practice of expanding models, we have seen many opportunities in the application layer, especially the impact of DeepSeek on the surrounding ecosystem. A large number of customers have realized that a high-quality DeepSeek model can be deployed at a relatively low cost. So you can see that JD Cloud and other clouds are all being deployed. As a result, it also brought some opportunities.



In terms of technology and application, everyone will have greater actions. Technically, we will absorb the techniques from these latest papers, such as further strengthening training and utilizing techniques including distillation. The top models on the market are currently trying this comprehensive distillation to improve efficiency.



In terms of application, we are embracing DeepSeek itself and quickly integrating it into our entire AI product line, providing end-to-end services directly to customers. For example, we have been trying to use it to write live streaming scripts and marketing copy. But at the same time, some problems were also discovered, such as high hallucinations. In serious usage scenarios, there are still many areas that need to be fine tuned, such as using DeepSeek to generate equity copy. Sometimes, there may be some non-existent equity, which is unacceptable for merchants. For the entire B-end, the biggest challenge currently faced by the open-source DeepSeek in adaptation is the illusion, and we are trying various ways to reduce it.



The AI to B market is not just about technology



Tiger Sniff: What do you think is the competition situation in the AI to B market this year?


He Xiaodong: Actually, both AI to C and AI to B are what we want to do. AI to B can actually be divided into two levels, one called Model Service and the other called Software Service, with the former being a more fundamental service. This Model Service includes models, API interfaces, as well as DeepSeek cloud deployment mentioned earlier, and the development of higher-level intelligent applications. We are currently providing such capabilities, and in the near future we will provide a new generation of capabilities, such as API calls.



Software Service, It is an end-to-end product, such as some deep closed-loop AI products, including intelligent customer service, intelligent marketing, and digital human live streaming.



Through these two types of AItoB products, we can form a commercial closed loop, and I think everyone has the opportunity.



Tiger Sniff: It seems that JD's approach to AI has not changed much from before?


He Xiaodong: Yes, JD's strategy is quite firm. DeepSeek open source, for JD.com, can make some of the commercial models we previously wanted to do more efficient or cost-effective. However, the open source of DeepSeek will not fundamentally change JD's approach.



Tiger Sniff: Do you choose more MaaS or SaaS?


He Xiaodong: Both sides are advancing in parallel. The opportunities for MaaS are indeed different after DeepSeek is open sourced. Before the open source era, we often built MaaS on top of the cloud and used our own or industry's open source models. However, at that time, the industry's open source models generally had poorer performance, so our own model fees were generally higher. But the significant impact brought by the open source of DeepSeek is that our own paid model needs to differentiate from the free open source DeepSeek in order to close the business model loop.



So now the situation has changed, one is to compete on who has lower deployment costs, including lower costs or higher parallel reasoning techniques; The second approach is to make further adjustments based on DeepSeek, or to do reinforcement learning or distillation, to change its illusion problem, that is, to cultivate in specific industries and help enterprises build their own large models, so that the value of DeepSeek can be better presented. Both of these models are currently available to MaaS and are part of JD's technological accumulation that can be commercialized.



In the SaaS part, the base model is only a part of it, and overall efficiency, scenarios, and end-to-end experience are all core issues. This will lead to differentiation at the application layer, which will bring about new business models.



We are currently focusing more on AI customer service, AI marketing, and digital people in this area. Taking digital humans as an example, they are no longer just a multimodal combination of language models and other large models. It is a comprehensive product level competition, and businesses value AI driven final sales results. Just like in the Internet era, when the Internet has become an infrastructure, people are not competing for network speed but for different applications. At that time, JD was able to win not only due to technological factors, but also a series of factors such as overall terminal experience, operational methodology, live streaming traffic, and gameplay.



JD launches AI in three steps



Tiger Sniff: Did you have specific goals when you were doing AI?


He Xiaodong: As early as around 2020, we proposed a plan, and now looking back, we are actually following this plan. We talked about achieving a 'one platform three-stage rocket' at that time. A platform means that we need a basic large-scale model platform, and at that time, we had already seen the scale effect value of the model, so this was a must do. On this platform, we have planned a three-stage rocket: from AI intelligent customer service, to AI marketing, and then intelligent interactive media, referring to products such as digital humans.



AI customer service is actually a more concrete product, and we have been doing it well since 2020. At that time, we also started doing AI marketing, mainly to upgrade marketing with AI. We also hope that AI can bring some disruptive innovations to marketing, such as using AI voice for marketing promotion, relationship maintenance, and even as assistants and guides. In this field, we further see the value of intelligent interaction, including extending to new human-machine interfaces such as digital humans today. These directions were planned by us at that time, and we are still following them today.



Intelligent customer service is actually a relatively mature track, and there is still huge space for AI marketing in the future. Look at several foreign giants, many of their AI commercialization includes marketing. Intelligent interaction includes AIGC, which is not just about digital humans, but also AI generated graphics, AI generated audio and video. These technologies can support the emergence of many new apps and formats, which means it can redo social, content, search, and e-commerce. This imaginative space is enormous.



In fact, it is very similar to the Internet in those days. When the Internet first appeared, people thought that selling gateways was the most profitable. Later, the early business model was just sending emails. But if you look back, you will find that these are just the first dishes. Later, so many giant companies and trillion level companies hardly existed in the earliest Internet era 20 or 30 years ago.



Tiger Sniff: What is the proportion of JD's investment in AI in recent years?


He Xiaodong: We have been steadfastly investing in AI. The inspiration DeepSeek gave us is that our entire plan has accelerated, which is actually a dynamic competition. DeepSeek has made the competition around AI even more intense, and of course, there are more opportunities.



Tiger Sniff: Have you ever been indecisive in 2023?


He Xiaodong: In 2023, I was conflicted about how to allocate resources in terms of investment scale, priority, and time and space.



Tiger Sniff: Is there any correlation between your investment in AI and your commercialization?


He Xiaodong: Our larger projects are invested through special projects. JD Exploration Research Institute has many AI directions dedicated to long-term research, and the group has long-term confidence in AI.



Tiger Sniff: When you were planning JD's AI strategy, did you refer to any models from domestic and foreign companies?


He Xiaodong: In the early stage, we mainly conducted research and judgment based on the development trend of technology, and accumulated and broken through core technologies through the trend of technology. This responsibility was undertaken by students from JD Exploration Research Institute and Basic Algorithms.



In the second stage, we aim to establish a capability platform so that we can implement and generate product applications. The third stage is to strengthen the ecology, and we need to develop the technology ecology, product ecology, and service ecology. From what we are doing now, we are in the third stage. But we also see that having APIs alone is not enough for a platform. You need benchmark, demonstrative, and flagship products. On the one hand, these products themselves can bring profits, and on the other hand, these products can show benchmark cases to ecological partners, which can attract everyone to work together on ecology.



Tiger Sniff: What are your core predictions about the future of AI at this moment?


He Xiaodong: Firstly, synthetic data should become a trend, similar to our synthetic materials, becoming the foundation of the future. What we use now is mainly popular data, which is hundreds of billions or trillions of data on the Internet. What we care about is the intelligence in the data rather than the data itself. So when training models, we actually want to extract the intelligence from these data. But the intelligence that mass data can provide is approaching its limit. If we want to further improve the intelligence level of the model, we cannot rely on the intelligence in mass data, but need data with higher intelligence density. That's also why many models now use math Olympiad questions for training, and then distill and perform reinforcement learning through other models.



The synthesis of data and reinforcement learning, as well as the confrontation between models, are the key to the development of models in the future. On a deeper level, it is necessary to further enhance the intelligence level of these models, allowing them to enter the real world and take on tasks, interact, and engage in confrontation in the real world, in order to continue improving their intelligence level.



Tiger Sniff: What preparations has JD Exploration Research Institute made in these areas?


He Xiaodong: JD Exploration Research Institute has a three-step layout - from language modeling to multimodal intelligence, and then to embodied intelligence. This is actually the process of the entire AI moving towards AGI (General Artificial Intelligence) or ASI (Super Artificial Intelligence). That is to say, after expanding from language and cognitive intelligence to multimodal intelligence, the next step must be physical world intelligence.


Title: JD's hidden move, dormant for five years


Article link: https://www.huxiu.com/article/4171750.html


Read the original article: JD's hidden move has been dormant for five years

Miao Zhengqing

Miao Zhengqing


Likes Wang Wei and Bai Juyi, admires Ji Zha



Tiger Sniff Team


242 articles have been published in Tiger Sniff


If you have any objections or complaints about this article, please contact tougao@huxiu.com

People who are changing and want to change the world are all on the Tiger Sniff app




京东这步暗棋,蛰伏了五年

出品|虎嗅商业消费组

作者|苗正卿

题图|视觉中国

文章摘要
京东在AI领域重点发力B端市场,推出言犀大模型、数字人、智能体平台等产品,接入DeepSeek开源模型以降低成本。面对行业竞争,京东通过强化学习、合成数据等技术优化模型,解决幻觉问题,并布局三步走战略(语言大模型→多模态→具身智能),探索AI在客服、营销等场景的商业化路径。

• 技术突破:DeepSeek的深度推理和开源模式重塑行业格局,推动AI进入L2阶段

• 战略聚焦:京东重点布局B端AI工具,覆盖80万商家,发力客服、营销与数字人场景

• 开源冲击:DeepSeek开源打破闭源壁垒,倒逼企业探索差异化商业模式

• 三步路径:语言模型→多模态→具身智能,京东锚定AGI技术演进方向

• 应用攻坚:京东着力解决DeepSeek幻觉问题,提升B端场景的严肃应用可靠性

• 未来预判:合成数据与强化学习成关键,AI将深度融合实体世界任务交互

面对DeepSeek冲击,互联网大厂们正在重新调整AI策略。

3月初,腾讯元宝彻底拥抱DeepSeek,并借力DeepSeek完成热度反超。几乎与此同时,字节旗下火山引擎、飞书,阿里旗下国际站、阿里云、钉钉纷纷宣布接入DeepSeek。

一场互联网大厂的DeepSeek愈演愈烈,而这背后是各大厂针对AI的关键博弈。

在这波接入DeepSeek热潮中,京东也身处其中。2月初,京东云正式上线 DeepSeek - R1 和 DeepSeek - V3 模型,并率先在行业里推出DeepSeek一体机。

和腾讯、字节通过元宝、豆包发力AI to C市场不同,2024年京东在AI端重点发力言犀大模型、数字人、智能体平台、智能编码助手JoyCoder等B端产品。截至2024年底,京东平台上已经有80万商家使用了京东AI工具。

不过,摆在京东面前的也是更为激烈的2025年:随着字节、阿里纷纷在AI to B领域加注,以及百川、智谱等AI独角兽公司进一步转向B端市场,围绕AI to B的竞争进一步加剧。

DeepSeek到底改变了AI圈的哪些底层逻辑?to B市场到底有哪些关键机会?京东做好准备了吗?

在京东云宣布接入DeepSeek后,虎嗅第一时间与京东科技人工智能业务部总裁、京东探索研究院院长何晓冬进行了交流,请他分享了对于DeepSeek冲击、AItoB行业竞争以及AI技术趋势的一系列思考。值得注意的是,近日中国人工智能学会发布2024年度“吴文俊人工智能科学技术奖”公告,京东科技人工智能团队凭借“多模态交互式数字人关键技术及产业应用”项目荣获中国智能科学技术最高奖——吴文俊人工智能科学技术奖的特等奖,也是本年度唯一的特等奖。

下附交流实录,有删改:

DeepSeek重塑了商业格局

虎嗅:DeepSeek年初的这波热潮对你而言冲击比较大的地方是什么?

何晓冬:我觉得有两点很有意思,DeepSeek技术上做的深度推理以及它R1的开源模式。这两点深刻重塑了AI行业格局。

技术上讲,OpenAI 把API分了5个层次,就是L1到L5,第一层是偏人机对话语言理解这部分,第二层就开始涉及到深度推理。如果按照这个分类看,OpenAI发布的O1和DeepSeek发布的R1意味着我们正式从L1向L2阶段迈进了。

其实在2023年底,硅谷的技术圈就有说OpenAI在开发一个叫做Q-Star的强化学习算法项目为GPT-5做准备。当时我们只是听到这个名字,但一听到名字就能想到Q能力就是经典的强化学习。我们联想到,或许有点类似于AlphaGo这样的方式,就是会用强化学习配上搜索来做深度推理。只是当时我们觉得,围棋毕竟是一个固定域或者说封闭域,如果做到开放域,也就是大模型上的开放智能,其实是有很多技术挑战的。

但看得出来,2023年底,其实大家已经看到这个技术方向了,2024年7月o1就开始做出来了。2024年4月份的时候,我在清华大学参加一个AI人工智能论坛时,说了一个观点,未来可能有两个方向,一个是深度强化学习,就是用模型来训练模型,因为光靠人训练模型在GPT4已经快到顶了。第二个方向就是 “合成数据”,因为普通数据几乎用完了,下一步可能需要通过更加高效的合成数据来做这个事情。

可以说技术的进步确实挺快的,很多人虽然都看到了这个技术方向,但真正做出来的并不多。但对于未来大家应该更有信心了,尤其是模型开源和论文开源后。

另一个冲击或者说影响是,开源本身重塑了商业格局。比如OpenAI就首先受到了冲击,它的商业模式受到很大影响。但说到底,开源是一个非常好的事情,它降低了模型的成本,打开了应用的空间,大小公司都可以更低成本去运用AI能力构建应用,这会带来生态的百花齐放。

京东也在考虑怎么利用这个机会,包括我们也快速把DeepSeek上到了京东云,给我们大小客户提供这个能力,我们还提供私有化部署DeepSeek的能力,包括面向行业的解决方案。

虎嗅:当刚才你也说到,其实从23年底大家都在讨论或者说都看到了一些趋势,但为什么做出来的只有这么一家公司?

何晓冬:2024年上半年,很多人还是把精力放在了“大”上,比如追求更大的模型、追求更多的卡,搞万亿级或者10万亿级规模,大家只看到了所谓的尺寸Scale,并没有真正深度认识到推理的重要性。

到了2024年下半年,大家看到了推理的重要性。DeepSeek是一个非常纯粹的技术公司,他们基于这种纯粹性更能够看到技术本身的深刻性,所以在推理这些方面投入会更加坚决。

虎嗅:DeepSeek引发的另一个热议话题就是开源,好像一年前乃至于2024年四季度,大模型圈对于开源闭源的判断并不像今天这样,为什么DeepSeek坚决走这条路?

何晓冬:回到一年前,只有OpenAI的技术领先且能比别人拉开一个身位,它不开源,所以大家会觉得开源的意义没有那么大。其实一年多前,Meta的Llama模型也在开源,但它的效果确实比OpenAI差一些。以及,大家会觉得,闭源或许是一个可以成立的商业模式。

DeepSeek为什么对于开源这件事影响这么大呢?我觉得是因为它用一个很高质量的模型冲击了OpenAI的护城河,大家会觉得开源生态能够建起来了。所以很多时候,关键公司、关键的人的一些影响,会改变大家对一些问题的根本性看法。

虎嗅:开源的想象力空间打开后,对于京东在做的事情有哪些影响?

何晓冬:AI的技术竞赛会持续下去,这意味着需要进一步投入。我们不能停也不会停。从京东做大模型的实践中,我们看到在应用层确实现出现了很多机会,尤其是DeepSeek带来的周边生态影响。大量客户都意识到可以用相对低廉的价格部署一个高质量的DeepSeek模型。所以你看京东云以及其他的云,都在部署。随之而来的,也带来了一些机会。

在技术和应用上,大家都会有更大的动作。在技术上,我们会吸收这些最新的论文里面的技术,比如进一步做强化训练、进一步使用包括蒸馏在内的技术。市面上现在头部的模型,都在尝试这种综合性蒸馏以提高效率。

在应用上,我们在拥抱DeepSeek本身,快速接入到我们的AI全系产品中,直接给客户提供端到端的服务。比如用它来写直播文稿、营销文案,我们一直在尝试。但同时也发现了一些问题,比如幻觉太高。在严肃使用的场景里,需要我们微调的地方还有很多,比如用DeepSeek生成权益文案,有时候会出现一些并不存在的权益,对于商家而言这是不可接受的。对于整个B端而言,开源的DeepSeek目前在适配中存在的最大挑战就是幻觉,我们在想各种办法把幻觉降下来。

AI to B市场拼的不只是技术

虎嗅:你觉得在AI to B市场,今年的竞争情况是怎样的?

何晓冬:其实AI to C和AI to B都是我们想做的。AI to B其实可以分为两个层次,一个叫做Model Service,一个叫做Software Service,前者是更加基础的服务。这个Model Service包括了模型、API接口,以及刚才提到的DeepSeek上云,以及更上层智能应用的开发。我们目前就在提供这样的能力,不久的将来我们会提供新一代的能力,比如API调用等。

Software Service,是一种端到端的产品,比如一些深度闭环的AI产品,包括智能客服、智能营销,还有数字人直播,就是这样的产品。

通过这两类AItoB产品,我们可以形成商业化闭环,我觉得大家都有机会。

虎嗅:好像和之前京东在AI上的思路没啥变化?

何晓冬:对,京东的战略是比较有定力的。DeepSeek开源,对于京东而言,可以让我们之前想做的一些商业化模式有更高的效率或者说更低的成本。但是DeepSeek开源并不会改变京东根本性的思路。

虎嗅:你们是选择MaaS多一些还是SaaS多一些?

何晓冬:这两边都在并行推进。在DeepSeek开源后,MaaS的机会确实不太一样。开源之前,我们做MaaS往往基于云之上,然后采用自己或者业界的开源模型,但当时业界的开源模型一般效果会差一点,所以自己的模型收费一般更高一些。但DeepSeek开源后带来的较大的冲击是,我们自己的收费模型需要和免费的开源DeepSeek做出差异化,才能让商业模式闭环。

所以现在的情况变了,一种是拼部署成本谁更低,包括了更低的成本或者更高的并行推理技术;第二种是在DeepSeek基础上做进一步的微调,或者说做强化学习、做蒸馏,改变它的幻觉问题,也就是在特定行业上做培育,帮助企业构建自己的大模型,这样DeepSeek的价值可以更好地呈现。这两个模式,都是当下MaaS可以做的,也是京东有技术积累,能够商业化的部分。

在SaaS部分,基座模型只是其中一部分,整体效率、场景、端到端体验都是核心问题。这就会衍生出在应用层的差异化,由此会带来新的商业模式。

这一部分我们现在做的比较多的是AI客服、AI营销以及数字人。以数字人为例,现在数字人已经不只是语言大模型和其他大模型的多模态结合了。它是一个综合性的产品级比拼,商家看重的是AI带动最终的销售效果。就好比在互联网时代,当互联网已经变成了基础设施,大家拼的就不是网速而是不同的应用了。而当时京东能够胜出,除了有技术因素,也包括了整体终端的体验、运营方法论、直播流量与玩法等一系列因素。

京东发力AI分了三步走

虎嗅:你们做AI时,当时有具体的目标吗?

何晓冬:早在2020年左右,我们提出过一个规划,现在回看我们其实还是按照这个规划走的。我们当时讲要做到“一个平台三级火箭”。一个平台是指,我们需要一个基础大模型平台,当时,我们已经看到模型的规模效应价值了,所以这个是必须要做的。而在这个平台上,我们规划了三级火箭:从AI智能客服,到AI营销,然后智能交互媒体,指的是数字人等产品。

AI客服其实是一个比较具象的产品,在2020年我们就做好了。当时我们也开始做AI营销,主要是想用AI把营销升级。我们也希望AI可以给营销带来一些颠覆性创新,比如用AI语音去做营销推广、关系维护,甚至做助理和导购。在这个领域,我们进一步看到了智能交互的价值,包括延展到今天数字人这样新型的人机的界面。这几个方向是我们当时规划好的,我们今天其实也依然按照这几个方向在做。

智能客服其实是一个比较成熟的赛道,AI营销未来还有巨大的空间。你看国外的几个巨头,它们AI的商业化很多都包含营销。智能交互包含AIGC,这里面不只是数字人,还有AI生图、AI生音视频,这些技术又能支撑很多新的APP和业态出现,也就是说,它可以把社交、内容、搜索、电商都重新做一遍。这个想象空间是巨大的。

它其实很像当年的互联网,互联网刚出现时,大家一窝蜂觉得卖网关最赚钱,后来出现的早期商业模式也只是发邮件。但你回看,会发现这些只是前菜,后来那么多巨头公司、万亿级别公司,在二三十年前最早的互联网时代几乎都不存在。

虎嗅:这几年京东在AI投入上的比例是怎样的?

何晓冬:我们一直在坚定的投入AI。 DeepSeek给我们的启示是,我们整个计划提速了,这其实是一个动态竞争,DeepSeek让围绕AI的竞争更加激烈了,当然机会也更多。

虎嗅:2023年的时候,你们纠结过吗?

何晓冬:2023年的时候纠结过,包括投入规模、以及先后次序、时间空间上到底怎么分配资源。

虎嗅:你们在AI上的投入,和你们实现商业化之间有一些关联吗?

何晓冬:我们比较大的项目是通过专项去投入。京东探索研究院里有很多致力于长期研究的AI方向,集团层面对AI是有长期信心的。

虎嗅:你们当时规划京东的AI战略时,参考了什么国内外公司的样板吗?

何晓冬:前期主要是通过技术的发展趋势做研判,通过技术趋势做核心技术的积累与突破,这个是由京东探索研究院以及基础算法的同学承担这个责任。

第二个阶段,我们是想形成一个能力平台,这样才能够落地生成产品化应用。第三个阶段则是强化生态,我们要把技术生态、产品生态、服务生态都做起来。从我们现在做的事情来看,我们处于第三个阶段。但我们也看到,平台光有API是不够的,你需要有标杆式的示范性、拳头产品。一方面这些产品本身可以带来利润,另一方面这些产品可以让生态伙伴看到标杆性案例,才能吸引大家来一起做生态。

虎嗅:此刻你对AI未来,有哪些核心预判?

何晓冬:首先是合成数据应该会成为趋势,它会类似我们合成材料一样,成为未来的基础。我们现在用的主要是大众数据,就是互联网上几千亿、几万亿的数据,我们关心的其实是数据里面的智力而不是数据本身,所以在训练模型时我们实际上是要把这些数据之中的智力榨取出来。但大众数据能够提供的智力已经快到边界了,接下来如果我们想进一步提升模型智力水平,不能依靠大众数据里面的智力了,而是需要更高智力密度的数据。这也是为什么现在很多模型用数学奥赛题做训练,然后再通过其他模型蒸馏、做强化学习。

合成数据和强化学习,模型与模型之间的对抗,是接下来模型发展的关键。而更深的一层是,要让这些模型的智力水平进一步提升让其走入真实世界,并在真实世界接任务、进行交互与对抗,继续提高智力水平。

虎嗅:在这些方面,京东探索研究院做了哪些准备工作吗?

何晓冬:京东探索研究院有一个三步台阶的布局——从语言大模型到多模态智能再到具身智能。这其实是整个AI走向AGI(通用人工智能)或者ASI(超级人工智能)的过程。也就是说,从语言和认知智能扩展到多模态智能后,下一步一定是实体世界智能。

文章标题:京东这步暗棋,蛰伏了五年

文章链接:https://www.huxiu.com/article/4171750.html

阅读原文:京东这步暗棋,蛰伏了五年_虎嗅网
如对本稿件有异议或投诉,请联系tougao@huxiu.com
正在改变与想要改变世界的人,都在虎嗅APP


Read0
share
Write a Review...
推荐文章
1  /  177