Login
Sign Up
Woofun AI reports that Yingmou Technology, a Chinese firm specializing in AI-driven 3D generation, has secured a new financing round worth hundreds of millions of yuan, led by Kaihui Capital and the Shanghai SDIC Lead Fund. This capital injection coincides with the official release of the company's Hyper3D Rodin Gen-2.5 model, a system capable of generating tens of millions of polygons and adopted by industry giants including NVIDIA. While the broader AI sector debates the abstract definition of 'world models,' Yingmou's leadership, including CEO Wu Di, maintains a pragmatic stance, distinguishing between mere visual environments and true embodied intelligence that allows for the manipulation of physical objects. Wu Di argues that a genuine world model must facilitate control and interaction, requiring assets that can be disassembled, edited, and integrated into rendering engines rather than simply viewed on a screen. The company's strategy addresses the critical question of object origin, ensuring that items like game monsters or industrial parts are not just visual demonstrations but functional production assets.
The newly launched Hyper3D Rodin Gen-2.5 represents a structural shift in 3D generation technology by applying the 'think-first, generate-later' methodology previously exclusive to large language models. Unlike traditional approaches that rely on synthesizing multi-view images to reconstruct geometry—a process prone to noise, imperfections, and visual distortions when views are discontinuous—this new model dynamically allocates computational resources based on object complexity. Data compiled by Woofun AI shows that in the first month following its launch, Rodin Gen-2.5 achieved a 400% increase in both subscription user growth and revenue. This metric underscores a market preference for functional utility over mere visual fidelity; while images and videos serve viewing purposes, the 3D industry demands assets that can be textured, modified, and handed off to external teams for further processing. The model's ability to generate production-ready assets in varying timeframes—4 seconds for rough drafts, 20 seconds for balanced detail, and 80 seconds for high-precision models with microstructures like skin pores and scales—demonstrates a sophisticated understanding of resource efficiency.
Prior to August 2024, the industry faced a significant bottleneck where AI-generated 3D tools were largely confined to the prototype stage, unsuitable for core sectors like 3A gaming and film post-production due to geometric flaws. Yingmou Technology's team broke this deadlock by introducing the native large-scale model framework CLAY, detailed in a groundbreaking paper presented at SIGGRAPH. This technical evolution allows the model to determine the complexity of generated results dynamically, mirroring the inference process of large language models but applied to geometric structures and material details. Zhang Qixuan, the CTO of Yingmou Technology, noted that while previous iterations remained at the preview stage, Rodin Gen-2.5 has enabled game studios to utilize the Extreme-High mode for high-quality generation. For instance, artists can now model slightly raised scales on a monster in high detail and combine them with low-polygon models, ensuring seamless integration into subsequent production steps. The model's 12K native 3D texture generation capability further addresses the 'skin' aspect of 3D assets, utilizing a method that freezes surface light fields to eliminate projection distortions and blind spots common in multi-view projection techniques.
The technical advantages of this approach manifest in three distinct areas: full 3D consistency, physical realism, and superior quality. Materials and textures are rendered uniformly across the entire space without missing details, while the system offers perfect support for physically based PBR materials, including metallicity, roughness, highlights, and normals. With 12K resolution, the generated geometry and textures rival expensive real-scene scans, providing ample room for post-production adjustments. When combined with the Thinking Effort design framework, the system can generate a complete asset with high-precision geometry and 12K textures in as little as 5 seconds. In a capital market environment that prioritizes paying customers over user counts, Yingmou Technology has adopted a B2B-focused strategy similar to Anthropic. Approximately 80% of the company's revenue is derived from overseas markets, with North America serving as the primary hub. The revenue distribution between B2B and C2B professional users stands at roughly 4:6, with B2B customer revenue exceeding that of all competitors in the sector and a customer retention rate nearing 100%.
Industry validation for Hyper3D's technology is evident through integrations with major platforms including Unity AI Beta, OctaneRender, Canva, and Figma. The company's operational structure reflects the broader trend of youthfulness and agility in the AI sector, with a team of around 60 employees, including interns, where the technical team comprises two-thirds of the workforce. Many core algorithm team members have collaborated since their undergraduate days, resulting in minimal turnover; only three core algorithm experts have departed, joining NVIDIA, Disney's R&D department, or Tencent. This stability allows the team, largely born in the 2000s and originating from the MARS laboratory at ShanghaiTech University, to maintain a singular focus on ambitious goals. Wu Di established the laboratory in 2016 after joining the university in 2015, and the team's trajectory has consistently expanded from scanning the real world and creating digital humans to generating individual objects and now entire scenes. Despite the small size of the organization, their consistent direction over the past few years has positioned them as a critical player in the emerging landscape of world models.
As the AI industry continues to re-examine the concept of world models, with various entities focusing on environment generation, game creation, or embodied intelligence, no definitive answer exists regarding the true nature of these systems.
However, the path forward requires the creation of real, controllable elements within a digital world before such a world can be entered, interacted with, and manipulated. Yingmou Technology's success in securing hundreds of millions of yuan and achieving a 400% revenue surge highlights the market's readiness for production-grade 3D assets. The company's ability to bridge the gap between visual demonstration and industrial application marks a pivotal moment for the sector. This development suggests that the future of world models will be built not on abstract definitions, but on the granular, functional components that Yingmou has successfully commercialized.