SenseTime, a Chinese AI firm greatest recognized for its facial recognition know-how, launched a brand new open supply mannequin on Tuesday that it claims can each generate and interpret photographs far quicker than prime fashions developed by US opponents. SenseNova U1 may assist the corporate reclaim misplaced floor after it slipped from its place among the many main gamers in China’s AI improvement race.
The mannequin’s secret sauce is its skill to “read” photographs with out translating them to textual content first, dashing up the method and lowering the quantity of computing energy required. “The model’s entire reasoning process is no longer limited to text. It can reason with images as well,” Dahua Lin, cofounder and chief scientist at SenseTime, mentioned in an interview with WIRED.
Lin, who can be a professor of knowledge engineering on the Chinese University of Hong Kong, says that fashions able to processing photographs instantly will allow robots to raised perceive the bodily world sooner or later.
Like DeepSeek’s newest flagship mannequin, SenseTime says U1 may be powered by Chinese-made chips. “Several Chinese domestic chipmakers have finished optimizing compatibility with our new model,” Lin says. On launch day, 10 Chinese chip designers, together with Cambricon and Biren Technology, introduced their {hardware} helps U1.
That flexibility issues as a result of US export controls limit Chinese companies from accessing the world’s most superior AI chips, notably these used for coaching, which at this level are primarily developed by Western corporations like Nvidia. “We will continue to push for training on more different chips,” Lin says. But he additionally acknowledges that SenseTime “may still need to use the best chips to ensure the speed of our iteration.”
SenseTime launched U1 without cost on Hugging Face and GitHub, one other signal of how Chinese corporations have gotten among the most lively contributors to open supply AI.
SenseTime was based in 2014 and have become a world chief in pc imaginative and prescient, which is utilized in functions like facial recognition and autonomous driving. But when ChatGPT and different AI programs powered by pure language processing turned the most well liked factor within the tech business, SenseTime started struggling to show a revenue and fell behind newer Chinese startups like DeepSeek and MiniMax.
SenseTime says it hopes that releasing SenseNova-U1 publicly for anybody to make use of will assist it meet up with each home and Western AI gamers. Lin says the corporate lastly made the choice final yr to concentrate on open supply due to the useful suggestions it will get from researchers, which allows the corporate to iterate quicker. “In this day and age, being open source or closed source is not the winning factor; the speed of iteration is,” Lin explains.
Going open supply additionally helps SenseTime proceed collaborating with worldwide researchers with out the interference of geopolitics. The firm has been sanctioned repeatedly by the US authorities in recent times over allegations that its facial recognition know-how helped energy surveillance programs used to observe and detain Uyghurs and different minority teams in China’s Xinjiang area. As a outcome, US companies are restricted from investing in SenseTime and promoting sure applied sciences to it with out a license. (SenseTime has denied the allegations.)
Seeing Clearly
In an accompanying technical report, SenseTime claims that SenseNova-U1 generates higher-quality photographs than all different open supply fashions at the moment in the marketplace. Its efficiency is akin to main Chinese closed supply fashions like Alibaba’s Qwen and ByteDance’s Seedream, nevertheless it nonetheless lags behind business leaders like GPT-Image-2.0, which got here out only a week in the past.
But the mannequin’s fundamental promoting level is its skill to generate photographs a lot quicker than all of these fashions. It depends on an modern technical construction referred to as NEO-Unify that SenseTime previewed earlier this yr.
https://www.wired.com/story/chinese-ai-giant-sensetime-is-running-its-new-model-on-chinese-chips/