News Center

location: Home > News Center > News > Content

AIR Research | Large Models Achieve Zero-Shot Single-Cell Identity Recognition

Source:       Time:2024-05-24

The team of Professor Nie Zaiqing from AIR, in collaboration with the Tsinghua-affiliated startup ShuiMu Molecule, has developed a large model for single-cell identity understanding called LangCell. This model provides a unified representation of single-cell data and natural language, marking the first model capable of annotating new cell types without the need for labeling.

In addition, LangCell significantly improves performance in various tasks related to cell identity understanding, including batch correction, classification of disease subtypes, and identification of cellular pathways. Even without the use of textual information, the model's incorporated cell encoder module achieves optimal performance across these tasks.
Moreover, LangCell has constructed a cell-natural language text dataset, scLibrary, which comprises approximately 27.5 million entries covering eight dimensions of descriptive information, including cell types, developmental stages, tissues and organs, and diseases, making it a veritable "encyclopedia of cells." The related paper has been accepted for presentation at ICML 2024, and associated work is now open-sourced on GitHub ( GitHub link), allowing researchers and medical professionals worldwide to utilize LangCell for research and exploration.
Paper Link: https://arxiv.org/abs/2405.06708  
Read More: https://air.tsinghua.edu.cn/info/1007/2247.htm


上一条:AI in Wuxi! Tsinghua AIR's Wuxi Innovation Center Established 下一条:AIR Research | AIR Creates a Virtual Hospital, Enabling AI Doctors to Self-Evolve

关闭

Relevant news

Email:Airoffice@air.tsinghua.edu.cn
Tel:(010)82151160  

Address:12 / F, block C, Qidi science and technology building, Tsinghua Science and Technology Park, Haidian District, Beijing

wechat

Jing ICP Bei No. 15006448 | all rights reserved@ Institute of intelligent industry, Tsinghua University