- PhD Level
- Posts
- Biggest Gene Data Drop Ever! š§¬
Biggest Gene Data Drop Ever! š§¬
Daily news that is actually intellectually stimulating.

PhD Level: Daily Curated Tech News for Entrepreneurs, by Entrepreneurs
Dear reader,
If we want to merge AI and biotech, we need data. And freshly last week, the biggest gene data has dropped!
Letās get into it ā
Xaira Drops Massive Perturbāseq Dataset: XāAtlas/Orion

AI-driven biotech firm Xaira Therapeutics has released XāAtlas/Orion, the largest publicly available genome-wide Perturbāseq dataset. Built using its scalable FiCS platform, the resource includes 8 million single cells across all human proteinācoding genes, with deep sequencing (~16,000 UMIs per cell) and dose-dependent perturbation measurementsāmarking an unprecedented scale for genetic effect profiling
My take: If you havenāt followed Bo Wang, you need to follow Bo Wang. He was the guys I have been following and he did a great job releasing scGPT (single-cell GPT) and more works on this. Xaira is just a tip of the iceberg. This is a game-changerāby democratizing vast, high-resolution data and embracing dose-response dynamics, XāAtlas/Orion empowers both AI researchers and wetālab scientists to simulate and prioritize experiments with unparalleled precision.
Takeaways
Why It Matters
Scale & open access: Vast data enabling the development of āvirtual cellā AI models
Nuanced insight: Moves beyond binary gene edits by capturing continuous, dose-dependent changes
Boosts drug discovery: Models built on this dataset can simulate gene behavior under varied conditionsāaccelerating target identification and reducing lab workload
Read more: Source
Did you find this news intellectually stimulating? |
Some affiliate links we endorse:
Stay curious,
The PhDLevel Team
āļøš» Powered by caffeine & curiosity