How are you using RNAseq foundation models?

asaravia · May 19, 2026, 5:25pm

Hi AWG members,

As many of you may know, there are several bulk and single-cell RNAseq foundation models (FMs) available, such as BulkRNABert, MOJO, BulkFormer, CellWhisperer, scFoundation, and scGPT. I am curious how many of you are using RNAseq FMs in your research. If you are using these FMs or others, please let me know in the comments and tell me how you are using them (i.e., what questions you are asking and what data or information you are getting out of them).

Thanks!

@AWGall

Michelle · May 19, 2026, 6:22pm

Hello, I’m using the GPT-5.5 model. Since I’m studying Behavioral Neuroscience and nutraceuticals, I ask it this type of question: What is the vitamin precursor to serotonin? Could you provide the sources you’ve verified? It’s necessary to verify the sources to ensure they are correct.

Michelle · May 19, 2026, 6:31pm

How does scGPT work?

asaravia · May 19, 2026, 6:38pm

Thanks, @Michelle, so in your example you’re using the GPT-5.5 large language model (as opposed to a specific RNAseq model). Am I understanding your use case correctly?

AliReza-H · May 20, 2026, 9:08am

Hi @asaravia

I use foundation models in ophthalmology, mainly RETFound.

It’s trained on a huge number of retinal images using a self-supervised learning approach, and it picks up on patterns that are not obvious to us! But for some reason it outperforms classical models on many tasks.

So when I want to do something with retinal images that has not been studied or discovered yet, I simply pass the image through the RETFound encoder and collect the outputs as embeddings.

Then I work with them directly. For example, I ask what the biological age of this retina is, or what the heart and kidney status of the person with this retinal image is!
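
As a rough illustration of the "encode once, then work with the embeddings" workflow described above, here is a minimal sketch that predicts a numeric label (such as age) from frozen embeddings with a k-nearest-neighbour average. Everything in it is an assumption for illustration: the embeddings are toy 2-D vectors rather than real RETFound outputs, the ages are made up, and `knn_predict` is a hypothetical helper, not part of any RETFound API.

```python
import math
from typing import Sequence


def knn_predict(
    train_emb: Sequence[Sequence[float]],
    train_y: Sequence[float],
    query: Sequence[float],
    k: int = 3,
) -> float:
    """Average the labels of the k training embeddings closest to `query`."""
    if not 1 <= k <= len(train_emb):
        raise ValueError("k must be between 1 and the number of training samples")
    # Rank training samples by Euclidean distance to the query embedding.
    ranked = sorted(
        range(len(train_emb)),
        key=lambda i: math.dist(train_emb[i], query),
    )
    nearest = ranked[:k]
    return sum(train_y[i] for i in nearest) / k


# Toy stand-ins for encoder outputs (hypothetical values, not real embeddings).
train_embeddings = [[0.0, 0.1], [0.1, 0.0], [0.9, 1.0], [1.0, 0.9]]
train_ages = [30.0, 32.0, 70.0, 72.0]

# Embedding of a new, unlabeled image (also a made-up value).
predicted_age = knn_predict(train_embeddings, train_ages, [0.95, 0.95], k=2)
print(predicted_age)
```

In practice the encoder stays frozen and only this small downstream model is fit, so a linear or k-NN probe from a library such as scikit-learn would be a more typical choice than hand-written code.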

ManishSharma · May 20, 2026, 1:52pm

Hi @asaravia, thanks for starting this conversation!

Yes, we have been experimenting with scGPT and BulkRNABert in our work. We are primarily using scGPT for cell type annotation on single-cell data, and the main question we are asking is whether the model can reliably identify rare cell populations without us having to manually define marker genes each time. The results are promising, but we do find it needs fine-tuning on our specific tissue context to get reliable outputs.

With BulkRNABert we are exploring whether we can get meaningful embeddings across samples from different cohorts and use those for downstream clinical outcome prediction. So far the embeddings are quite useful for clustering samples but the actual prediction tasks need more labeled data to fine-tune properly.
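
To make the clustering step concrete, here is a minimal sketch of grouping per-sample embeddings with a plain k-means loop. It is only an illustration under stated assumptions: the vectors are toy 2-D numbers rather than real BulkRNABert embeddings, `kmeans` is a hypothetical helper written for this example, and the initialization simply takes the first `k` points so the result is deterministic.

```python
import math
from typing import List, Sequence, Tuple


def kmeans(
    points: Sequence[Sequence[float]], k: int, iterations: int = 20
) -> Tuple[List[int], List[List[float]]]:
    """Cluster points with basic k-means; returns (labels, centroids)."""
    if not 1 <= k <= len(points):
        raise ValueError("k must be between 1 and the number of points")
    # Deterministic initialization (assumption): use the first k points.
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Toy stand-ins for sample embeddings from two groups (made-up values).
sample_embeddings = [
    [0.0, 0.0], [5.0, 5.0], [0.2, 0.1],
    [5.1, 4.9], [0.1, 0.3], [4.8, 5.2],
]
labels, centroids = kmeans(sample_embeddings, k=2)
print(labels)
```

With real cohorts it is worth checking whether the clusters track biology or simply track cohort of origin, since batch effects can dominate embedding space.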

We have not worked with CellWhisperer or scFoundation yet but they are on our list. Would be very curious to hear from others who have used them, especially on non-human datasets since that is something we are moving toward.

What tissue types or organisms are you working with? That would help understand which FM makes the most sense for your use case.
