Methods Resources

Learning resources for ML methods in science.

Deep Learning Fundamentals

Neural Networks: Zero to Hero

github.com/karpathy/nn-zero-to-hero 

Andrej Karpathy’s tutorial series: the best starting point for understanding neural networks.


Graph Neural Networks

| Resource | Description |
| --- | --- |
| DeepChem Tutorials | GNNs for molecular property prediction |
| PyTorch Geometric Tutorials | General GNN tutorials |

Transformers & Language Models

| Resource | Description |
| --- | --- |
| Transformers for Chemistry | LLMs in chemistry and materials |
| awesome-scientific-language-models | Comprehensive list of scientific LLMs |

Scientific Language Models

General Scientific LLMs

| Model | Description | Size | Links |
| --- | --- | --- | --- |
| SciBERT | BERT model for scientific text | Base | Paper / GitHub |
| Galactica | Large Language Model for Science | 125M-120B | Paper |
| DARWIN | Domain-specific models for natural science | 7B | Paper |
| SciGLM | Scientific instruction-tuned model | 6B | Paper |
| INDUS | Efficient language models for science | 38M-125M | Paper |

Document Representation

| Model | Description | Links |
| --- | --- | --- |
| SPECTER | Document representation using citations | Paper |
| SciNCL | Contrastive learning for scientific documents | Paper |
| SciMult | Multi-task contrastive learning (138M) | Paper |

Chemistry & Materials LLMs

| Model | Description | Size | Links |
| --- | --- | --- | --- |
| ChemBERT | Chemical reaction extraction | Base | Paper |
| MatSciBERT | Materials domain language model | Base | Paper |
| BatteryBERT | Battery database enhancement | Base | Paper |
| ChemDFM | Chemistry dialogue foundation model | 13B | Paper |
| ChemLLM | Chemical large language model | 7B | Paper |
| LlaSMol | Chemistry instruction tuning dataset | 6.7B-7B | Paper |
| KALE-LM | Knowledge-enhanced science model | 8B | Paper |

Molecule-Language Models

| Model | Description | Size | Links |
| --- | --- | --- | --- |
| Text2Mol | Cross-modal molecule retrieval | - | Paper |
| KV-PLM | Molecule structure-text bridge | - | Paper |
| MolT5 | Molecule-language translation | 60M-770M | Paper |
| MoleculeSTM | Multi-modal structure-text model | - | Paper |

Math & Reasoning Models

| Model | Description | Size | Links |
| --- | --- | --- | --- |
| MathBERT | Pre-trained for mathematics education | Base | Paper |
| Minerva | Solving quantitative reasoning problems | - | Paper |
| WizardMath | Mathematical reasoning via reinforcement | 7B-70B | Paper |
| MAmmoTH | Math generalist through hybrid tuning | 7B-70B | Paper |
| MetaMath | Bootstrap mathematical questions | - | Paper |
| ToRA | Tool-integrated reasoning agent | 7B-70B | Paper |
| Llemma | Open math language model | 7B-34B | Paper |
| DeepSeekMath | Mathematical reasoning limits | 7B | Paper |
| InternLM-Math | Verifiable reasoning via Lean 4 | 7B-20B | Paper |

Table Understanding

| Model | Description | Size | Links |
| --- | --- | --- | --- |
| TAPAS | Weakly supervised table parsing | Base | Paper |
| TaBERT | Table and natural language fusion | Base | Paper |
| TAPEX | Neural SQL executor pre-training | 140M-406M | Paper |
| OmniTab | Few-shot table QA with synthetic data | 406M | Paper |
| TableLlama | Open generalist table models | 7B | Paper |

Physics & Astronomy

| Model | Description | Size | Links |
| --- | --- | --- | --- |
| astroBERT | Language model for astronomy | Base | Paper |
| AstroLLaMA | Specialized astronomy foundation model | 7B | Paper |
| PhysBERT | Physics scientific literature embeddings | Base | Paper |

Multimodal (Vision + Language)

| Model | Description | Size | Links |
| --- | --- | --- | --- |
| G-LLaVA | Multi-modal geometry problem solving | 7B-13B | Paper |

Generative Models

| Resource | Description |
| --- | --- |
| AI4Chemistry Course | Includes generative models for chemistry |

Scientific Machine Learning (SciML)

Neural Differential Equations

| Paper/Method | Description | Links |
| --- | --- | --- |
| Neural ODEs | Continuous-depth neural networks | Paper |
| Universal Differential Equations | Combining DEs with ML | Paper / Code |
| Hamiltonian NNs | Physics-preserving neural networks | Paper / Code |
| Neural CDEs | Neural controlled differential equations | Paper / Code |

Physics-Informed Neural Networks

| Paper/Method | Description | Links |
| --- | --- | --- |
| PINNs | Physics-informed neural networks | Paper / Code |
| DeepONet | Deep operator networks | Paper / Code |
| Fourier Neural Operator | Learning in Fourier space | Paper |
| SINDy | Sparse identification of dynamical systems | Paper |
| NVIDIA PhysicsNeMo | Framework for physics-ML models | github |
| PINA | Physics-informed networks in PyTorch | github |

SciML Software

Julia

| Package | Description | Link |
| --- | --- | --- |
| DifferentialEquations.jl | Comprehensive DE solving | docs |
| DiffEqFlux.jl | Neural DEs in Julia | docs |
| NeuralPDE.jl | Physics-informed neural networks | docs |

Python

| Package | Description | Link |
| --- | --- | --- |
| torchdiffeq | PyTorch neural ODEs | github |
| torchdyn | PyTorch neural DEs library | github |
| diffrax | JAX-based differential equations | github |
| DeepXDE | Deep learning for scientific computing | github |
| pysindy | Python sparse identification | github |
| NeuroMANCER | Neural modules for control | github |
| SciANN | TensorFlow physics-informed NNs | github |

SciML Books & Courses

| Resource | Author | Link |
| --- | --- | --- |
| Parallel Computing and SciML | Chris Rackauckas | book.sciml.ai |
| Data-Driven Science and Engineering | Brunton & Kutz | Cambridge |

Video Channels

| Channel | Focus |
| --- | --- |
| Steve Brunton | Data-driven methods, SINDy |
| Physics Informed ML | PINNs tutorials |

AI Tools for Research

Document Parsing & Processing

| Tool | Description | Link |
| --- | --- | --- |
| MinerU | SOTA document parsing (1.2B params) | github |
| Docling | Multi-format conversion with layout reconstruction | IBM |
| Nougat | Academic document understanding | github |
| GROBID | Metadata extraction using ML | github |
| Marker | PDF to Markdown/JSON conversion | github |
| PaperQA2 | High-accuracy RAG for PDFs with citations | github |

Paper-to-Code & Reproducibility

| Tool | Description | Link |
| --- | --- | --- |
| AutoP2C | LLM agent generating repos from papers | arxiv |
| ResearchCodeAgent | Multi-agent codification system | arxiv |

Research Agents

| Agent | Description | Link |
| --- | --- | --- |
| The AI Scientist | Autonomous research system | arxiv |
| ChemCrow | Chemistry research agents | arxiv |
| BioDiscoveryAgent | Biological discovery automation | github |
| ToolUniverse | 600+ scientific tools (Harvard) | github |

Literature & Knowledge Management

| Tool | Description | Link |
| --- | --- | --- |
| Semantic Scholar | AI-powered academic search | semanticscholar.org |
| OpenAlex | Open scholarly papers catalog | openalex.org |
| Research Rabbit | Literature discovery platform | researchrabbit.ai |
| Jupyter AI | JupyterLab AI extension | github |

Awesome Lists

| List | Focus |
| --- | --- |
| awesome-ai-for-science | AI tools for scientific research |
| awesome-learning-digital-chemistry | General learning resources |
| awesome-scientific-language-models | Scientific LLMs |
| awesome-scientific-machine-learning | SciML resources |
| awesome-pinn | Physics-informed neural networks |