Introduction

Foundry-ML is a Python library that simplifies access to machine learning-ready datasets in materials science and chemistry.
Features
Search & Discover - Find datasets by keyword or browse the catalog
Rich Metadata - Understand datasets before downloading with detailed schemas
Easy Loading - Get data in Python, PyTorch, or TensorFlow format
Automatic Caching - Fast subsequent access after first download
Publishing - Share your own datasets with the community
AI Integration - MCP server for AI assistant access
CLI - Terminal-based workflows
Quick Example
Installation
For cloud environments (Colab, remote Jupyter):
What's Next?
Getting Started
User Guide
Features
Project Support
This work was supported by the National Science Foundation under NSF Award Number: 1931306 "Collaborative Research: Framework: Machine Learning Materials Innovation Infrastructure".
Foundry brings together components from:
Last updated
Was this helpful?