Introduction

Foundry

Foundry-ML is a Python library that simplifies access to machine learning-ready datasets in materials science and chemistry.

Features

  • Search & Discover - Find datasets by keyword or browse the catalog

  • Rich Metadata - Understand datasets before downloading with detailed schemas

  • Easy Loading - Get data in Python, PyTorch, or TensorFlow format

  • Automatic Caching - Fast subsequent access after first download

  • Publishing - Share your own datasets with the community

  • AI Integration - MCP server for AI assistant access

  • CLI - Terminal-based workflows

Quick Example

Installation

For cloud environments (Colab, remote Jupyter):

What's Next?

Project Support

This work was supported by the National Science Foundation under NSF Award Number: 1931306 "Collaborative Research: Framework: Machine Learning Materials Innovation Infrastructure".

Foundry brings together components from:

Last updated

Was this helpful?