🌐 EN | 🇯🇵 JP | Last sync: 2025-11-16

Active Learning Introductory Series v1.0

Strategic exploration to find optimal solutions with fewer experiments

📖 Total Learning Time: 100-120 minutes 📊 Difficulty: Intermediate to Advanced 💻 Code Examples: 28 📝 Exercises: 12 📋 Case Studies: 5

Series Overview

This series is educational content with a 4-chapter structure that allows learners to progress from beginners just starting to learn Active Learning to those who want to develop practical Materials Exploration skills.

Active Learning is a Machine Learning technique that actively selects data with the highest information value through a limited number of experiments. In Materials Exploration, by intelligently deciding which samples to measure next, you can achieve target performance with one-tenth or fewer experiments compared to Random Sampling. Toyota's Catalyst development achieved an 80% reduction in experiments, while MIT's battery Materials Exploration increased development speed 10-fold.

Why This Series is Necessary

Background and Challenges: The greatest challenge in Materials Science is the vastness of the search space and the high cost of experiments. For example, Catalyst screening involves tens of thousands of candidate materials, and evaluating a single sample can take days to weeks. Measuring all samples is physically and economically impossible. Traditional Random Sampling wastes valuable experimental resources on low-information-value samples.

What You Will Learn in This Series: This series systematically teaches Active Learning from theory to practice through executable Code Examples and Materials Science case studies. You will acquire practical skills from day one, including Query Strategies (data selection strategy), Uncertainty Estimation techniques, Acquisition Function design, and automatic integration with experimental equipment.

Features:

Target Audience:

How to Learn

Recommended Learning Order

flowchart TD A["Chapter 1: Why Active Learning is Needed"] --> B["Chapter 2: Uncertainty Estimation Techniques"] B --> C["Chapter 3: Acquisition Function Design"] C --> D["Chapter 4: Application to Materials Exploration"] style A fill:#e3f2fd style B fill:#fff3e0 style C fill:#f3e5f5 style D fill:#e8f5e9

For Beginners (No prior Active Learning knowledge):

For Intermediate Learners (Bayesian Optimization experience):

For Practical Skill Enhancement (Implementation-focused over theory):

Learning Flowchart

flowchart TD Start["Start Learning"] --> Q1{"Bayesian Optimization
Experience?"} Q1 -->|First time| PreBO["Prerequisite: BO
Introductory Series"] Q1 -->|Experienced| Q2{"Active Learning
Experience?"} PreBO --> Ch1 Q2 -->|First time| Ch1["Start from Chapter 1"] Q2 -->|Basic knowledge| Ch2["Start from Chapter 2"] Q2 -->|Implementation
experience| Ch3["Start from Chapter 3"] Ch1 --> Ch2["Go to Chapter 2"] Ch2 --> Ch3["Go to Chapter 3"] Ch3 --> Ch4["Go to Chapter 4"] Ch4 --> Complete["Series Complete"] Complete --> Next["Next Steps"] Next --> Project["Personal Project"] Next --> Robotic["Robotics Experiment
Automation"] Next --> Community["Research
Community"] style Start fill:#4CAF50,color:#fff style Complete fill:#2196F3,color:#fff style Next fill:#FF9800,color:#fff

Chapter Details

Chapter 1: The Need for Active Learning

📖 Reading Time: 20-25 minutes 📊 Difficulty: Intermediate 💻 Code Examples: 6-8

Learning Content

Learning Objectives

Read Chapter 1 →

Chapter 2: Uncertainty Estimation Techniques

📖 Reading Time: 25-30 minutes 📊 Difficulty: Intermediate to Advanced 💻 Code Examples: 7-9

Learning Content

Learning Objectives

Uncertainty Estimation Flow

flowchart TD A["Training Data"] --> B{"Model
Selection"} B -->|Ensemble| C["Random Forest/
LightGBM"] B -->|Deep Learning| D["MC Dropout"] B -->|GP| E["Gaussian Process"] C --> F["Calculate
Prediction Variance"] D --> F E --> F F --> G["Select Samples with
High Uncertainty"] G --> H["Experiment Execution"] H --> A style A fill:#e3f2fd style B fill:#fff3e0 style G fill:#e8f5e9

Read Chapter 2 →

Chapter 3: Acquisition Function Design

📖 Reading Time: 25-30 minutes 📊 Difficulty: Intermediate to Advanced 💻 Code Examples: 6-8

Learning Content

Learning Objectives

Acquisition Function Comparison

Acquisition Function Characteristics Exploration Tendency Computation Cost Recommended Use
EI Expected Improvement Balanced Medium General Optimization
PI Probability of Improvement Exploitation-focused Low Fast Exploration
UCB Upper Confidence Bound Exploration-focused Low Wide-range Search
Thompson Probabilistic Balanced Medium Parallel Experiments

Read Chapter 3 →

Chapter 4: Applications and Practice in Materials Exploration

📖 Reading Time: 25-30 minutes 📊 Difficulty: Advanced 💻 Code Examples: 6-8

Learning Content

Learning Objectives

Closed-Loop Optimization

flowchart LR A["Candidate Proposal
Active Learning"] --> B["Experiment Execution
Robotics"] B --> C["Measurement &
Evaluation
Sensors"] C --> D["Data Accumulation
Database"] D --> E["Model Update
Machine Learning"] E --> F["Acquisition Function
Evaluation &
Next Candidate"] F --> A style A fill:#e3f2fd style B fill:#fff3e0 style C fill:#f3e5f5 style D fill:#e8f5e9 style E fill:#ffebee style F fill:#fce4ec

Read Chapter 4 →

Overall Learning Outcomes

Upon completing this series, you will acquire the following skills and knowledge:

Knowledge Level (Understanding)

Practical Skills (Doing)

Application Ability (Applying)

FAQ (Frequently Asked Questions)

Q1: What is the difference between Active Learning and Bayesian Optimization?

A: Active Learning and Bayesian Optimization are closely related but have different focuses:

Commonality: Both perform "intelligent sampling leveraging uncertainty". Bayesian Optimization can be viewed as a special case of Active Learning.

Q2: Can I understand this with limited Machine Learning experience?

A: Yes, if you have basic Machine Learning knowledge (linear regression, decision trees, cross-validation, etc.). However, we recommend the following prerequisites:

Q3: Which Uncertainty Estimation technique should I choose?

A: Choose based on problem characteristics and available resources:

Recommendation: Start with Ensemble methods, then transition to GP or Dropout as needed.

Q4: Can I learn without experimental equipment?

A: Yes, you can. This series teaches fundamentals with simulation data, provides practice with open datasets (Materials Project, etc.), and teaches closed-loop concepts and code examples. You will acquire knowledge that can be immediately applied when you use experimental equipment in the future.

Q5: Are there any industrial applications with proven results?

A: Many successful examples exist:

Prerequisites and Related Series

Prerequisites

Required:

Strongly Recommended:

Complete Learning Path

flowchart TD Pre1["Prerequisite:
Python Basics"] --> Pre2["Prerequisite:
Materials Informatics
Introduction"] Pre2 --> Pre3["Prerequisite:
Bayesian Optimization
Introduction"] Pre3 --> Current["Active Learning
Introduction"] Current --> Next1["Next: Robotics
Experiment Automation"] Current --> Next2["Next: Reinforcement
Learning Introduction"] Current --> Next3["Application: Real
Materials Exploration
Project"] Next1 --> Advanced["Advanced: Autonomous
Experimental Systems"] Next2 --> Advanced Next3 --> Advanced style Pre1 fill:#e3f2fd style Pre2 fill:#e3f2fd style Pre3 fill:#fff3e0 style Current fill:#4CAF50,color:#fff style Next1 fill:#f3e5f5 style Next2 fill:#f3e5f5 style Next3 fill:#f3e5f5 style Advanced fill:#ffebee

Key Tools

Tool Name Purpose License Installation
modAL Active Learning specialized library MIT pip install modAL-python
scikit-learn Machine Learning foundation BSD-3 pip install scikit-learn
GPyTorch Gaussian Process (GPU-compatible) MIT pip install gpytorch
BoTorch Bayesian Optimization (PyTorch) MIT pip install botorch
pandas Data management BSD-3 pip install pandas
matplotlib Visualization PSF pip install matplotlib
numpy Numerical computation BSD-3 pip install numpy

Next Steps

Recommended Actions After Series Completion

Immediate (Within 1-2 weeks):

Short-term (1-3 months):

Let's Get Started!

Are you ready? Start from Chapter 1 and begin your journey to revolutionize Materials Exploration with Active Learning!

Chapter 1: Why Active Learning is Needed →

Disclaimer