Machine Learning Part1
Machine Learning Part1
Trainers: Jordan A. Caraballo-Vega, Mark L. Carroll, Jules Kouatchou, Jian Li, Caleb S. Spradlin
Agriculture
Empower the global
community Climate
to incorporate Disasters
Earth-observing data into
Health & Air Quality
environmental
management and Land
decision-making
Water Resources
CAPACITY
BUILDING
• Prerequisites:
• Session 1 of our on-demand Fundamentals of Remote Sensing series or have
equivalent experience.
• Attendees will need access to Google Drive and Google Colab. To access
these resources, users must use an email ending in ‘gmail.com’.
• We will have the video of this demonstration within the training recording
available within 48 hours after the presentation for you to go through at your
own pace.
Part 1: Overview of
Machine Learning Part 2: Part 3: Homework
Training Data and Model Tuning, Independent
Land Cover Parameter
practice and
Classification Optimization, and
Additional application
Example
Machine Learning
Algorithms
Trainers: Jordan A. Caraballo-Vega, Mark L. Carroll, Jules Kouatchou, Jian Li, Caleb S. Spradlin
Jules Kouatchou
Chief Programmer/Analyst
Jordan A. Caraballo-Vega Caleb S. Spradlin
Computer Engineer Software Developer
• Recognize the most common machine learning methods used for processing
Earth Science data
• Describe the benefits and limitations of machine learning for Earth Science
analysis
• Explain how to apply basic machine learning algorithms and techniques in a
meaningful manner to remote sensing data
The following quote from Arthur Samuel describes what Machine Learning (ML) is:
• Programming Languages
• Software Packages
NASA’s Applied Remote Sensing Training Program Image Source: Stack Overflow 2022 26
Machine Learning Frameworks in Python
• Scikit-Learn — One of the most important libraries (Swiss Knife) for Machine Learning as it
provides a number of simple and efficient tools for data analysis. It provides functionality for
classification, regression, clustering algorithms, dimensionality reduction, model selection, and
data preprocessing.
• TensorFlow — Library was developed by engineers and researchers working on the Google Brain
team that conducts machine learning and neural networks research. It allows researchers to
push boundaries in discovering state-of-the-art (SOTA) results, and also allows developers to
create ML-powered applications.
• Keras — High-level neural networks API, which can be implemented on top of TensorFlow or
Theano used for building and training deep learning models. It allows for easy and fast
prototyping and supports both convolutional neural networks and recurrent networks.
• PyTorch — Provides functionality largely centered around building and training neural
networks—the backbone of deep learning. PyTorch offers scalable distributed training of models
across single or multiple CPUs and GPUs. the first release was in September 2016, but it has
quickly been widely adopted by industry such as Tesla & Uber.
Data
• Which scientific question would
you like to address?
Dimensionality Complexity
Transformers
LSTM XGBoost
water)
Algorithm decision branch based on data structure.
• Post-Session Assignment
• Trainers:
– Jordan A. Caraballo-Vega: [email protected]
– Jules Kouatchou: [email protected]
– Caleb S. Spradlin: [email protected]
– Jian Li: [email protected]
– Brock Blevins: [email protected]
• Training Webpage:
– https://appliedsciences.nasa.gov/join-mission/training/english/arset-fundamentals-
machine-learning-earth-science
• Crankshaw, D., & Gonzalez, J. (2018). Prediction-Serving Systems: What happens when we
wish to actually deploy a machine learning model to production?. Queue, 16(1), 83-97.
• Elders, A., Carroll, M. L., Neigh, C. S., D'Agostino, A. L., Ksoll, C., Wooten, M. R., & Brown, M.
E. (2022). Estimating crop type and yield of small holder fields in Burkina Faso using multi-
day Sentinel-2. Remote Sensing Applications: Society and Environment, 27, 100820.
• Fleming, S. W., Watson, J. R., Ellenson, A., Cannon, A. J., & Vesselinov, V. C. (2021).
Machine learning in Earth and environmental science requires education and research
policy reforms. Nature Geoscience, 14(12), 878-880.
• Prša, A., Kochoska, A., Conroy, K. E., Eisner, N., Hey, D. R., IJspeert, L., ... & Winn, J. N. (2022).
TESS Eclipsing Binary Stars. I. Short-cadence Observations of 4584 Eclipsing Binaries in
Sectors 1–26. The Astrophysical Journal Supplement Series, 258(1), 16.
• Reichstein, M., Camps-Valls, G., Stevens, B., Jung, M., Denzler, J., & Carvalhais, N. the
National Energy Research Supercomputing Center in Lawrence Berkeley National
Laboratory, Berkeley, CA, USA: Deep learning and process understanding for data-driven
Earth system science. Nature, 566, 195-204.
• Yu, S., & Ma, J. (2021). Deep learning for geophysics: Current and future trends. Reviews of
Geophysics, 59(3), e2021RG000742.
• Jordan A. Caraballo-Vega
• Mark L. Carroll
• Jules R. Kouatchou
• Jian Li
• Caleb S. Spradlin
• Brock Blevins
• Melanie Follette-Cook
• Erika Podest
• Brian Powell
• Akiko Elders