Components of Ai System Design PDF
Components of Ai System Design PDF
DATA ACQUISITION
Data acquisition is the process of collecting and gathering raw data from various sources to be used for
building machine learning models. This stage is crucial as the quality and relevance of the data directly affect
the success of the model.
Sources: Data can be acquired from databases, APIs, web scraping, sensors, user inputs, publicly available
datasets, or manual entry.
Types: Structured (e.g., SQL databases), unstructured (e.g., text, images), or semi-structured (e.g., JSON,
XML).
Challenges: Incomplete data, high costs, storage requirements, privacy concerns, and real-time data
collection.
MODEL SELECTION
Model selection involves choosing the most appropriate machine learning algorithm(s) for the task at hand.
Types of Models:
Supervised Learning: Regression (e.g., Linear Regression, Decision Trees) or Classification (e.g., SVM, Random Forest,
Neural Networks).
Unsupervised Learning: Clustering (e.g., K-means, DBSCAN) or Dimensionality Reduction (e.g., PCA).
Reinforcement Learning: For tasks where the model learns via trial and error, such as robotics or game playing.