Generative AI
Generative AI
“Generative AI is the most powerful tool for creativity that has ever been
created. It has the potential to unleash a new era of human innovation.”
-Elon Musk
Its prevalence in the tech world makes it an essential tool for CSE
students. The best colleges in India have incorporated generative AI into
their curriculum to ensure their graduates are industry-ready.
Generative AI Interfaces
Chat GPT
As a CSE graduate, exploring the capabilities of GPT-3 for tasks like code
generation, content creation, and language translation could be path-
breaking.
DALL-E
GitHub Copilot
Transformer Models
Transformer models, like OpenAI’s GPT series, have gained prominence
for their attention-based architecture, enabling contextual understanding
and generating coherent and contextually relevant content.
Reinforcement Learning
Natural Language Processing
Cognitive Modelling
Advanced Computer Vision and Video Analytics
Image and Video Processing
Conclusion
Students intrigued by AI will find their place at the Best Colleges In India,
training to contribute to the transformative power of AI in the digital era.
Generative AI has the potential to bring in a new era of human innovation,
and while everyone is clamouring to understand it, CSE students have the
opportunity to be a part of this transformation.
https://www.youtube.com/watch?v=aKSVZjti9q0
It can create attack simulations for training security analysts and testing
system effectiveness, identify threats, and establish predictive models.
Airbus, for instance, reduced aircraft partition wall weight by 45%, using
generative design.
Tapestry, the parent company of brands like Kate Spade and Coach,
utilizes generative AI to automate online personalization.
In the medical field, the impact will be notable in product and R and D.
Generative AI isn't just a trend, it's a driving force behind the next wave of
innovation, efficiency, and personalized experiences across industries.
In this chapter, we offer you essential knowledge for building and training deep learning
models, including Generative Adversarial Networks (GANs). We are going to explain the
basics of deep learning, starting with a simple example of a learning algorithm based on
linear regression. We will also provide instructions on how to set up a deep learning
programming environment using Python and Keras. We will also talk about the importance of
computing power in deep learning; we are going to describe guidelines to fully take
advantage of NVIDIA GPUs by maximizing the memory footprint, enabling the CUDA
Deep Neural Network library (cuDNN), and eventually using distributed training setups
with multiple GPUs. Finally, in addition to installing the libraries that will be necessary for
upcoming projects in this book, you will test your installation by building, from scratch, a
simple and efficient Artificial Neural Network (ANN) that will learn from data how to
classify images of handwritten digits.
Deep learning is a subset of machine learning, which is a field of artificial intelligence that
uses mathematics and computers to learn from data and map it from some input to some
output. Loosely speaking, a map or a model is a function with parameters that maps the input
to an output. Learning the map, also known as mode, occurs by updating the parameters of
the map such that some expected empirical loss is minimized. The empirical loss is a measure
of distance between the values predicted by the model and the target values given the
empirical data.
Notice that this learning setup is extremely powerful because it does not require having an
explicit understanding of the rules that define the map. An interesting aspect of this setup is
that it does not guarantee that you will learn the exact map that maps the input to the output,
but some other maps, as expected, predict the correct output.
This learning setup, however, does not come without a price: some deep learning methods
require large amounts of data, specially when compared with methods that rely on feature
engineering. Fortunately, there is a large availability of free data, specially unlabeled, in
many domains.
Meanwhile, the term deep learning refers to the use of multiple layers in an ANN to form
a deep chain of functions. The term ANN suggests that such models informally draw
inspiration from theoretical models of how learning could happen in the brain. ANNs, also
referred to as deep neural networks, are the main class of models considered in this book.
Despite its recent success in many applications, deep learning is not new and according to Ian
Goodfellow, Yoshua Bengio, and Aaron Courville, there have been three eras:
Linear regression is used to estimate the parameters of a model to describe the relationship
between an output variable and the given input variables. It can be mathematically described
as a weighted sum of input variables:
Here, the weight, , and inputs, , are vectors in ; in other words, they are real-valued vectors
with dimensions, as a scalar bias term, and as a scalar term that represents the valuation of
the function at the input . In ANNs, the output of a single neuron without non-linearities is
similar to the output of the linear model described in the preceding linear regression equation
and the following diagram:
Logistic regression is a special version of regression where a specific non-linear function,
the sigmoid function, is applied to the output of the linear model in the earlier linear
regression equation:
The In ANNs, the non-linear model described in the logistic regression equation is similar to
the output of a single neuron with a sigmoid non-linearity in the following diagram:
A combination of such neurons defines a hidden layer in a neural network, and the neural
networks are organized as a chain of layers. The output of a hidden layer is described by the
following equation and diagram: