PublishedBookPreview FrontPages
PublishedBookPreview FrontPages
net/publication/383551470
CITATION READS
1 666
4 authors, including:
Prasanth Aruchamy
Vel Tech Rangarajan Dr.Sagunthala R&D Institute of Science and Technology
77 PUBLICATIONS 1,242 CITATIONS
SEE PROFILE
All content following this page was uploaded by Balasubramaniam .S on 11 September 2024.
Artificial Intelligence.
Machine Learning, Convolutional Neural Networks and Large Language
Models
Edited by: Leonidas Deligiannidis, George Dimitoglou and
Hamid R. Arabnia,
ISBN ----, e-ISBN (PDF) ----
Edited by
S. Balasubramaniam, Seifedine Kadry, A. Prasanth
and Rajesh Kumar Dhanaraj
Editors
S. Balasubramaniam A. Prasanth
218 D/ 50, Asambu Road Sri Shanmugavel Weaving Mills
Vadasery, Nagercoil 629001 Dindigul 624709
Tamil Nadu, India Tamil Nadu, India
[email protected] [email protected]
ISBN 978-3-11-142463-7
e-ISBN (PDF) 978-3-11-142507-8
e-ISBN (EPUB) 978-3-11-142551-1
www.degruyter.com
Preface
Generative artificial intelligence (generative AI or GAI) and large language models
(LLM) are machine learning algorithms that operate in an unsupervised or semi-
supervised manner. These algorithms leverage pre-existing content, such as text, pho-
tos, audio, video, and code, to generate novel content. The primary objective is to pro-
duce authentic and novel material. In addition, there exists an absence of constraints
on the quantity of novel material that they are capable of generating. New material
can be generated through the utilization of Application Programming Interfaces
(APIs) or natural language interfaces, such as the ChatGPT developed by Open AI and
Bard developed by Google.
The field of generative artificial intelligence stands out due to its unique charac-
teristic of undergoing development and maturation in a highly transparent manner,
with its progress being observed by the public at large. The current era of artificial
intelligence is being influenced by the imperative to effectively utilize its capabilities
in order to enhance corporate operations. Specifically, the use of large language
model (LLM) capabilities, which fall under the category of generative AI, holds the
potential to redefine the limits of innovation and productivity. However, as firms
strive to include new technologies, there is a potential for compromising data privacy,
long-term competitiveness, and environmental sustainability.
This book delves into the exploration of GAI and LLM. It examines the historical
and evolutionary development of GAI models, as well as the challenges and issues
that have emerged from these models and LLM. This book also discusses the necessity
of generative AI-based systems and explores the various training methods that have
been developed for GAI models, including LLM pretraining, LLM fine-tuning, and re-
inforcement learning from human feedback. Additionally, it explores the potential
use cases, applications, and ethical considerations associated with these models. This
book concludes by discussing future directions in generative AI and presenting vari-
ous case studies that highlight the applications of GAI and LLM.
https://doi.org/10.1515/9783111425078-202
Contents
Preface V
List of Contributors XI
Reshmi L. B., Vipin Raj R., Balasubramaniam S., and K. Satheesh Kumar
12 Generative AI and LLM: Case Study in Finance 231
Index 273
About the Editors
Dr. Balasubramaniam S. is working as an Assistant Professor in the School of
Computer Science and Engineering, Kerala University of Digital Sciences,
Innovation and Technology (Formerly IIITM-K), Digital University Kerala,
Thiruvananthapuram, Kerala, India. He has around 10+ years of experience in
teaching, research, and industry. He has completed his Postdoctoral Research
in Department of Applied Data Science, Noroff University College, Kristiansand,
Norway. He holds a PhD in Computer Science and Engineering from Anna
University, Chennai, India, in 2015. He has published nearly 20 research papers in
reputed SCI/WoS/Scopus indexed journals. He has also granted with one
Australian patent, one Indian patent, and published three Indian patents. He
has presented papers at conferences, contributed chapters to the edited books, and edited a number of
books published by international publishers. His research and publication interests include machine
learning and deep learning based disease diagnosis, cloud computing security, generative AI, and
electric vehicles.
Orcid Id: https://orcid.org/my-orcid?orcid=0000-0003-1371-3088
LinkedIn: https://www.linkedin.com/in/dr-balasubramaniam-s-6873533b/
Google Scholar: https://scholar.google.co.in/citations?user=1KGLST0AAAAJ&hl=en
Academic url: https://duk.ac.in/personnel/balasubramaniam-s/
https://doi.org/10.1515/9783111425078-204
X About the Editors
His research interests include Internet of Things, blockchain, wireless sensor networks, medical image
processing, and machine learning.
Google Scholar: https://scholar.google.co.in/citations?user=JrH8j3kAAAAJ&hl=en
LinkedIn: https://www.linkedin.com/in/dr-a-prasanth-m-e-ph-d-9528591b4/