0% found this document useful (0 votes)
63 views

Sample Internship

Uploaded by

Jeevan Kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
63 views

Sample Internship

Uploaded by

Jeevan Kumar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 56

SEMESTER INTERNSHIP

ANDHRA PRADESH
STATE COUNCIL OF HIGHER EDUCATION
(A STATUTORY BODY OF GOVERNMENT OF ANDHRA PRADESH)
PROGRAM BOOK FOR

SEMESTER INTERNSHIP

Name of the Student : VAKADA KEERTY

Name of the College : GIET Engineering College

Registration Number : 21T91A1246

Period of Internship : 8 Weeks From: 03-06-2024 To: 27-07-2024

Name & Address of the Intern Organization: INTERNATIONAL INSTITUTE OF


DIGITAL TECHNOLOGIES(IIDT)-APSHE in Collaboration with BLACKBUCKS
ENGINEERS- Tirupati

Jawaharlal Nehru Technological University, Kakinada

2024-2025
An Internship Report on
HEART DISEASE DETECTION
Submitted in accordance with the requirement for the degree of
Bachelor of Technology

Under the Faculty Guideship of


Mr. D. Ramesh, M.Tech., (Ph.D.),
Assistant Professor

Department of

Information Technology
in
GIET Engineering College

Submitted by:

NAME OF THE STUDENT

Reg.No.: 21T91A1246

GIET Engineering College


(Affiliated to JNTUK, Kakinada and approved by AICTE, Accredited by NAAC, New Delhi)
Chaitanya Knowledge City, Velugubanda, Rajamahendravaram,
Andhra Pradesh, India - 522396.
Student’s Declaration

I, VAKADA KEERTY a student of GIET ENGINEERING COLLEGE Program B.Tech.,


Reg.No.21T91A1246 of the Department of INFORMATION TECHNOLOGY, GIET
ENGINEERING COLLEGE do hereby declare that I have completed the mandatory internship
from 03 JUNE 2024 to 27 JULY 2024 in INTERNATIONAL INSTITUTE OF DIGITAL
TECHNOLOGIES (IIDT) – APSCHE in Collaboration with BLACKBUCK ENGINEERS under
the Faculty Guideship of Mr. D. Ramesh, M.Tech., (Ph.D.), Assistant Professor department of
INFORMATION TECHNOLOGY, GIET ENGINEERING COLLEGE .

VAKADA KEERTY - 21T91A1246


Official Certification

This is to certify that VAKADA KEERTY Reg. No. 21T91A1246 has completed her
internship in INTERNATIONAL INSTITUTE OF DIGITAL TECHNOLOGY (IIDT) – APSCHE
in Collaboration with BLACKBUCK ENGINEERS on HEART DISEASE DETECTION using
AI-ML-DS under my supervision as a part of partial fulfillment of the requirement for Degree of
BACHELORS OF TECHNOLOGY in the department of INFORMATION TECHNOLOGY,
GIET ENGINEERING COLLEGE.

Faculty Guide Head of the Department


Mr. D. Ramesh, M.Tech., (Ph.D.) Mr. M. Sreenivasu, M.Tech., (Ph.D.)
Assistant Professor Associate Professor
Certificate from Intern Organization
ACKNOWLEDGMENT

Before getting into the thickest of things, I would like to think personalities who were part
of my project in numerous ways, those who give me outstanding support from birth of the project
I hereby wish to express my deep sense of gratitude to Mr.D.Ramesh, Assistant Professor,
Department of Information Technology, GIET Engineering College for the esteemed guidance,
moral support and invaluable advice provided by him for the success of project.
I am highly indebted to Dr.M.Vijaya Sekhar Babu, Principal of GIET Engineering
College, Rajamahendravaram, for his support during the tenure of the project.
I am very much obliged to my beloved Mr.M.Sreenivasu, Head of the Department of
Information Technology, GIET Engineering college for providing the opportunity to undertake
this project and encouragement in completion of this project.
I am also thankful to all the staff members of Information Technology Department who
have co-operated in making my project a success. I would like to thank my parents and all friends
who extended their help, encouragement and moral support either directly or indirectly in my
project work.
Thanks for Your Valuable Guidance and kind support.

Submitted by

VAKADA KEERTY - 21T91A1246


CONTENTS

SL.NO. TITTLE PAGE NO.

1 EXECUTIVE SUMMARY 1

2 OVERVIEW OF THE ORGANIZATION 3

3 INTERNSHIP PART 5

4 ACTIVITY LOG 7

5 OUTCOMES DESCRIPTION 23

6 SOURCE CODE 33

7 PHOTOS & VIDEO LINKS 39

8 CONCLUSION 42

9 REFERENCES 43
Heart Disease Detection

CHAPTER 1
EXECUTIVE SUMMARY
This internship report provides a comprehensive overview of my experience during the
short internship at IIDT, in collaboration with APSCHE and Blackbuck Engineers, focusing on
Artificial Intelligence, Machine Learning, and Data Science applications.

Learning Objectives and Outcomes Achieved:


Practical Application of AI and ML:
Gained hands-on experience in developing machine learning models, specifically
focusing on a logistic regression model for heart disease detection, achieving an
accuracy of 85.25%. This deepened my understanding of model training and evaluation
techniques.
Data Preprocessing Skills:
Enhanced my skills in data collection, cleaning, and preprocessing, using Python
libraries such as Pandas and NumPy to prepare datasets for analysis and model training.
Software Testing Methodologies:
Acquired knowledge of various software testing strategies, including black box and
white box testing, enabling me to validate the functionality and performance of
software applications.
Collaboration and Communication:
Developed effective collaboration skills by working with cross-functional teams,
improving my ability to communicate technical concepts and insights to non-technical
stakeholders.
Project Management and Documentation:
Improved my project management skills through the documentation of processes,
testing results, and methodologies, ensuring clarity and consistency in project
deliverables.

Sector and Organization Overview:


The internship was conducted at Blackbuck Engineers, a prominent organization in the technology
sector, focusing on innovative solutions that leverage artificial intelligence and machine learning
to optimize business processes and improve decision-making. The partnership with IIDT and
APSCHE emphasizes a commitment to advancing education and skill development in emerging
technologies.

GIET Engineering College 1|Page


Heart Disease Detection

Summary of Activities:
Throughout the internship, I engaged in a variety of activities, including:
 Developing and validating a logistic regression model for heart disease detection, focusing
on precision, recall, and F1 score metrics to assess model performance.
 Participating in software testing phases, implementing both black box and white box
testing techniques to ensure software reliability and performance.
 Collaborating with mentors and peers on data science projects, contributing to
brainstorming sessions and presenting findings to enhance collective learning.
 Documenting the entire process, including methodologies, challenges faced, and solutions
implemented, to provide a clear record of the project’s progression.

GIET Engineering College 2|Page


Heart Disease Detection

CHAPTER 2
OVERVIEW OF THE ORGANIZATION
Introduction of the Organization

Blackbuck Engineers is a leading technology firm specializing in Artificial


Intelligence (AI), Machine Learning (ML), and Data Science (DS) solutions. The organization is
dedicated to providing innovative solutions that help businesses optimize their operations, improve
decision-making processes, and drive efficiency. With a team of skilled professionals, Blackbuck
Engineers leverages cutting-edge technologies to deliver tailored solutions to various industries.

Vision, Mission, and Values of the Organization

Vision: To be a global leader in providing AI and ML solutions that empower businesses to unlock
their full potential through data-driven decision-making.

Mission: To develop innovative and sustainable technology solutions that enhance productivity
and efficiency while fostering a culture of continuous improvement and learning.

Values:

 Innovation: Encouraging creative problem-solving and continuous learning.


 Integrity: Upholding ethical standards and transparency in all business dealings.
 Collaboration: Fostering teamwork and cooperation to achieve common goals.
 Customer Focus: Prioritizing client needs and delivering exceptional service.

Policy of the Organization, in Relation to the Intern Role

The organization promotes a policy of inclusivity and mentorship, providing interns with
opportunities to gain practical experience in a supportive environment. Interns are encouraged to
participate actively in projects, receive guidance from experienced professionals, and contribute
to the organization’s objectives while developing their skills.

Organizational Structure

Blackbuck Engineers operates with a flat organizational structure that fosters open communication
and collaboration. Key departments include:

 Research and Development (R&D): Focuses on developing new AI and ML algorithms


and technologies.

GIET Engineering College 3|Page


Heart Disease Detection
 Data Science Team: Responsible for data analysis, model development, and
implementation.
 Software Development: Handles software engineering tasks and application
development.
 Sales and Marketing: Manages client relationships and market outreach.

Roles and Responsibilities of the Employees in Which the Intern is Placed

During the internship, I was placed in the Data Science Team, where the roles and responsibilities
included:

 Data Analysts: Collect, preprocess, and analyse data to extract meaningful insights.
 Data Scientists: Develop and validate machine learning models, ensuring they meet
performance standards.
 Machine Learning Engineers: Implement algorithms and optimize model performance
for production environments.

Performance of the Organization in Terms of Turnover, Profits, Market


Reach and Market Value
Blackbuck Engineers has demonstrated consistent growth in recent years, achieving significant
turnover and profit margins. The organization has expanded its market reach, serving clients across
various sectors, including healthcare, finance, and manufacturing. Its commitment to innovation
has positioned it favourably in the tech industry, enhancing its market value and reputation.

Future Plans of the Organization

Looking ahead, Blackbuck Engineers plans to expand its offerings by integrating advanced
technologies such as deep learning and artificial intelligence into its solutions. The organization
aims to invest in research and development to stay at the forefront of technological advancements.
Additionally, there are plans to strengthen partnerships with educational institutions to foster talent
development and enhance community engagement.

GIET Engineering College 4|Page


Heart Disease Detection

CHAPTER 3
INTERNSHIP PART
Description of Activities/Responsibilities in the Intern Organization

During my internship at Blackbuck Engineers, I was immersed in a dynamic and innovative


environment that focused on AI, ML, and Data Science. The experience was structured around
several key activities and responsibilities, which I will detail below.

Working Conditions

The working conditions at Blackbuck Engineers were conducive to learning and collaboration.
The office was equipped with modern facilities and tools that fostered a creative work
environment. Team members were encouraged to share ideas, engage in discussions, and
participate in brainstorming sessions. Remote working options were also available, allowing
flexibility in completing tasks.

Weekly Work Schedule

The typical weekly work schedule consisted of 5 working days, from Monday to Friday. Each day
involved a mix of collaborative team meetings, individual project work, and skill development
sessions. The schedule was as follows:

Monday: Team meeting to discuss project updates and goals for the week.

Tuesday to Thursday: Focused work on assigned tasks, including data analysis and model
development.

Friday: Review session where interns presented their work, received feedback, and discussed
challenges encountered during the week.

Equipment Used

Throughout the internship, I had access to various tools and technologies essential for data analysis
and machine learning, including:

 Programming Languages: Python, R


 Libraries and Frameworks: Scikit-learn, TensorFlow, Pandas, NumPy
 Data Visualization Tools: Matplotlib, Seaborn
 Development Environment: Jupyter Notebook, Google Colab

GIET Engineering College 5|Page


Heart Disease Detection
 Collaboration Tools: GitHub for version control, Slack for communication, and Trello for
task management.

Tasks Performed

My responsibilities encompassed a variety of tasks that provided hands-on experience in the field.
Key tasks included:

Data Collection and Preprocessing: Gathering datasets from various sources


and preparing them for analysis by cleaning and transforming the data.
Exploratory Data Analysis (EDA): Performing EDA to identify patterns,
trends, and anomalies within the data using visualization techniques.
Model Development: Assisting in the development of machine learning
models, including logistic regression, and evaluating their performance using
metrics likeaccuracy, precision, recall, and F1 score.
Documentation and Reporting: Maintaining detailed documentation of the
processes followed and results obtained, and preparing reports for team
meetings.
Collaboration on Projects: Participating in team projects, contributing ideas
and insights, and collaborating with data scientists and engineers to improve
existing modelsand algorithms.

Skills Acquired

During my internship, I acquired a variety of skills that are essential for a career in AI, ML, and
Data Science:

Technical Skills: Enhanced my proficiency in programming languages such as


Python and R, as well as gained hands-on experience with machine learning
libraries and data visualization tools.
Analytical Skills: Developed strong analytical skills by performing data
analysis and interpreting results to inform decision-making.
Collaboration and Communication: Improved my ability to work in a team
setting, communicate ideas effectively, and present findings to peers and
supervisors.
Problem-Solving: Strengthened my problem-solving skills by tackling real-
world data challenges and finding solutions through critical thinking.

GIET Engineering College 6|Page


Heart Disease Detection

CHAPTER 4: ACTIVITY LOG

ACTIVITY LOG FOR THE FIRST WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 7|Page


Heart Disease Detection

WEEKLY REPORT
WEEK – 1 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 8|Page


Heart Disease Detection

ACTIVITY LOG FOR THE SECOND WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 9|Page


Heart Disease Detection

WEEKLY REPORT
WEEK – 2 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 10 | P a g e


Heart Disease Detection

ACTIVITY LOG FOR THE THIRD WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 11 | P a g e


Heart Disease Detection

WEEKLY REPORT
WEEK – 3 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 12 | P a g e


Heart Disease Detection

ACTIVITY LOG FOR THE FOURTH WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 13 | P a g e


Heart Disease Detection

WEEKLY REPORT
WEEK – 4 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 14 | P a g e


Heart Disease Detection

ACTIVITY LOG FOR THE FIFTH WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 15 | P a g e


Heart Disease Detection

WEEKLY REPORT
WEEK – 5 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 16 | P a g e


Heart Disease Detection

ACTIVITY LOG FOR THE SIXTH WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 17 | P a g e


Heart Disease Detection

WEEKLY REPORT
WEEK – 6 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 18 | P a g e


Heart Disease Detection

ACTIVITY LOG FOR THE SEVENTH WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 19 | P a g e


Heart Disease Detection

WEEKLY REPORT
WEEK – 7 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 20 | P a g e


Heart Disease Detection

ACTIVITY LOG FOR THE EIGHTH WEEK

Person In-
Day & Brief description of the daily
Learning Outcome Charge
Date activity
Signature

Day - 1

Day – 2

Day – 3

Day – 4

Day – 5

Day - 6

GIET Engineering College 21 | P a g e


Heart Disease Detection

WEEKLY REPORT
WEEK – 8 (From Dt ……………...to Dt .......................... )

Objective of the Activity Done:

Detailed Report:

GIET Engineering College 22 | P a g e


Heart Disease Detection

CHAPTER 5: OUTCOMES DESCRIPTION


Work Environment Experienced
During my internship with IIDT and APSCHE in collaboration with Blackbucks, I had the
opportunity to work in a dynamic and supportive work environment that provided valuable
learning experiences. Here is a detailed description of various aspects of the work environment:
People Interactions:
The organization fostered a collaborative atmosphere, encouraging regular interactions
with team members and supervisors. This supportive environment made it easy to ask
questions, seek guidance, and understand new concepts related to AI, ML, and data
science. Feedback was constructive, helping me improve my skills and approach to
projects. Weekly review sessions provided opportunities to present progress and gain
insights from team members.
Facilities Available and Maintenance:
The workspace was well-equipped, with access to necessary technology and resources.
We had designated workstations, computers with required software, and a comfortable
setup that facilitated efficient working. Facilities were well-maintained, ensuring
smooth day-to-day operations and enabling interns to focus on their tasks without
logistical issues.
Clarity of Job Roles:
From the start, job roles and expectations were clearly defined, which helped in
maintaining focus on responsibilities and meeting deadlines. Specific tasks were
assigned based on each intern’s strengths and learning objectives, promoting individual
growth while achieving project goals. Regular meetings with mentors clarified tasks and
provided a roadmap for each project phase.
Protocols, Procedures, and Processes:
The organization followed structured protocols for task assignments, report
submissions, and review meetings. There was a well-defined project management
system that made it easier to track progress, understand expectations, and ensure task
completion within deadlines. Documented processes helped streamline daily activities,
making it simple to stay aligned with project requirements.
Discipline and Time Management:
The team maintained a disciplined schedule, with set work hours and project timelines.
Punctuality and meeting deadlines were emphasized, which helped develop strong time-

GIET Engineering College 23 | P a g e


Heart Disease Detection
management skills. Daily goals and deadlines encouraged interns to stay organized and
manage time efficiently, a critical skill in the fast-paced tech industry.
Harmonious Relationships and Socialization:
The organization encouraged building harmonious relationships, with an emphasis on
respect and professionalism. I experienced a friendly and respectful environment where
team members collaborated openly and shared ideas freely. There were informal
gatherings where team members connected and shared insights, helping create a
welcoming and cohesive atmosphere.
Mutual Support and Teamwork:
Mutual support and teamwork were central to the working culture. Colleagues and
supervisors were always willing to help, answer questions, and provide resources.
Regular check-ins and collaborative tasks strengthened the sense of teamwork, allowing
us to learn from each other’s experiences and support each other in achieving project
milestones.
Motivation:
The organization actively motivated interns by recognizing and rewarding progress,
which encouraged continuous improvement. Constructive feedback from supervisors
and acknowledgment of accomplishments were motivating factors that kept me engaged
and focused throughout the internship.
Space and Ventilation:
The workspaces were spacious, well-ventilated, and designed to ensure a comfortable
working environment. The setup helped maintain focus and productivity by reducing
distractions and allowing adequate space for each individual to work effectively.

Real-Time Technical Skills Acquired


Throughout my internship with IIDT, APSCHE, and Blackbucks, I gained valuable technical skills
and hands-on experience that enhanced my proficiency in AI, ML, and data science. Here’s an
overview of the job-related skills and hands-on experience I acquired:
Machine Learning Model Development:
 Developed a solid understanding of machine learning models, including
supervised and unsupervised learning techniques.
 Gained hands-on experience in implementing algorithms like Logistic
Regression, Decision Trees, Random Forest, and K-Nearest Neighbours (KNN).
 Worked with feature engineering, data cleaning, and preprocessing techniques
to improve model accuracy and efficiency.

GIET Engineering College 24 | P a g e


Heart Disease Detection

Data Preprocessing and Data Wrangling:


 Acquired skills in data wrangling, which involved cleaning, transforming, and
restructuring raw data into a usable format.
 Used tools like pandas and NumPy for data manipulation and handling missing
values, outliers, and scaling features.
 Gained experience with exploratory data analysis (EDA) to identify data
patterns, distributions, and correlations that improved decision-making in model
selection.
Python Programming for Data Science:
 Strengthened my proficiency in Python programming, particularly in using
libraries like pandas, NumPy, scikit-learn, and Matplotlib for data analysis and
machine learning.
 Learned to write clean, efficient, and modular code that improved model
readability and ease of maintenance.
 Utilized Jupyter Notebooks to document code and results, making the data
science workflow easier to follow.
Model Evaluation and Metrics:
 Acquired skills in evaluating model performance using metrics like accuracy,
precision, recall, F1-score, and ROC-AUC score.
 Gained hands-on experience with cross-validation and hyperparameter tuning
techniques to optimize model performance and avoid overfitting.
 Learned to interpret model results critically, balancing precision and recall based
on project requirements.
Data Visualization:
 Developed skills in data visualization, using Matplotlib and Seaborn to create
meaningful plots and graphs that highlighted data insights.
 Used visualizations to present findings effectively, aiding in communicating
data-driven decisions to supervisors and peers.
 Created correlation heatmaps, histograms, scatter plots, and box plots to
understand data distributions and relationships among features.
Working with Real-Time Datasets:
 Gained experience handling large datasets, which required efficient data
processing techniques to manage memory and computation time.

GIET Engineering College 25 | P a g e


Heart Disease Detection
 Worked on real-time datasets related to healthcare, which added practical
relevance to the projects, helping me understand data collection challenges,
privacy considerations, and ethical practices in data science.
Neural Networks and Deep Learning (Introduction):
 Explored the basics of neural networks and deep learning as part of the
internship's broader objectives.
 Understood the fundamental components of neural networks, including layers,
neurons, and activation functions, setting a foundation for future learning in deep
learning.
Version Control (Git):
 Learned to use Git for version control, collaborating on projects with peers and
ensuring code integrity.
 Used GitHub repositories to manage project updates, track changes, and
maintain a consistent codebase, which is essential in team-based data science
projects.

Managerial Skills Acquired


During my internship with IIDT, APSCHE, and Blackbucks, I gained critical managerial skills that
contributed to my professional growth, including planning, leadership, teamwork, and effective
time management. Here’s a detailed breakdown of the competencies I developed:
Planning and Goal Setting:
 Learned to set clear objectives for each project phase, from data collection to
model evaluation, ensuring all tasks aligned with the overall project goals.
 Developed weekly goals and milestones, which helped in tracking progress and
maintaining steady growth in technical and analytical skills.
Time Management and Productivity:
 Improved my ability to prioritize tasks and allocate time effectively, balancing
technical tasks with reporting and documentation responsibilities.
 Adopted productivity techniques, like the Pomodoro method, to manage
intensive coding sessions, which helped maintain high performance and avoid
burnout.
 Leveraged project management tools to monitor timelines, ensuring projects
remained on schedule and delivering work efficiently.

GIET Engineering College 26 | P a g e


Heart Disease Detection

Teamwork and Collaboration:


 Engaged actively in team meetings, contributing ideas and collaborating on tasks
like data cleaning, model tuning, and result interpretation.
 Practiced constructive feedback exchanges and learned to adapt to different
communication styles, which fostered a harmonious and cooperative team
environment.
 Learned the importance of delegating tasks based on individual strengths,
ensuring each team member contributed their unique skills to the project’s
success.
Leadership and Initiative:
 Demonstrated leadership by taking the initiative to address data quality issues,
suggesting improvements in preprocessing techniques that ultimately enhanced
model accuracy.
 Assisted junior team members in debugging code and understanding complex
concepts, fostering a supportive learning environment within the team.
 Volunteered to lead mini-projects and guide discussions on model results, which
helped me build confidence in leading technical tasks.
Behaviour and Workmanship:
 Cultivated a professional work ethic, ensuring punctuality, reliability, and
attention to detail in all tasks and deliverables.
 Demonstrated a positive attitude toward challenges, viewing them as
opportunities for improvement, which motivated me to push through complex
data and modelling issues.
 Maintained a high standard of workmanship, ensuring accuracy and precision in
data analysis, coding, and report writing, which contributed to my professional
credibility.
Decision-Making and Problem-Solving:
 Gained experience in making data-driven decisions, such as choosing the best
algorithm based on dataset characteristics and project requirements.
 Enhanced my analytical thinking by assessing model performance metrics
critically, which helped in selecting the most effective models for the given tasks.
 Learned to identify potential issues early and devise practical solutions,
demonstrating the importance of adaptability in a dynamic work environment.

GIET Engineering College 27 | P a g e


Heart Disease Detection

Performance Analysis and Weekly Improvement:


 Engaged in self-evaluation by assessing weekly accomplishments, identifying
areas for improvement, and setting new learning objectives for the upcoming
week.
 Analysed model performance metrics and applied that insight to optimize
results, which improved my technical competencies and decision-making skills.
 Practiced continuous learning by setting weekly technical goals, such as
mastering new libraries or improving EDA skills, fostering a culture of ongoing
self-improvement.
Productive Use of Resources:
 Utilized available resources, including online tutorials and mentorship sessions,
to enhance my knowledge and troubleshoot complex issues effectively.
 Developed a habit of documenting code and methodologies systematically,
making future reference easier and enabling efficient knowledge sharing with
the team.

Communication Skills
Improving communication skills is essential for building stronger professional relationships,
fostering team collaboration, and effectively conveying ideas. Here are ways I can work on specific
aspects of my communication skills:
Oral Communication:
Improvement Plan: Participate actively in discussions, ask clarifying questions, and seek
opportunities for public speaking. Practicing oral summaries of technical concepts can help
articulate complex ideas more clearly.
Written Communication:
Improvement Plan: Practice concise and structured writing by regularly documenting technical
processes and creating summaries. Joining writing workshops or reviewing feedback on written
reports can help improve clarity, grammar, and flow.
Conversational Abilities:
Improvement Plan: Engage more in casual conversations with colleagues to enhance
conversational ease. Practicing active listening and staying mindful of non-verbal cues will
improve the natural flow of conversations.

GIET Engineering College 28 | P a g e


Heart Disease Detection

Confidence Levels:
Improvement Plan: Work on confidence by preparing and rehearsing key points before speaking
in meetings. Receiving constructive feedback from colleagues or mentors can reinforce
confidence and reduce self-doubt.
Anxiety Management:
Improvement Plan: Practice mindfulness and deep-breathing techniques to manage anxiety
during presentations or public speaking. Gradually increasing exposure to challenging speaking
situations can help build resilience.
Understanding Others:
Improvement Plan: Develop better listening skills by summarizing others’ points before
responding. This shows attentiveness and helps in grasping others’ perspectives more fully.
Getting Understood by Others:
Improvement Plan: Ensure clarity by using simple language, particularly for technical terms,
and check for understanding. Pausing and inviting questions during conversations can ensure
alignment.
Extempore Speech:
Improvement Plan: Practice impromptu speaking through daily exercises, such as discussing a
random topic for a minute. This helps in structuring thoughts quickly and speaking confidently
under minimal preparation.
Ability to Articulate Key Points:
Improvement Plan: Focus on identifying and emphasizing key points, perhaps through bullet
points or summaries. Practicing this in both spoken and written formats can improve
conciseness and clarity.
Closing Conversations:
Improvement Plan: Learn to conclude discussions by summarizing action items or final
thoughts. Practicing this habit after every meeting or discussion can make it a natural part of
my communication style.
Maintaining Niceties and Protocols:
Improvement Plan: Observe and incorporate common professional etiquette, such as greeting
colleagues politely and following meeting protocols. Practicing these with consistency will
improve professional courtesy.

GIET Engineering College 29 | P a g e


Heart Disease Detection

Greeting, Thanking, and Appreciating Others:


Improvement Plan: Make a habit of acknowledging others’ efforts by expressing thanks and
offering specific compliments. This simple habit can enhance relationships and establish a
positive rapport with team members.

Enhancing Abilities
Enhancing abilities in group discussions, team participation, and leadership can greatly impact my
effectiveness and collaborative strength in a professional environment. Here’s a breakdown of how
I could work on these aspects:
Group Discussions:
 Enhancement Strategy: Improve active listening skills to better understand
diverse viewpoints, which can make responses more relevant and constructive.
Preparing for discussions by researching the topic beforehand would also allow
for informed contributions.
 Action Steps: Practice summarizing others’ points to show attentiveness and
build on their ideas. Working on framing concise, thoughtful responses will
increase the impact and clarity of my contributions.
Participation in Teams:
 Enhancement Strategy: Develop a proactive approach by volunteering for tasks,
sharing insights, and supporting teammates whenever possible. Openly
communicating availability and offering help can enhance team dynamics.
 Action Steps: Actively participate in team discussions, voice opinions
respectfully, and stay open to feedback. Regularly updating the team on task
progress can ensure alignment and accountability.
Contribution as a Team Member:
 Enhancement Strategy: Aim to bring unique value to the team by leveraging
strengths like analytical thinking, attention to detail, or technical knowledge.
Being mindful of group objectives and aligning personal tasks with those goals
can amplify the team’s success.
 Action Steps: Set personal goals aligned with team objectives, and periodically
assess contributions to ensure they add value. Encouraging other team members
and appreciating their efforts can also foster a positive and productive team
environment.

GIET Engineering College 30 | P a g e


Heart Disease Detection

Leading a Team or Activity:


 Enhancement Strategy: Build leadership skills by practicing effective
communication, delegating tasks based on team members’ strengths, and
fostering an inclusive environment. Encouraging collaboration and aligning the
team with a clear vision or goal is key to successful leadership.
 Action Steps: Focus on defining clear objectives, setting expectations, and
maintaining open lines of communication to address challenges proactively.
Providing constructive feedback and recognizing team achievements can keep
morale high and the team motivated.

Technological Developments
During my internship, I observed several technological developments directly relevant to the fields
of Artificial Intelligence (AI), Machine Learning (ML), and Data Science (DS). These
advancements are shaping the landscape of data-driven decision-making, predictive analytics, and
automation, making them invaluable in a variety of professional applications.
Advanced Machine Learning Algorithms:
 Observation: Rapid advancements in algorithms, particularly in areas like deep
learning, reinforcement learning, and ensemble methods (e.g., XGBoost and
Random Forest), have led to improved accuracy and efficiency in model training
and deployment.
 Relevance: These algorithms enable more precise predictions and better
handling of complex datasets, which are essential for projects in healthcare
diagnostics, customer behaviour prediction, and risk analysis.
Cloud-Based Data Platforms and Services:
 Observation: Platforms like AWS, Google Cloud, and Microsoft Azure offer
comprehensive machine learning and data analytics tools that simplify data
storage, processing, and model deployment.
 Relevance: Cloud solutions enable scalability, data security, and real-time
processing. Leveraging these platforms allows faster experimentation and
deployment of models without the need for extensive on-premises infrastructure,
making them valuable for remote and collaborative work environments.

GIET Engineering College 31 | P a g e


Heart Disease Detection

Automated Machine Learning (AutoML):


 Observation: AutoML tools, such as Google’s AutoML and Azure ML, are
transforming the way data models are built by automating processes like feature
selection, hyperparameter tuning, and model evaluation.
 Relevance: These tools reduce the complexity and time associated with model
development, making ML accessible to professionals with varying levels of
expertise. They also allow data scientists to focus on refining problem statements
and interpreting results rather than spending excessive time on routine tasks.
Data Visualization Tools:
 Observation: Technologies like Tableau, Power BI, and Python libraries such as
Matplotlib and Seaborn have advanced in interactive and user-friendly ways,
supporting more sophisticated data storytelling.
 Relevance: These tools provide essential capabilities for communicating insights
clearly and effectively to stakeholders, which is especially important in client
presentations and decision-making processes.
Big Data Technologies and Frameworks:
 Observation: Tools like Apache Hadoop, Spark, and Kafka continue to evolve,
offering better performance and flexibility for processing large-scale data in
distributed environments.
 Relevance: These frameworks allow organizations to handle vast amounts of
unstructured and structured data efficiently, which is crucial for real-time
analytics, trend analysis, and high-velocity data applications.
Natural Language Processing (NLP) and Transformers:
 Observation: The development of transformer models like BERT, GPT, and T5
has revolutionized NLP tasks such as text summarization, sentiment analysis,
and language translation.
 Relevance: These advancements have a profound impact on areas requiring text
analysis, including customer service, content recommendation, and sentiment
analysis for brand management, making data more accessible and actionable.

GIET Engineering College 32 | P a g e


Heart Disease Detection

CHAPTER 6: SOURCE CODE


Choosing Right Algorithm
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler, LabelEncoder
from sklearn.linear_model import LogisticRegression
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score,
roc_auc_score
# Load the dataset
file_path = r'C:\Users\user\Downloads\heart-disease.csv'
df = pd.read_csv(file_path)
# Data Preprocessing
# Encode categorical variables
label_encoder = LabelEncoder()
df['cp'] = label_encoder.fit_transform(df['cp'])
df['thal'] = label_encoder.fit_transform(df['thal'])
df['target'] = label_encoder.fit_transform(df['target'])
# Fill missing values in 'ca' column with the median value
df['ca'].fillna(df['ca'].median(), inplace=True)
# Split the data into features and target variable
X = df.drop('target', axis=1)
y = df['target']
# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Standardize the feature variables
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)

GIET Engineering College 33 | P a g e


Heart Disease Detection

# Define models (excluding KNN)


models = {
'Logistic Regression': LogisticRegression(C=1.0, solver='liblinear', random_state=42),
'Random Forest': RandomForestClassifier(n_estimators=100, random_state=42),
'Decision Tree': DecisionTreeClassifier(random_state=42)
}
# Evaluate each model and store accuracies
accuracies = {}
for name, model in models.items():
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
y_pred_prob = model.predict_proba(X_test)[:, 1] if hasattr(model, 'predict_proba') else None
accuracy = accuracy_score(y_test, y_pred)
accuracies[name] = accuracy
# Get accuracy of Logistic Regression
logistic_regression_accuracy = accuracies['Logistic Regression']
# Remove models with higher accuracy than Logistic Regression
models_to_keep = {name: model for name, model in models.items() if accuracies[name] <=
logistic_regression_accuracy}
# Evaluate and print results for models to keep
print(f"Logistic Regression Accuracy: {logistic_regression_accuracy:.4f}\n")
for name, model in models_to_keep.items():
model.fit(X_train, y_train)
y_pred = model.predict(X_test)
y_pred_prob = model.predict_proba(X_test)[:, 1] if hasattr(model, 'predict_proba') else None
accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
f1 = f1_score(y_test, y_pred)
roc_auc = roc_auc_score(y_test, y_pred_prob) if y_pred_prob is not None else 'N/A'
print(f"\n{name}:")

GIET Engineering College 34 | P a g e


Heart Disease Detection

print(f"Accuracy: {accuracy:.4f}")
print(f"Precision: {precision:.4f}")
print(f"Recall: {recall:.4f}")
print(f"F1 Score: {f1:.4f}")
print(f"ROC AUC: {roc_auc:.4f}" if roc_auc != 'N/A' else "ROC AUC: N/A")

Build the Predictive model


# Function to predict heart disease based on simple thresholding rules
def predict_heart_disease(age, thalach, age_threshold=50, thalach_threshold=140):
return 1 if age > age_threshold and thalach < thalach_threshold else 0
# Function to make a dynamic prediction
def make_prediction(inputs):
return predict_heart_disease(inputs['age'], inputs['thalach'])
# Function to collect input data dynamically
def collect_input_data():
input_data = {}
input_data['age'] = int(input("Enter age: "))
input_data['sex'] = int(input("Enter sex (1 = male, 0 = female): "))
input_data['cp'] = int(input("Enter chest pain type (0-3): "))
input_data['trestbps'] = int(input("Enter resting blood pressure: "))
input_data['chol'] = int(input("Enter cholesterol level: "))
input_data['fbs'] = int(input("Enter fasting blood sugar (1 = > 120 mg/dl, 0 = < 120 mg/dl): "))
input_data['restecg'] = int(input("Enter resting ECG results (0-2): "))
input_data['thalach'] = int(input("Enter maximum heart rate achieved: "))
input_data['exang'] = int(input("Enter exercise induced angina (1 = yes, 0 = no): "))
input_data['oldpeak'] = float(input("Enter oldpeak (depression induced by exercise): "))
input_data['slope'] = int(input("Enter slope of the peak exercise ST segment (0-2): "))
input_data['ca'] = int(input("Enter number of major vessels (0-3): "))
input_data['thal'] = int(input("Enter thalassemia (0 = normal, 1 = fixed defect, 2 = reversible
defect): "))
return input_data

GIET Engineering College 35 | P a g e


Heart Disease Detection

# Collect input data


input_data = collect_input_data()
# Example prediction
prediction = make_prediction(input_data)
print(f"Heart disease prediction (1 = Yes, 0 = No): {prediction}")
# Function to create visualizations
def plot_patient_data(df, chest_pain_distribution):
# Ensure no infinite values exist
df.replace([np.inf, -np.inf], np.nan, inplace=True)
fig, axs = plt.subplots(2, 2, figsize=(15, 12))
# Histogram of age distribution
sns.histplot(df['age'], bins=30, kde=True, ax=axs[0, 0])
axs[0, 0].set_title('Age Distribution of Patients')

# Bar chart of chest pain types


chest_pain_distribution.plot(kind='bar', ax=axs[0, 1])
axs[0, 1].set_title('Distribution of Chest Pain Types')
# Scatter plot of thalach vs. age
sns.scatterplot(x='age', y='thalach', data=df, ax=axs[1, 0])
axs[1, 0].set_title('Relationship between Thalach and Age')
# Box plot of age distribution with/without heart disease
sns.boxplot(x='target_1', y='age', data=df, ax=axs[1, 1])
axs[1, 1].set_title('Age Distribution of Patients with and without Heart Disease')
plt.tight_layout()
plt.show()

GIET Engineering College 36 | P a g e


Heart Disease Detection

Conclusion
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler, LabelEncoder
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score,
roc_auc_score
# Load the dataset
file_path = r'C:\Users\user\Downloads\heart-disease.csv'
df = pd.read_csv(file_path)
# Data Preprocessing
# Encode categorical variables
label_encoder = LabelEncoder()
df['cp'] = label_encoder.fit_transform(df['cp'])
df['thal'] = label_encoder.fit_transform(df['thal'])
df['target'] = label_encoder.fit_transform(df['target'])
# Fill missing values in 'ca' column with the median value
df['ca'].fillna(df['ca'].median(), inplace=True)
# Split the data into features and target variable
X = df.drop('target', axis=1)
y = df['target']
# Split the dataset into training and testing sets
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42)
# Standardize the feature variables
scaler = StandardScaler()
X_train = scaler.fit_transform(X_train)
X_test = scaler.transform(X_test)
# Train the Logistic Regression model
logistic_regression = LogisticRegression(C=1.0, solver='liblinear', random_state=42)
logistic_regression.fit(X_train, y_train)
# Predict on the test set
y_pred = logistic_regression.predict(X_test)

GIET Engineering College 37 | P a g e


Heart Disease Detection

y_pred_prob = logistic_regression.predict_proba(X_test)[:, 1]
# Evaluate the Logistic Regression model
accuracy = accuracy_score(y_test, y_pred)
precision = precision_score(y_test, y_pred)
recall = recall_score(y_test, y_pred)
f1 = f1_score(y_test, y_pred)
roc_auc = roc_auc_score(y_test, y_pred_prob)
# Print evaluation metrics
print(f"Logistic Regression Evaluation Metrics:")
print(f"Accuracy: {accuracy:.4f}")
print(f"Precision: {precision:.4f}")
print(f"Recall: {recall:.4f}")
print(f"F1 Score: {f1:.4f}")
print(f"ROC AUC: {roc_auc:.4f}")

GIET Engineering College 38 | P a g e


Heart Disease Detection

CHAPTER 7: PHOTOS & VIDEO LINKS

Fig. 7.1: A histogram displays the distribution of a continuous variable by grouping data into bins.

GIET Engineering College 39 | P a g e


Heart Disease Detection

Fig. 7.2: A Tree Map displays the selected dimensions and are ordered in hierarchy.

Fig 7.3: A pie chart displays the relative proportions of different Bin Sizes within a whole.

Fig. 7.4: A bar chart is employed to compare the values of different discrete categories.

GIET Engineering College 40 | P a g e


Heart Disease Detection

Fig 7.5: A Tree Map displays the selected dimensions and are ordered in hierarchy.

Fig 7.7: Output of the Program

Fig 7.6: Output of the Program Fig 7.8: Output of the Program

GIET Engineering College 41 | P a g e


Heart Disease Detection

CHAPTER 8
CONCLUSION

The AI, Machine Learning, and Data Science internship with IIDT-APSCHE in
collaboration with Blackbucks marked a pivotal step in my professional and technical growth.
Through rigorous engagement with real-world datasets and advanced algorithms, I developed a
comprehensive understanding of data science processes, from data cleaning and feature
engineering to model building and evaluation. The hands-on application of supervised and
unsupervised machine learning models enabled me to address complex data-driven challenges,
preparing me to implement data science solutions in a practical setting.

This experience underscored the critical role of data-driven insights in decision-making


and allowed me to observe how AI and machine learning methodologies can transform raw data
into valuable information for industry applications. Working within an industry-aligned framework
enhanced my ability to think analytically and adapt to changing project requirements, equipping
me with the problem-solving and project management skills essential for successful work in this
domain.

Ultimately, this internship not only expanded my technical expertise but also fostered a
professional work ethic, communication, and collaboration skills that will be essential throughout
my career. The insights and competencies gained here lay a solid foundation for future projects
and reflect the immense value of applied learning in AI, Machine Learning, and Data Science.

GIET Engineering College 42 | P a g e


Heart Disease Detection

CHAPTER 9
REFERENCES

 https://www.youtube.com/watch?v=Y6W2NGxZuhQ&t=5s&ab_channel=TEDxTalks

 https://www.youtube.com/watch?v=nQDFV7D_5CE&feature=youtu.be

 https://www.youtube.com/live/_yRtuIZxtIY

 https://www.youtube.com/live/BYivE50s3l8

 https://www.geeksforgeeks.org/machine-learning-models/

 https://pmc.ncbi.nlm.nih.gov/articles/PMC10378171/

 https://ieeexplore.ieee.org/document/9734880

GIET Engineering College 43 | P a g e


Heart Disease Detection

Student Self Evaluation of the Short-Term Internship

Student Name: Registration No.:

Term of Internship: From: To:


Date of Evaluation:

Organization Name & Address:

Please rate your performance in the following areas:

Rating Scale: Letter grade of CGPA calculation to be provided

1 Oral communication 1 2 3 4 5
2 Written communication 1 2 3 4 5
3 Proactiveness 1 2 3 4 5
4 Interaction ability with community 1 2 3 4 5
5 Positive Attitude 1 2 3 4 5
6 Self-confidence 1 2 3 4 5
7 Ability to learn 1 2 3 4 5
8 Work Plan and organization 1 2 3 4 5
9 Professionalism 1 2 3 4 5
10 Creativity 1 2 3 4 5
11 Quality of work done 1 2 3 4 5
12 Time Management 1 2 3 4 5
13 Understanding the Community 1 2 3 4 5
14 Achievement of Desired Outcomes 1 2 3 4 5
15 OVERALL PERFORMANCE 1 2 3 4 5

Date: Signature of the Student

GIET Engineering College 44 | P a g e


Heart Disease Detection

Evaluation by the Supervisor of the Intern Organization

Student Name: Registration No.:

Term of Internship: From: To:


Date of Evaluation:

Organization Name & Address:

Name & Address of the Supervisor


with Mobile Number

please rate the student’s performance in the following areas:

Please note that your evaluation shall be done independent of the Student’s self-
evaluation

Rating Scale: 1 is lowest and 5 is highest rank

1 Oral communication 1 2 3 4 5
2 Written communication 1 2 3 4 5
3 Proactiveness 1 2 3 4 5
4 Interaction ability with community 1 2 3 4 5
5 Positive Attitude 1 2 3 4 5
6 Self-confidence 1 2 3 4 5
7 Ability to learn 1 2 3 4 5
8 Work Plan and organization 1 2 3 4 5
9 Professionalism 1 2 3 4 5
10 Creativity 1 2 3 4 5
11 Quality of work done 1 2 3 4 5
12 Time Management 1 2 3 4 5
13 Understanding the Community 1 2 3 4 5
14 Achievement of Desired Outcomes 1 2 3 4 5
15 OVERALL PERFORMANCE 1 2 3 4 5

Date: Signature of the Supervisor

GIET Engineering College 45 | P a g e


Heart Disease Detection

INTERNAL ASSESMENT STATEMENT

Name of the Student:


Programme of Study:
Year of Study:
Group:
Register No /H.T. No:
Name of the College:
University:

Sl.No Evaluation Criterion Maximum Marks


Marks Awarded
1. Activity Log 10
2. Internship Evaluation 30
3. Oral Presentation 10
GRAND TOTAL 50

Date: Signature of the Faculty Guide

GIET Engineering College 46 | P a g e


Heart Disease Detection

EXTERNAL ASSESSMENT STATEMENT


Name of the Student:
Programme of Study:
Year of Study:
Group:
Register No /H.T. No:
Name of the College:
University:

Sl.No Evaluation Criterion Maximum Marks


Marks Awarded
1. Activity Log 10
2. Internship Evaluation 30
3. Oral Presentation 10
GRAND TOTAL 50

Signature of the Faculty Guide Signature of the Internal Expert

Signature of the External Expert

GIET Engineering College 47 | P a g e


Heart Disease Detection

GIET Engineering College 48 | P a g e

You might also like