Big Data With ML
Big Data With ML
Submitted by-
1. Amit Acharjee 202010007004
2.Anandam Paul 202010007005
3.Sourajit Deb 202010007043
4.Tusharadri Paul 202010007049
5.Kukipriya Kutum 202010007028
Figure 1:
Relation of big data
4
Importance Of Machine Learning In Big Data
Pattern recognition: Machine learning algorithms can identify patterns, trends, and
correlations within vast amounts of data that would be difficult or impossible for humans to
recognize manually. Eg: SVM.
Predictive analytics: Machine learning models can analyze historical data to predict future
outcomes, helping businesses anticipate trends, customer behavior, and market changes. Eg:
gradient boosting machines (GBM).
Anomaly detection: Machine learning techniques can automatically detect unusual patterns
or anomalies in big data, such as fraudulent transactions, network intrusions, or equipment
failures. Eg: isolation forest.
IMPORTANCE OF MACHINE 5
Figure 3:
Big Data and
Healthcare
Case Study: Healthcare Analytics (contd..) 9
Applications:
Predictive Analytics for Disease Prevention: It identifies patterns and risk factors,
allowing healthcare providers to proactively intervene and implement preventive measures.
Personalized Medicine: By analyzing genetic, clinical, and lifestyle data, personalized
medicine ensures that treatment plans are optimized for the unique characteristics of each
patient, leading to more effective outcomes with fewer side effects.
Challenges:
Data Privacy and Security
Interoperability
Benefits:
Improved Patient Outcomes
Cost Reduction
Case Study: Supply Chain Optimization 10
ML can be used in analyzing the big data of supply chain for demand forecasting and managing inventory
accordingly.
It also includes using the transportation data like shortest routes , traffic, transport mode to optimize cost.
Suppliers performance can also be measured and select accordingly to minimize supply disruptions.
Figure 4:
Uses of Big data and ML in
supply chain optimization
Case Study: Supply Chain Optimization 11
(contd..)
ML can also provide insights of market from big data which will support decision
making.
ML can also be used to warehouse layout data and optimize space, time and labor
cost.
And in the end ML can also process customers reviews, product quality to ensure
customer satisfaction and quality control
Case Study : Human Resource & Talent Management 12
Human Resource and Talent Management involves leveraging data analytics
to optimize the acquisition, development, and retention of talent within an organization.
It encompasses a strategic approach to aligning the workforce with business goals and maximizing individual
and collective performance.
Applications:
Recruitment Analytics: Utilizing data analytics in recruitment processes to identify and attract top talent.
Figure 5:
HR & Talent Management
Case Study : Human Resource & Talent Management(Cntd..)
Employee Engagement Analysis: 13
Assessing employee engagement through surveys, feedback, and performance metrics.
Data analytics helps identify factors influencing engagement, enabling proactive interventions
to enhance workplace satisfaction.
Predictive analytics can identify employees at risk of leaving, allowing for targeted retention
efforts.
Challenges:
Data Quality and Accuracy
Ethical Considerations
Benefits:
Informed Decision-Making
Strategic Workforce Planning
Challenges and Considerations 14
Output generation rate is slower than big data streaming rate as input in ML
models creating high latency.
Difficulties in finding skilled experts needed for operating such complex models.
Training the model with large complex data is also one of the major issues.
The organizations using ML for big data analytics will have to comply with the
security regulations of government to avoid legal risks.
Future Trends
Automated Machine Learning (AutoML): Increasing adoption of AutoML for simplifying and
16
automating the machine learning model development process, making it more accessible for non-experts.
Explainable AI (XAI): Growing emphasis on Explainable AI to make machine learning models more
interpretable, transparent, and understandable for improved trust and accountability.
Edge Computing Integration: Greater integration of machine learning models with edge computing,
enabling real-time analytics and reducing the need for centralized processing.
Federated Learning: Rise of federated learning, allowing models to be trained across decentralized
devices without sharing raw data, enhancing privacy and scalability.
AI-Driven Automation for Data Management: Increasing use of AI-driven automation in data
management tasks, including data cleaning, integration, and governance, to enhance efficiency and reduce
manual efforts.
Conclusion 17
Using Machine learning in Big data also comes with fair share of
challenges.
2. https://www.geeksforgeeks.org/difference-between-big-data-and-machi
ne-learning/
3. https://chat.openai.com/
19
THANK YOU