Intership Project Report
Intership Project Report
Analytics Dashboard
Industrial Training
PROJECT REPORT
Submitted by
Hemang Joshi (22BCS13785)
BACHELOR OF ENGINEERING
IN
Chandigarh University
MAY 2024
1
CERTIFICATE
Certified that this project report “Credit Transactions and Customer Analytics
Dashboard” is the training w o rk of “HEMANG JOSHI” at “Solitaire
Infosystems Pvt. Ltd” who carried out the project work under the supervision of
Mr. Praduman kr. from to .
SIGNATURE SIGNATURE
2
ACKNOWLEDGEMENT
The successful completion of this project marks the beginning of an ever-ongoing learning
experience in converting ideas and concepts into practical systems. This project was a
significant learning journey for me at every step, boosting my confidence to work in a
professional setup. I believe the knowledge gained during this project will lead to bright
prospects in the future.
First and foremost, I would like to extend my heartfelt gratitude to Mr. Praduman Kr.,
the Python industrial trainer, for giving me the invaluable opportunity to work in this
esteemed organization. This experience not only increased my awareness of the latest
technological advancements but also emphasized the importance of teamwork and
collaboration. Again with a deep sense of gratitude, I express my sincere thanks to Mr.
Praduman Kr. for his unwavering support and continuous guidance, without which it
would have been challenging to complete this project. His insights, encouragement, and
constructive feedback have been instrumental in shaping the project and my overall
learning experience.
I also extend my gratitude to the entire staff and faculty at Solitaire Infosystems Pvt. Ltd.
Their keen interest in my project and valuable suggestions were crucial in guiding me
through various challenges. Their collective efforts have significantly contributed to the
project's success.
Additionally, I am grateful to my peers and fellow trainees for their camaraderie and the
collaborative spirit that enriched my learning experience. The group discussions,
brainstorming sessions, and peer reviews were invaluable in refining the project and
broadening my understanding.
Lastly, I express my sincere appreciation to everyone who directly or indirectly contributed
to the successful completion of this project. The guidance and encouragement I received
have not only helped me complete this project but have also set a strong foundation for my
future professional growth.
3
Profile of Organization
VISSION: To become the most trusted and respected IT service provider globally,
delivering vibrant, dynamic, and value-based solutions centered on clients, team, and
international standards. We aim to be a reliable partner, act ethically, acknowledge
mistakes, and make necessary changes.
SERVICES
CONTACT INFORMATION
Phone: +91-987-665-6700
Email: [email protected]
Website: https://solitaireinfosystems.com
4
TABLE OF CONTENT
List of Figures…………………………………………………………………………
List of Tables…………………………………………………………………………..
Abstract……………………………………………………………………………..
Chapter 1. INTRODUCTION………………………………………………………
1.1. Client Identification/Need Identification /Relevant Contemporary Issue
1.2. Identification of Problem
1.3. Identification of Tasks
1.4. Timeline
1.5. Organization of Report
Chapter 2. LITERATURE REVIEW……………………………………………….
2.1 Timeline of the reported problem
2.2 Proposed solutions
2.3 Bibliometric analysis
2.4 Review Summary
2.5 Problem Definition
2.6 Goals/Objectives
Chapter 3. DESIGN FLOW/PROCESS…………………………………………….
3.1. Evaluation & Selection of Features
3.2. Design Constraints
3.3. Analysis and features subject to constraints
3.4. Design Flow
3.5. Design Selection
3.6 Implementation plan/methodology
Chapter 4. RESULTS AND VALIDATION………………………………………..
4.1. Implementation of solution
Chapter 5. CONCLUTION AND FUTURE WORK……………………………….
5.1 & 5.2. Conclusion and Future Work
5
List of Figures
6
List of Tables
7
ABSTRACT
The goal of this project, which is being carried out by Solitaire Infosystems Pvt. Ltd. as part of a
six-week industrial training program, is to create interactive and thorough dashboards for credit
card transactions and consumer behavior analysis. The project provides a comprehensive solution
for financial data analysis by combining DAX queries for dynamic data calculation, Power BI for
enhanced data visualization, MySQL for reliable database administration, and Python for further
data processing.
The Credit Card Transaction Report dashboard offers comprehensive breakdowns by card
category, expense kind, education level, job type, and transaction mode in addition to showcasing
important performance metrics like total income, interest earned, transaction amounts, and
transaction counts. It also includes interactive filters based on gender, card type, and transaction
methods (swipe, chip, and online), as well as quarterly revenue trends.
With an emphasis on customer demographics, the Credit Card Customer Report provides
information on revenue distribution by age group, state, marital status, income level, dependency
status, and educational attainment.
Additionally, it offers a thorough study of customer jobs and their financial contributions, tracking
weekly revenue trends by gender. By providing a thorough understanding of transaction patterns,
client behaviors, and demographic trends, these dashboards help financial institutions make data-
driven decisions and develop strategic plans. The project demonstrates the benefits of data
visualization in improving financial analysis and operational efficiency, underscoring the practical
application of merging numerous technologies to address real-world business difficulties.
The industrial training at Solitaire Infosystems Pvt. Ltd. sharpened skills in database
administration, data visualization, and analytical processing, equipping participants with a robust
toolkit for financial analysis and corporate intelligence.
8
CHAPTER 1.
INTRODUCTION
1.1. Client Identification/Need Identification/Identification of relevant
Contemporary issue
Client Identification: The client base for this project primarily consists of financial
institutions ranging from banks to fintech companies involved in credit card transaction
processing and consumer financial services. These entities handle vast amounts of
transactional data daily and require sophisticated analytical tools to extract meaningful
insights.
Financial institutions leverage such tools not only to understand transaction patterns and
customer behavior but also to enhance operational efficiency, mitigate risks, and drive
strategic decision-making.
Need Identification: Financial institutions have identified several critical needs that drive
their requirement for advanced analytics and visualization tools:
A. Comprehensive Data Analysis: There is a need for in-depth analysis of credit card
transaction data across various dimensions such as card categories, expense types,
demographic segments (education level, job type), and transaction modes (swipe, chip,
online). This analysis helps in understanding spending patterns, identifying trends, and
predicting future behaviors.
B. Key Performance Metrics: Metrics like total income generated, interest earned,
transaction amounts, and counts are essential for evaluating financial performance and
profitability. These metrics provide insights into revenue streams, cost structures, and
overall financial health.
D. Trend Analysis and Forecasting: Analyzing quarterly revenue trends and other temporal
patterns helps in forecasting future revenues and identifying seasonal variations or market
trends. This capability supports proactive planning and resource allocation.
9
Identification of Relevant Contemporary Issue: In today's financial landscape, several
pressing issues shape the need for advanced data analytics and visualization tools:
A. Comprehensive Data Analysis: There is a need for in-depth analysis of credit card
transaction data across various dimensions such as card categories, expense types,
demographic segments (education level, job type), and transaction modes (swipe, chip,
online). This analysis helps in understanding spending patterns, identifying trends, and
predicting future behaviors.
B. Key Performance Metrics: Metrics like total income generated, interest earned,
transaction amounts, and counts are essential for evaluating financial performance and
profitability. These metrics provide insights into revenue streams, cost structures, and
overall financial health.
D. Trend Analysis and Forecasting: Analyzing quarterly revenue trends and other temporal
patterns helps in forecasting future revenues and identifying seasonal variations or market
trends. This capability supports proactive planning and resource allocation.
The financial services sector is grappling with a myriad of challenges related to the
effective analysis and visualization of vast quantities of credit card transaction data. As
digital transactions continue to rise, the volume of data grows exponentially, making
traditional data management and analysis techniques insufficient. Key problems include:
A. Data Overload:
10
• Redundancy and Noise: Large datasets often contain redundant or
irrelevant information that needs to be filtered out to extract valuable
insights.
• Inconsistent Formats: Different systems may use varied data formats and
structures, necessitating extensive data cleaning and normalization efforts.
C. Real-Time Analysis:
D. Customer Insight:
11
E. Operational Inefficiencies:
1. Requirement Gathering:
• Gather Data from Various Sources: Collect data from multiple sources
such as transaction databases, customer information systems, and external
data feeds.
12
• Ensure Data is Clean, Accurate, and Integrated: Perform data cleaning
to remove duplicates, correct errors, and standardize formats. Integrate the
cleaned data into a central database using ETL (Extract, Transform, Load)
processes.
3. Data Analysis:
4. Dashboard Design:
5. Development:
6. Testing:
13
• Gather Feedback from Users: Conduct user acceptance testing (UAT)
sessions to collect feedback and make necessary adjustments before the
final deployment.
7. Deployment:
By following these detailed tasks, financial institutions can build a robust, scalable, and user-
friendly solution that addresses the complexities of credit card transaction data analysis and
customer behavior insights.
14
1.4. Identification of Tasks
Chapter 1: Introduction
15
Chapter 2: Literature Review/Background Study
Implementation of solution
Testing and validation
Data analysis and interpretation
Conclusion
Future work
16
CHAPTER 2.
LITERATURE REVIEW
• Context: The advent of digital payments and online shopping led to a surge in credit
card transactions. Financial institutions began accumulating vast amounts of data from
these transactions.
• Challenges: This rapid increase in data volume outpaced traditional data management
and analysis techniques, resulting in inefficiencies and inaccuracies.
• Documentation: Industry reports from this period highlight the struggles of banks and
credit card companies in managing and extracting value from the burgeoning data sets.
• Documentation: Regulatory guidelines and case studies from this era document
the shift towards more sophisticated data management practices and the initial
adoption of big data tools.
• Context: The global pandemic drastically accelerated the shift towards digital
transactions, with more consumers opting for online shopping and contactless
payments.
• Challenges: The pandemic highlighted the inadequacies of existing data analysis
tools in handling real-time transaction data and providing actionable insights
quickly.
• Documentation: Numerous industry reports and academic studies from the 2020s
illustrate the surge in digital transactions and the corresponding need for advanced
analytical solutions.
17
2.2. Proposed solutions
Over the years, various solutions have been proposed to tackle the challenges of credit card
transaction data analysis:
Approach: Initially, data was analyzed manually using basic tools like
spreadsheets.
Effectiveness: This approach was only suitable for small datasets and provided
limited insights.
Challenges: It was labor-intensive, error-prone, and not scalable.
Approach: The use of tools like Microsoft Excel became prevalent for data
visualization and basic analysis.
Effectiveness: These tools offered improved visualization capabilities but were
still limited in handling large datasets and performing complex analyses.
Challenges: They lacked advanced analytical features and real-time processing
capabilities.
Standalone BI Tools
Custom-Built Solutions
18
Modern Integrated Platforms
Analyzing previous solutions based on their key features, effectiveness, and drawbacks:
Standalone BI Tools
Custom-Built Solutions
19
Modern Integrated Platforms
The literature review indicates that while past solutions have made strides in addressing
various aspects of credit card transaction data analysis, they often fall short in providing
comprehensive, real-time integration and analysis. This project aims to address these
shortcomings by leveraging advanced technologies such as DAX queries, Power BI,
MySQL, and Python to create an integrated, real-time analytical and visualization platform.
The primary problem facing financial institutions is the inefficiency in analyzing and
visualizing vast amounts of credit card transaction data in real-time. Specific issues
include:
What is to be done:
How it is to be done:
Avoid reliance on manual data processing or basic reporting tools that lack
scalability and real-time capabilities.
20
2.6. Goals/Objective
Data Integration:
Real-Time Analysis:
Comprehensive Dashboards:
Key Metrics:
Identify and visualize key performance metrics that drive business insights.
Customer Insights:
Ensure robust data security and compliance with relevant regulations, such as
PCI DSS and GDPR.
These objectives are designed to be precise, tangible, and measurable, ensuring that the project
remains focused and delivers concrete, actionable outcomes. By leveraging advanced technologies
and integrating various data sources, the project aims to create a powerful platform for financial
analysis and decision-making. The successful implementation of this project will demonstrate the
practical application of modern data analytics and visualization tools in addressing complex real-
world business challenges.
21
CHAPTER 3.
DESIGN FLOW/PROCESS
From the literature review, several features have been identified as crucial for an
effective credit card transaction data analysis solution. These features need to be
critically evaluated to determine their relevance and importance in the final solution.
Key features include:
Real-Time Data Integration: Ensures that the analysis reflects the most
current data by integrating data from various sources in real-time.
Advanced Data Analytics: Utilizes sophisticated analytics tools to derive
insights from large datasets, leveraging technologies like DAX queries and
Python scripts.
Interactive Dashboards: Provides user-friendly dashboards with customizable
views and interactive features, enabling users to filter data based on various
parameters.
Scalability: The system should be scalable to handle increasing amounts of
data and users, ensuring long-term viability.
User Access Control: Features to manage different levels of user access and
permissions, ensuring that data is only accessible to authorized users.
Data Visualization: Advanced visualization techniques to present data in an
intuitive and understandable manner, facilitating easy interpretation of complex
datasets.
After evaluating the features identified in the literature, the following features have
been deemed essential for the solution:
22
3.2. Design Constraints
Regulations:
Data Privacy and Security: Must comply with PCI DSS for data security and
GDPR for data privacy, ensuring the protection of sensitive financial and
personal data.
Economic:
Environmental:
Health:
User Ergonomics: Ensuring the system does not adversely affect the health
of users through ergonomic design and reducing the risk of repetitive strain
injuries. This includes designing intuitive and user-friendly interfaces.
Manufacturability:
Ethical Use of Data: Sensitivity to social and political factors that may affect
data usage, ensuring that the system is used responsibly and ethically. This
involves transparent data handling practices and respecting user consent and
data rights.
Cost Considerations:
23
3.3. Analysis and Feature finalization subject to constraints
Removal of Features:
Modification of Features:
Addition of Features:
Introduce a basic alert system for critical transaction anomalies, providing real-
time notifications for suspicious activities.
Implement cost-effective data storage solutions, ensuring that the system can
handle large datasets without incurring excessive costs.
Data Collection: Gathering data from multiple sources into a centralized database is the
first critical step in ensuring comprehensive and accurate analysis. For this project, data is
collected from Kaggle, a popular platform for datasets, which offers a wealth of publicly
available credit card transaction and customer demographic data. This centralized database
will be structured to handle large volumes of data efficiently, ensuring that all relevant
information is stored in a consistent and accessible format.
24
Key Steps:
Identify Data Sources: Determine the datasets available on Kaggle that are
relevant to the project, such as credit card transaction logs, customer demographics,
and financial data.
Data Extraction: Download the datasets from Kaggle and use ETL (Extract,
Transform, Load) processes to ensure that the data is collected in a consistent
format and is ready for integration into the centralized database.
Data Loading: Load the extracted data into a centralized MySQL database,
ensuring that the database schema is designed to handle large datasets and supports
efficient querying.
Data Processing: Using Python scripts and DAX queries for data preprocessing and
analytics is essential for transforming raw data into actionable insights. Python scripts
will be used to clean, preprocess, and transform the data, ensuring that it is ready for
analysis. DAX (Data Analysis Expressions) queries will be utilized within Power BI
for dynamic and complex data analysis, allowing for the creation of sophisticated data
models.
Key Steps:
25
Key Steps:
User Access: Setting up user access controls and permissions is essential for maintaining
data security and ensuring that sensitive information is only accessible to authorized users.
This involves defining different levels of access based on user roles and responsibilities,
ensuring that each user can only access the data and functionalities relevant to their role.
Key Steps:
Role Definition: Define user roles and permissions based on organizational needs
and data sensitivity.
Access Control Implementation: Implement access control mechanisms within
the centralized database and Power BI, ensuring that only authorized users can
access and modify the data.
User Authentication: Set up authentication processes to verify user identities and
manage user sessions securely.
Deployment: Deploying the system on a local server infrastructure with high storage
capacity ensures that the system is robust, reliable, and capable of handling large volumes
of data. Local deployment provides greater control over the infrastructure, ensuring that
performance, security, and scalability requirements are met.
26
Key Steps:
Infrastructure Setup: Set up local server infrastructure with high storage capacity
and necessary computing resources.
System Installation: Install the required software components, including the
MySQL database server, Python environment, and Power BI service.
Configuration: Configure the system components to ensure seamless integration
and optimal performance. This includes database tuning, network configuration,
and security settings.
Testing and Optimization: Conduct thorough testing to ensure that the system
performs as expected. Optimize system components to improve performance and
efficiency.
By following this design flow, the system will be effectively set up to handle the collection,
processing, and visualization of credit card transaction data from Kaggle, providing valuable
insights and supporting data-driven decision-making.
Comparison of Designs:
Design Selection:
Design 1 is selected due to its straightforward implementation, lower cost, and use of well-known
tools like Power BI and Python, which provide sufficient functionality for the project's needs.
Design 2, while offering advanced capabilities, introduces complexity and higher costs that may
not be justified for this project.
27
3.6. Implementation plan/methodology
The implementation plan for the credit card transaction and customer behavior analysis
project involves several key stages, from data collection to final dashboard presentation.
This plan ensures a structured approach, enabling efficient handling of data and its
transformation into actionable insights.
Implementation Steps
1. Data Collection:
o Source: Gather data from Kaggle.
o Format: Download datasets in CSV format.
o Upload: Import CSV files into MySQL tables stored on a local
machine.
2. Data Processing:
o Scripts: Use Python scripts to clean and preprocess the data. This
includes handling missing values, normalizing data formats, and
ensuring data integrity.
o Transformation: Apply DAX queries within Power BI for further
data transformation and calculation of required metrics.
3. Data Visualization:
o Dashboard Design: Create interactive dashboards in Power BI.
Design various visual components like charts, graphs, and tables to
represent key performance metrics and insights.
o Customization: Implement interactive filters and customizable views
to allow users to explore the data from different perspectives.
4. Documentation:
o Technical Documentation: Prepare comprehensive documentation
detailing the data collection, processing, and visualization steps.
Include information on the data schema, preprocessing scripts, DAX
queries, and dashboard design elements.
o User Guides: Create user guides to help stakeholders navigate the
dashboards and utilize the interactive features effectively.
5. Presentation:
o Preparation: Develop a presentation summarizing the project’s
objectives, methodology, and key findings.
o Delivery: Present the final dashboards and insights to stakeholders,
demonstrating how the system can be used to make data-driven
decisions.
28
CHAPTER 4.
RESULTS ANALYSIS AND VALIDATION
The implementation of the credit card transaction and customer behavior analysis project
leverages modern tools and technologies at various stages to ensure efficient and effective
execution. Below are the details of the implementation across different facets of the project:
Analysis
Data Cleaning and Preprocessing: Python scripts were employed to clean and
preprocess the raw data collected from Kaggle. This included handling missing
values, normalizing data formats, and removing any inconsistencies.
Data Transformation: DAX queries in Power BI were utilized to further transform
and calculate necessary metrics from the processed data. This included creating
calculated columns and measures to support insightful visualizations.
Dashboard Design: The design of the dashboards was carried out in Power BI. The
tool's capabilities allowed for creation interactive and visually appealing
dashboards that provide critical insights into credit card transactions and customer
behaviors.
29
Visual Components: The dashboards included various charts, graphs, and tables
to present data such as revenue, transaction amounts, customer demographics, and
spending patterns.
Figure 5: Dashboard
Report Preparation
The successful implementation of the project relied on a combination of modern tools and
technologies at each stage, from data analysis and preprocessing to dashboard creation and
presentation. By utilizing these tools effectively, the project was able to deliver valuable insights
into credit card transactions and customer behavior.
30
Figure 6: Final Dashboard design
31
CHAPTER 5.
CONCLUSION AND FUTURE WORK
5.1. Conclusion
The implementation of the credit card transaction data analysis and visualization platform
using MySQL and Power BI has yielded promising results. The expected outcome was to
create an efficient, real-time data integration and visualization system that provides deep
insights into transaction patterns and customer behaviors.
Expected Results/Outcome:
Minor Performance Lag: There was a slight delay in real-time data processing,
especially when handling exceptionally large datasets. This was due to the limited
computational resources available on the local setup.
Complexity in Data Transformation: Some advanced data transformation tasks
required more complex DAX queries, leading to a longer development time than
initially anticipated.
The performance lag was primarily due to the limitations of local hardware
resources. A more powerful setup or optimization of existing resources could
mitigate this issue.
The complexity in data transformation arose from the intricate nature of the data
relationships and the need for precise calculations to ensure accurate insights.
32
5.2. Future Work
Way Ahead:
Change in Approach:
By addressing the current limitations and incorporating these future enhancements, the platform
can become a more robust, scalable, and user-friendly solution for analyzing and visualizing credit
card transaction data.
33