Indumathy Sandrasekaran: Data Scientist
Data Scientist
VA +1 601-207-2158 [email protected]
Summary
• Data analytics graduate with 4 years of experience manipulating and analyzing large data sets for
customer insights and 7 years of total IT experience in highly quantitative environments such as the
pharmaceutical and automotive industries.
• Proficient in data mining and in managing projects that apply advanced analytics and machine learning.
• Experience in transforming raw data into actionable strategic knowledge that gives insight into business
processes, helping businesses make decisions and run efficiently.
• Extensive programming skills in analytical programming languages such as R and Python.
• Excellent knowledge of creating databases, tables, stored procedures, DDL/DML triggers, views,
user-defined data types, functions, cursors, and indexes.
• Skilled in machine learning algorithms such as Linear and Logistic Regression, Random Forest, Naive Bayes,
Singular Value Decomposition, K-Means, K-nearest neighbors, and Neural Networks.
• Expert in Visualization and dashboards using Tableau, R Shiny, ggplot2, matplotlib, and seaborn.
• Proficient in Natural Language Processing (NLP) and text analytics using R and Python (NLTK, Gensim).
• Strong communication and presentation skills, demonstrated in past assignments with developers, project
managers, subject-matter experts, stakeholders, system implementers, and application end-users.
• Good problem solving, reporting, and statistical skills with excellent communication and interpersonal skills.
Education
Master’s in Decision Analytics | GPA: 4.0
Post Baccalaureate of Data Science | GPA: 3.75
Virginia Commonwealth University, VA
Skills
Methodology: SDLC, Agile, Waterfall
Languages: Python, R, SQL
Statistics & Machine Learning: Classification, Regression, Decision Trees, Random Forests, Naive
Bayes, KNN, K-means
Packages: ggplot2, Scikit-learn, NumPy, Pandas, NLTK, PySpark, Beautiful Soup,
Matplotlib, Seaborn, SciPy
Visualizations: Tableau, MS Excel, Power BI
Cloud Based Technologies: AWS
Databases: MySQL, SQL Server, MS Access
Version Control & Other Tools: GitHub, MS Office, OLAP, OLTP
Experience
Alpha Recon | Jul 2021 – Current | Data Scientist
• Collect data for analyzing business results or creating and managing new studies.
• Correlate similar data to find actionable results.
• Assist with data cleaning, processing, and characterization.
• Assist with statistical models, analytics, and other visualizations of data.
Mphasis (HP Subsidiary), Chennai, India | February 2013 – August 2016 | Lead Application Engineer
• Performed statistical analysis of daily and monthly commodities data.
• Applied linear regression models to monitor sales channels.
• Developed a customer segmentation algorithm in R, leading to a 22% increase in market share.
• Analyzed data to derive patterns and other useful insights.
• Created interfaces in Python to track backups.
• Built a fuzzy-matching algorithm using k-nearest neighbors to identify non-exact duplicate records.
• Resolved system test and validation problems to restore normal program functioning.
CSS Corp Pvt. Ltd., Chennai, India | November 2009 – December 2011 | Database Support Engineer
• Installed, configured, set up, and maintained Microsoft SQL Server databases.
• Set up and controlled user profiles and access levels for each database segment to protect important data.
• Carried out performance tuning and query optimization.
• Took part in the on-call support rotation for a 24/7 production datacenter environment as required.
• Performed Incident, Change, and Problem Management.
Capstone Project:
Liberty Mutual Insurance, VA | May 2021 – Jun 2021
• Took part in every phase of the data science project life cycle, including data extraction, data
cleaning, statistical modelling, and data visualization, working with large sets of structured and
unstructured data.
• Built and presented dashboards to stakeholders using Tableau and ggplot2, applying various
plotting techniques.
• Used Pandas, NumPy, Seaborn, Keras, SciPy, Matplotlib, and Scikit-learn in Python to develop various
machine learning models.
Elder Research, VA | Capstone Project | Jan 2021 – Apr 2021
• Researched, with Elder Research, machine learning methods for predicting or analyzing Alzheimer's disease
(AD) over the next 5 years, using the ADNI dataset.
• The project is part of the SBIR grant, and the prediction supports Alzheimer's research by helping to
diagnose patients in advance.
• Used a Random Forest classifier and displayed the results on a Dash dashboard, with predictions encoded as
2 = Dementia, 1 = Mild Cognitive Impairment, 0 = Cognitively Normal.
• Applied the all-pairs technique, evaluated the LB1 and LB2 datasets, selected the best classifiers using the
K-best algorithm, and built a model whose best prediction accuracy was around 68%.