Contents UNIT 42
Contents UNIT 42
[Document title]
[Document subtitle]
Moorche
Contents
Statistics:.....................................................................................................................................................2
Types of statistics:.......................................................................................................................................2
Types of descriptive statistics:....................................................................................................................2
Examples of descriptive statistic:...............................................................................................................4
Types of Inferential statistics:.....................................................................................................................4
Exploratory Data Analysis:..........................................................................................................................5
Confirmatory Data Analysis:.......................................................................................................................6
Mean, Median, and Mode:.........................................................................................................................7
TYPES AND SOURCES OF DATA...................................................................................................................7
TYPES OF DATA...........................................................................................................................................7
Qualitative and Quantitative Raw Data Analysis:......................................................................................8
Key difference between qualitative and quantitative data.......................................................................9
Applications of statistics in business........................................................................................................10
Deductive and inductive approaches:.......................................................................................................11
Economical survey of Pakistan 2023..........................................................................................................12
Altas Honda:..............................................................................................................................................13
Estimated worldwide motor vehicle production from 2000 to 2023(in million vehicles)......................17
Statistics:
Statistics is a powerful tool used to make sense of data in various fields such as science, business,
economics, social sciences, and more. It provides methods for collecting, organizing, analyzing,
interpreting, and presenting data, enabling us to draw meaningful conclusions and make informed
decisions.
At its core, statistics involves the study of variability and uncertainty. It allows us to quantify uncertainty
through measures such as probabilities, confidence intervals, and hypothesis testing. By understanding
and managing uncertainty, we can mitigate risks and optimize outcomes in diverse scenarios.
One of the fundamental concepts in statistics is the distinction between descriptive and inferential
statistics. Descriptive statistics involve summarizing and presenting data, often through measures like
averages, percentages, and graphical representations. Inferential statistics, on the other hand, enable us
to draw conclusions or make predictions about a population based on a sample of data.
Statistics encompasses a wide range of techniques, including probability theory, hypothesis testing,
regression analysis, and experimental design. These techniques are applied in various fields to analyze
data, test hypotheses, model relationships, and make predictions.
In today's data-driven world, the importance of statistics cannot be overstated. From guiding policy
decisions to optimizing business strategies, statistics plays a crucial role in informing choices and driving
progress. A solid understanding of statistical principles empowers individuals to critically evaluate data,
identify patterns, and extract meaningful insights, making it an indispensable tool in both academic and
practical contexts.
Types of statistics:
The two main types of statistics are descriptive statistics and inferential statistics.
1. Descriptive Statistics: Descriptive statistics involve methods for summarizing and describing the
characteristics of a dataset. These methods provide a concise overview of the data, including
measures such as central tendency (e.g., mean, median, mode) and measures of variability (e.g.,
range, variance, standard deviation). Descriptive statistics also include graphical representations
like histograms, bar charts, and pie charts, which visually present the data distribution.
2. Inferential Statistics: Inferential statistics involve using sample data to make inferences or draw
conclusions about a larger population. These methods allow researchers to generalize findings
from a sample to the population from which the sample was drawn. Inferential statistics include
techniques such as hypothesis testing, confidence intervals, regression analysis, and analysis of
variance (ANOVA). These methods help assess relationships between variables, test hypotheses,
and make predictions based on the data.
Mean: The mean, or average, is the sum of all values in a dataset divided by the total
number of observations. It provides a measure of the central location of the data.
Median: The median is the middle value in a sorted dataset. It divides the data into two
equal halves, with half of the observations lying above and half below.
Mode: The mode is the value that appears most frequently in a dataset. It represents
the most common value or category in the data.
Range: The range is the difference between the maximum and minimum values in a
dataset. It provides a simple measure of the spread or variability of the data.
Variance: The variance measures the average squared deviation of each data point from
the mean. It quantifies the spread of the data around the mean.
Standard Deviation: The standard deviation is the square root of the variance. It
provides a measure of the average distance of data points from the mean, with higher
values indicating greater variability.
Skewness: Skewness measures the asymmetry of the distribution of data around its
mean. Positive skewness indicates a longer right tail, while negative skewness indicates
a longer left tail.
Percentiles: Percentiles divide a dataset into 100 equal parts. For example, the 25th
percentile (also known as the first quartile) is the value below which 25% of the
observations fall.
Quartiles: Quartiles divide a dataset into four equal parts. The first quartile (Q1)
represents the 25th percentile, the second quartile (Q2) represents the median, and the
third quartile (Q3) represents the 75th percentile.
5. Graphical Representations:
Mean: The average exam score is calculated by summing up all individual scores and
dividing by the total number of students.
Median: The middle score when all scores are arranged in ascending order.
Variance: The average of the squared differences between each score and the mean.
Standard Deviation: The square root of the variance, indicating the average distance of
scores from the mean.
25th Percentile (Q1): The score below which 25% of the students scored.
50th Percentile (Median, Q2): The score below which 50% of the students scored.
75th Percentile (Q3): The score below which 75% of the students scored.
5. Graphical Representations:
Histogram: A bar graph showing the frequency distribution of scores within predefined
intervals.
2. Confidence Intervals: Confidence intervals provide a range of values within which a population
parameter, such as a mean or proportion, is likely to lie with a certain level of confidence. The
confidence level represents the probability that the interval contains the true population
parameter.
4. Analysis of Variance (ANOVA): ANOVA is a statistical method used to compare means across two
or more groups to determine whether there are significant differences between them. It helps to
identify which groups differ from each other and to what extent.
5. Chi-Square Test: The chi-square test is a statistical test used to determine whether there is a
significant association between two categorical variables. It compares the observed frequencies
of the categories with the expected frequencies under the null hypothesis of independence.
6. T-Tests: T-tests are statistical tests used to compare the means of two groups and determine
whether there is a significant difference between them. There are different types of t-tests,
including independent samples t-test, paired samples t-test, and one-sample t-test.
7. Nonparametric Tests: Nonparametric tests are statistical tests that do not make assumptions
about the distribution of the data. These tests are used when the data do not meet the
assumptions of parametric tests, such as normality or homogeneity of variance. Examples include
the Wilcoxon signed-rank test, Mann-Whitney U test, and Kruskal-Wallis test.
Advantages
Flexible ways to generate hypotheses.
More realistic statements of accuracy
Does not require more than data can support.
Promotes deeper understanding of processes.
Statistical learning
Disadvantages
Usually does not provide definitive answers.
Difficult to avoid optimistic bias produced by overfitting.
Requires judgement and artistry - can't be cook booked.
Advantages
Disadvantages
1. Internal sources
When data is collected from reports and records of the organization itself,
they are known as the internal sources.
For example, a company publishes its annual report’ on profit and loss, total
sales, loans, wages, etc.
2. External sources
When data is collected from sources outside the organization, they are known as the
2. external sources. For example, if a tour and travel company obtain information on
3. Karnataka tourism from Karnataka Transport Corporation, it would be known as an
4. external source of data.
TYPES OF DATA
A) Primary data
Primary data means first-hand information collected by an investigator.
It is collected for the first time.
It is original and more reliable.
For example, the population census conducted by the government of India after
every ten years is primary data.
B) Secondary data
4. Telephonic interview
5. Mailed questionnaire.
Qualitative data analysis involves the examination and interpretation of non-numeric data to identify
patterns, themes, and insights. This type of analysis is often used in social sciences, humanities, and
qualitative research studies. In qualitative raw data analysis, researchers immerse themselves in the
data, often through methods such as coding, categorizing, and thematic analysis. Coding involves
systematically labeling and organizing segments of text or other qualitative data based on recurring
themes or concepts. Categorizing involves grouping similar codes or themes together to identify broader
patterns or trends within the data. Thematic analysis involves identifying and interpreting underlying
themes or patterns that emerge from the data. Throughout the qualitative data analysis process,
researchers aim to uncover the richness and complexity of the data, gaining deeper insights into
participants' experiences, perspectives, and behaviors.
Quantitative data analysis involves the examination and interpretation of numerical data to understand
relationships, patterns, and trends. This type of analysis is commonly used in scientific research,
economics, and business analytics. In quantitative raw data analysis, researchers use statistical methods
to analyze the data and draw conclusions based on numerical evidence. This may involve calculating
descriptive statistics such as means, medians, and standard deviations to summarize the data's central
tendency and variability. It may also involve conducting inferential statistics such as hypothesis testing
and regression analysis to test hypotheses, make predictions, or infer relationships between variables.
Throughout the quantitative data analysis process, researchers aim to quantify and measure
phenomena, allowing for rigorous and replicable findings that contribute to scientific knowledge and
decision-making processes.
5. The reasoning used to synthesize data in qualitative research is inductive whereas in the case of
quantitative research the reasoning is deductive.
6. Qualitative research is based on purposive sampling, where a small sample size is selected with a view
to get a thorough understanding of the target concept. On the other hand, quantitative research relies
on random sampling; wherein a large representative sample is chosen in order to extrapolate the results
to the whole population.
7. Verbal data are collected in qualitative research. Conversely, in quantitative research measurable data
is gathered.
8. Inquiry in qualitative research is a process-oriented, which is not in the case of quantitative research.
9. Elements used in the analysis of qualitative research are words, pictures, and objects while that of
quantitative research is numerical data.
10. Qualitative Research is conducted with the aim of exploring and discovering ideas used in the
ongoing processes. As opposed to quantitative research the purpose is to examine cause and effect
relationship between variables.
11. Lastly, the methods used in qualitative research are in-depth interviews, focus groups, etc. In
contrast, the methods of conducting quantitative research are structured interviews and observations.
12. Qualitative Research develops the initial understanding whereas quantitative research recommends
a final course of action.
Applications of statistics in business
Statistics plays a crucial role in various aspects of business operations, decision-making, and strategy
development. Here are several key applications of statistics in business:
1. Market Research and Customer Analysis: Businesses use statistical techniques to collect and
analyze data on consumer preferences, demographics, buying behavior, and market trends. This
information helps businesses identify target markets, develop marketing strategies, and tailor
products and services to meet customer needs effectively.
2. Sales Forecasting and Demand Planning: Statistical methods such as time series analysis,
regression analysis, and predictive modeling are used to forecast future sales, demand for
products, and inventory requirements. Accurate sales forecasts enable businesses to optimize
production, manage inventory levels, and plan resources efficiently.
3. Quality Control and Process Improvement: Statistical process control (SPC) techniques are used
to monitor and improve the quality of products and processes. Methods such as control charts,
Pareto analysis, and Six Sigma help businesses identify and reduce defects, minimize variation,
and enhance overall quality and efficiency.
4. Financial Analysis and Risk Management: Statistics is applied in financial analysis to analyze
financial data, assess performance, and make investment decisions. Techniques such as ratio
analysis, regression analysis, and risk modeling help businesses evaluate profitability, manage
financial risks, and make informed investment decisions.
7. A/B Testing and Experimentation: Statistical hypothesis testing and experimentation techniques
are used to evaluate the effectiveness of marketing campaigns, website designs, pricing
strategies, and product features. A/B testing allows businesses to compare different versions of
a marketing asset or product and determine which performs better based on statistical
significance.
1. Deductive Approach:
Definition: The deductive approach starts with a general theory or hypothesis and then
tests it against specific observations or evidence. It involves moving from the general to
the specific, using logical reasoning to draw conclusions.
Application in Business: In the business context, the deductive approach often begins
with an existing theory or framework derived from prior research, established
principles, or theoretical models. Researchers formulate specific hypotheses based on
these theories and then collect data to test the validity of these hypotheses. For
example, a researcher might start with the theory that employee satisfaction leads to
higher productivity. They would then develop specific hypotheses, such as "Employees
who receive regular feedback from supervisors are more satisfied," and conduct surveys
or experiments to test these hypotheses.
2. Inductive Approach:
Application in Business: In the business context, the inductive approach often begins
with collecting and analyzing data from real-world observations or experiences.
Researchers look for patterns, themes, or trends in the data and use these to develop
new theories or concepts. For example, a company might analyze customer feedback
data and observe recurring complaints about a particular product feature. Based on
these observations, they may develop a new theory about customer preferences or
needs, leading to the development of improved product designs or marketing strategies.
Comparison:
Deductive Approach:
Inductive Approach:
The provided data outlines the composition of GDP (at current prices) using the expenditure approach
for the fiscal years 2021, 2022, and 2023. The table presents various components of GDP, including
household consumption, NPISH (Non-Profit Institutions Serving Households) consumption, general
government consumption, gross fixed investment, exports, and imports, among others.
Over the three fiscal years, there's a noticeable increase in most components of GDP. Household
consumption remains the largest contributor to GDP, followed by gross fixed investment and exports.
The growth rates for these components are significant, indicating a robust economy during this period.
However, imports also experience substantial growth rates, leading to a negative impact on net exports.
Overall, aggregate demand, which encompasses consumption, investment, and exports, shows steady
growth across the years. Domestic demand, representing consumption and investment, also follows a
similar trend. However, the negative contribution of net exports suggests a trade deficit during these
years. Despite this, GDP experiences notable growth rates, indicating a strong and expanding economy
throughout the period covered by the survey.
Altas Honda:
The firm implemented a number of initiatives to reduce waste creation and resource utilisation at its
manufacturing facilities. One of the activities was the use of reusable vinyl sheets instead of non-
recyclable bubble plastic sheets in packaging, which reduced waste output by approximately 1.3 tonnes
per year. Paper and cardboard waste reduced by 135 tonnes.
Recycled aluminium waste from the company's manufacturing processes amounted to 492 tonnes,
accounting for 5% of total nonferrous casting materials used. This shows the firm's commitment to
responsible resource management.
The company also focused on water conservation. Employees were encouraged to conserve water and
use water-efficient technology and equipment. The company provided clean drinking water and sanitary
facilities at all workers and emphasised the importance of sustainable water management.
Monitoring water usage and outflow found a 3% decrease in intensity per vehicle over the previous
year, totaling 713,878 cubic metres. 107,082 cubic metres (15%) of manufacturing water were cycled,
processed, and securely discharged into the environment.
The firm constructed two wastewater treatment facilities at its industrial sites to efficiently manage
wastewater. The collected wastewater was safely dumped into the sewer system after being thoroughly
cleaned of hazardous substances. The organisation followed regulations and maintained environmental
quality requirements during the treatment process.
Paint shops implemented water-saving measures, including biweekly tank cleaning instead of weekly.
Through this the endeavour reduced annual water use by 144 cubic metres while also reducing grease,
spray booth additives, and diesel.
The company also focused on water conservation. It urged employees to conserve water and, when
possible, deploy water-efficient technology and equipment. The company provided access to clean
drinking water and sanitary facilities at all of its locations and emphasised the importance of sustainable
water management. Monitoring of water usage and outflow found that 713,878 cubic metres of water
were utilised during the year, with a 3% decrease in intensity per vehicle compared to the previous year.
107,082 cubic metres (15%) of the water used in the manufacturing process was recycled after being
adequately cleaned and safely released back into the environment.
Interesting trends can be seen when comparing Honda’s financial data for 2022 and 2023. Toyota's gross
revenue made up 97.6% of their overall income in 2021, while other income made up 2.4%. In the
meantime, 61.6% of the total income was spent on materials, services, and other costs. While other
income increased to 3.6%, total revenue slightly declined to 96.4% in 2023. In addition, expenses rose to
62.4% of total revenue.
It's crucial to remember that these numbers do not, by themselves, paint a whole picture of Honda's
financial performance. Additional elements like operating income, taxes, and net profit margin should
be taken into account in order to evaluate their total profitability and financial stability. Additionally, the
information provided is fictitious and based on projected numbers for 2022 and 2023. The official
financial reports and statements of honda should be checked for the most recent and accurate financial
information.
The graph above, given by Jica (2011), depicts the total number of cars sold in India and Pakistan based
on engine size. The cars used in India in 2009-2010 are shown in the pie chart on the left. 80% of Indian
car purchases were for vehicles with an engine capacity of 800cc to 1000cc or less. Only 18% were for
vehicles with engines ranging from 1000cc to 1300cc or larger, and only 2% for vehicles with engines of
1300cc or more. While the right-hand pie chart reveals that, at 49%, Pakistan primarily purchases high-
end vehicles with 1300cc or more. The majority of the 21% sold in 2009-10 were automobiles ranging
from 800cc to 1000cc to 1300cc at 30%.
As lower-middle-class Indians buy their first cars, there is a clear difference between the two countries.
In Pakistan, the lower middle class buys secondhand automobiles, whilst new cars are only available to
the upper classes and those with greater salaries. This image is especially useful for business strategy
when determining which automobile models should be developed by the major automakers.
Estimated worldwide motor vehicle production from 2000 to 2023(in million vehicles)
From 2000 to 2023, global motor vehicle production fluctuated, driven by variables such as economic
conditions and the COVID-19 epidemic.
Production peaked in 2000 at 59 million vehicles and gradually rose until 2007, when it reached 73
million. However, the worldwide economic slump of 2008 and 2009 reduced sales to 63 million vehicles.
Despite this setback, the industry recovered and manufacturing continued to climb until 2019, when it
peaked at 97 million vehicles.
The commencement of the COVID-19 pandemic in 2020 had a substantial influence on worldwide motor
vehicle production, resulting in a decrease of 78 million vehicles. The pandemic disrupted supply
networks, causing manufacturing shutdowns and lowering consumer demand. However, as the world
adjusted to the challenges faced by the pandemic, production began to recover. Sales began to climb
again in 2021, with production reaching 85 million vehicles in 2022.
This increasing trend in car production implies a recovery from the pandemic's negative impacts, as well
as an increase in global demand for autos. The industry's resilience and adaptability to changing
conditions are obvious as it seeks to suit the demands and tastes of consumers all around the world. As
economies stabilise and international mobility resumes, the motor vehicle production graph is likely to
maintain its upward trend, indicating a favourable prognosis for the industry.
Effective data analysis is essential for organisations to make well-informed decisions grounded in
evidence rather than relying on intuition alone. Through the analysis of data, businesses can discover
patterns and trends, enabling them to seize growth opportunities, streamline processes, and minimise
risks. This tool allows for precise predictions, boosts efficiency, and fosters creativity. In today's data-
driven world, organisations can gain a competitive edge by harnessing the power of data analysis to
extract valuable insights from vast amounts of information.
• Determine: Consider the underlying objective of your data requirement before diving into it.
Identifying the questions that need to be addressed during the identification phase. What, for example,
is the customer's perception of our brand? What type of packaging is more appealing to our potential
clients, instead? Once the questions have been stated, you can proceed with the next step.
• Gather: This stage is where you start collecting the required data, as its name implies. Here, you
outline the data sources you will be using and explain your approach to utilising them. Information can
be collected from various sources, such as internal or external sources, surveys, interviews,
questionnaires, focus groups, and other methods. It's important to note that data collection methods for
quantitative and qualitative scenarios will vary.
• Efficient: Once you have gathered the necessary data, ensure it is organised and ready for analysis. It is
important to note that not all the data obtained will be relevant. In fact, there is a high likelihood of
encountering duplicate or improperly structured data when collecting large volumes of data in different
formats. In order to avoid this, it is crucial to thoroughly clean your data by eliminating any empty
spaces, duplicate records, and formatting issues prior to commencing your work. By implementing this
approach, you can ensure the use of high-quality data in your analysis.
• Examine: One can begin the process of analysing and manipulating data to draw meaningful
conclusions by employing a range of methods, such as statistical analysis, regressions, neural networks,
text analysis, and other techniques. At this stage, you uncover trends, correlations, variances, and
patterns that can help you find the answers to the questions you initially came up with during the
identification phase. There are numerous solutions accessible for individuals to effectively manage their
data, whether they are researchers or everyday individuals. Predictive analytics, data mining, business
intelligence, and visualisation tools are just a few examples.
• Understand and analyse: One of the most important steps is the final one; it's time to analyse your
findings. During this phase, the researcher formulates action plans based on the findings. One can
determine, for example, the packaging preferences of customers, such as their colour preference (red or
green) and material preference (plastic or paper). At this stage, it is important to recognise any
limitations and find ways to address them.