0% found this document useful (0 votes)

8 views24 pages

SQL Intervie Data Analyst

The document provides a comprehensive set of SQL interview questions and answers tailored for a Data Analyst position at American Express, focusing on key concepts such as SQL queries, joins, window functions, and data analysis techniques. It includes practical examples and explanations for various SQL operations, optimization strategies, and database design principles. Additionally, it covers scenario-based questions related to data analysis and customer churn, highlighting the differences between OLAP and OLTP databases.

Uploaded by

sdfaql94

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views24 pages

SQL Intervie Data Analyst

Uploaded by

sdfaql94

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 24

American Express Data Analyst

Interview Questions
Experience: 0-3 Years
CTC: 17-19 LPA
Prepared By Mandar Patil
SQL Questions:

1. Write a query to find duplicate rows in a table.

To detect duplicates, identify columns that should be unique and group
by them.

Example:
SELECT column1, column2, COUNT(*) AS count
FROM your_table
GROUP BY column1, column2
HAVING COUNT(*) > 1;

Explanation:
• GROUP BY combines rows with the same values in the specified
columns.
• HAVING COUNT(*) > 1 filters those combinations that occur more
than once, indicating duplicates.
Tip: Add ROW_NUMBER() or RANK() with CTE to highlight or delete
duplicates if needed.

Mandar Patil
2. Explain the difference between INNER JOIN and OUTER
JOIN with examples.

INNER JOIN:
Returns only matching records from both tables.

SELECT e.name, d.department_name

FROM employees e
INNER JOIN departments d ON e.department_id = d.department_id;
• Output: Only employees who belong to a department.
LEFT OUTER JOIN:
Returns all records from the left table, and matching records from the
right table. If no match, NULL is returned.
SELECT e.name, d.department_name
FROM employees e
LEFT JOIN departments d ON e.department_id = d.department_id;
• Output: All employees, with department info where available.

RIGHT OUTER JOIN:

Returns all records from the right table, and matching records from the
left.

FULL OUTER JOIN:

Returns all records from both tables, matching where possible.

Key Difference:
• INNER JOIN = intersection (matched data only)
• OUTER JOIN = union + NULLs (matched + unmatched data)

Mandar Patil
3. Write a query to fetch the second-highest salary from
an employee table.

Option 1: Using DISTINCT, ORDER BY, and LIMIT (MySQL/PostgreSQL)

SELECT DISTINCT salary

FROM employees
ORDER BY salary DESC
LIMIT 1 OFFSET 1;

Option 2: Using subquery (Generic SQL)

SELECT MAX(salary)
FROM employees
WHERE salary < (SELECT MAX(salary) FROM employees);

Explanation:
• The subquery fetches the highest salary.
• The outer query finds the maximum salary less than the highest —
giving the second highest.

4. How do you use GROUP BY and HAVING together?

Provide an example.

Use GROUP BY to group data and HAVING to filter aggregated results

(unlike WHERE, which filters raw rows).
SELECT department_id, COUNT(*) AS emp_count
FROM employees
GROUP BY department_id
HAVING COUNT(*) > 5;

Explanation:
• Groups employees by department.
• Filters groups where the count of employees is more than 5.

Mandar Patil
5. Write a query to find employees earning more than
their managers.

Assume the table employees has:

emp_id, name, salary, manager_id

SELECT e.name AS employee_name, e.salary, m.name AS manager_name,

m.salary AS manager_salary
FROM employees e
JOIN employees m ON e.manager_id = m.emp_id
WHERE e.salary > m.salary;

Explanation:
• Self-join: matches employees (e) with their managers (m).
• Filters those where employee's salary > manager's salary.

6. What is a window function in SQL? Provide examples of

ROW_NUMBER and RANK.

Definition:
A window function performs calculations across a set of table rows
related to the current row — without collapsing rows like GROUP BY.

Syntax:
FUNCTION_NAME() OVER (PARTITION BY column ORDER BY column)

Example: ROW_NUMBER()
Assigns a unique sequential number to each row within a partition.

SELECT name, department, salary,

ROW_NUMBER() OVER (PARTITION BY department ORDER BY salary
DESC) AS row_num
FROM employees;

Mandar Patil
Each employee within the same department gets a row number based on
salary rank (highest first).

Example: RANK()
Assigns the same rank to rows with equal values, but skips the next
rank(s).
SELECT name, department, salary,
RANK() OVER (PARTITION BY department ORDER BY salary DESC) AS
rank_num
FROM employees;
• If 2 employees have the same salary, both get rank 1, and the next
gets rank 3.

7. Write a query to fetch the top 3 performing products

based on sales.

Assume table sales_data has:

product_id, product_name, total_sales

SELECT product_id, product_name, total_sales

FROM sales_data
ORDER BY total_sales DESC
LIMIT 3;

Alternate using RANK() (if ties matter):

SELECT product_id, product_name, total_sales
FROM (
SELECT *, RANK() OVER (ORDER BY total_sales DESC) AS rank_num
FROM sales_data
) ranked_sales
WHERE rank_num <= 3;

Mandar Patil
8. Explain the difference between UNION and UNION ALL.
Feature UNION UNION ALL
Duplicates Removes duplicates Keeps all rows, including
duplicates
Performance Slower (because of sorting) Faster (no de-duplication)
Use case When you want distinct rows When duplicates are
meaningful
Example:
SELECT city FROM customers
UNION
SELECT city FROM vendors;
→ Returns a unique list of cities.

SELECT city FROM customers

UNION ALL
SELECT city FROM vendors;
→ Returns all cities, including duplicates.

9. How do you use a CASE statement in SQL? Provide an

example.

CASE lets you write conditional logic in SQL (similar to IF/ELSE).

SELECT name, salary,
CASE
WHEN salary >= 100000 THEN 'High'
WHEN salary >= 50000 THEN 'Medium'
ELSE 'Low'
END AS salary_category
FROM employees;

Explanation:
• Assigns a category based on salary value.
• Works inside SELECT, WHERE, ORDER BY, etc.

Mandar Patil
10. Write a query to calculate the cumulative sum of
sales.
Assume table sales has:
order_date, product_id, sales_amount

SELECT order_date, product_id, sales_amount,

SUM(sales_amount) OVER (PARTITION BY product_id ORDER BY
order_date) AS
cumulative_sales
FROM sales;

Explanation:
• SUM(...) OVER (...) calculates a running total per product based on
order date.
• PARTITION BY groups by product, and ORDER BY ensures the
accumulation follows chronological order.

11. What is a CTE (Common Table Expression), and how

is it used?

Definition:
A CTE (Common Table Expression) is a temporary, named result set that
you can reference within a SQL query.
It improves readability and simplifies complex subqueries or recursive
logic.

Syntax:
WITH cte_name AS (
SELECT ...
)
SELECT * FROM cte_name;

Mandar Patil
Example – Filter top-paid employees using CTE:

WITH HighEarners AS (
SELECT emp_id, name, salary
FROM employees
WHERE salary > 100000
)
SELECT * FROM HighEarners;

Benefits:
• Reusable and readable
• Allows recursion (e.g., hierarchical data)
• Avoids repeating subqueries

12. Write a query to identify customers who have

made transactions above $5,000 multiple times.

Assume transactions table has:

customer_id, transaction_amount

SELECT customer_id, COUNT(*) AS high_value_txns

FROM transactions
WHERE transaction_amount > 5000
GROUP BY customer_id
HAVING COUNT(*) > 1;

Explanation:
• Filters high-value transactions (> $5000).
• Groups them by customer.
• Returns customers who’ve done this more than once.

Mandar Patil
13. Explain the difference between DELETE and
TRUNCATE commands.
Feature DELETE TRUNCATE
Removes rows Yes (can use WHERE condition) Yes (removes all rows)
WHERE Yes No
supported?
Logging Logs each deleted row Minimal logging (faster)
(slower)
Rollback Can be rolled back (if within Can be rolled back (in some
transaction) RDBMS)
Identity reset Retains identity Resets identity (in most
DBs)
Use case Partial deletion or audit trail Full data wipe without audit
needed needed

14. How do you optimize SQL queries for better

performance?
Here are key SQL optimization techniques:
1. Use SELECT only required columns
-- Bad
SELECT * FROM orders;
-- Good
SELECT order_id, customer_id FROM orders;

2. Create proper indexes

• Index frequently used columns in JOIN, WHERE, ORDER BY.

3. Avoid functions on indexed columns

-- Slower (cannot use index)

WHERE YEAR(order_date) = 2024
-- Better
WHERE order_date BETWEEN '2024-01-01' AND '2024-12-31'

Mandar Patil
4. Use EXISTS instead of IN (for subqueries)
-- Prefer EXISTS (better for large datasets)
SELECT name FROM customers c
WHERE EXISTS (
SELECT 1 FROM orders o WHERE o.customer_id = c.customer_id );

5. Avoid unnecessary joins or nested subqueries

6. Use appropriate data types and avoid implicit conversions

7. Analyze execution plans (EXPLAIN or EXPLAIN ANALYZE)

15. Write a query to find all customers who have not

made any purchases in the last 6 months.

Assume:
• customers(customer_id, name)
• transactions(customer_id, transaction_date)

SELECT c.customer_id, c.name

FROM customers c
LEFT JOIN transactions t
ON c.customer_id = t.customer_id
AND t.transaction_date >= CURRENT_DATE - INTERVAL '6 months'
WHERE t.customer_id IS NULL;

Explanation:
• LEFT JOIN includes all customers.
• WHERE t.customer_id IS NULL ensures the customer had no
purchase in the last 6 months.

Mandar Patil
16. How do you handle NULL values in SQL? Provide
examples.
NULL represents missing or unknown data.
1. Using IS NULL / IS NOT NULL:
SELECT * FROM employees WHERE manager_id IS NULL;

2. Replace NULL using COALESCE() or IFNULL() (MySQL):

SELECT name, COALESCE(phone_number, 'Not Provided') AS contact
FROM customers;

3. Handling NULLs in aggregation (e.g., AVG, SUM):

• These functions ignore NULLs by default.
SELECT AVG(salary) FROM employees;

4. Conditional checks:
SELECT name,
CASE
WHEN salary IS NULL THEN 'Unknown'
ELSE 'Known'
END AS salary_status
FROM employees;

17. Write a query to transpose rows into columns.

Assume a table sales with:
region, month, sales_amount
We want to pivot month values into columns.

Using CASE:

SELECT region,
SUM(CASE WHEN month = 'Jan' THEN sales_amount ELSE 0 END) AS Jan,

Mandar Patil
SUM(CASE WHEN month = 'Feb' THEN sales_amount ELSE 0 END) AS Feb,
SUM(CASE WHEN month = 'Mar' THEN sales_amount ELSE 0 END) AS Mar
FROM sales
GROUP BY region;

2. Using PIVOT (SQL Server or Oracle syntax):

SELECT region, [Jan], [Feb], [Mar]
FROM (
SELECT region, month, sales_amount
FROM sales
) AS src
PIVOT (
SUM(sales_amount)
FOR month IN ([Jan], [Feb], [Mar])
) AS p;

18. Explain indexing and how it improves query

performance.
What is an index?
An index is a data structure that improves the speed of data retrieval
operations on a database table at the cost of additional space and write-
time performance.

How indexing helps:

Feature With Index Without Index
Search performance Fast (uses binary/tree search) Slow (scans every row
— full scan)
Used in WHERE, JOIN, ORDER BY, Inefficient for large
GROUP BY datasets
Types B-tree (default), Bitmap, -
Hash, etc.

Example:
-- Creating index
CREATE INDEX idx_customer_id ON transactions(customer_id);

Mandar Patil
• This helps queries like:
SELECT * FROM transactions WHERE customer_id = 101;

Important notes:
• Too many indexes can slow down INSERT/UPDATE.
• Avoid indexing columns with low cardinality (e.g., gender).
• Use composite indexes when querying multiple columns together.

19. Write a query to fetch the maximum transaction

amount for each customer.

Assume a transactions table:

Column Description
customer_id ID of the customer
transaction_id Unique transaction ID
amount Transaction amount

Query:
SELECT customer_id, MAX(amount) AS max_transaction
FROM transactions
GROUP BY customer_id;

Explanation:
• GROUP BY groups all transactions by customer.
• MAX(amount) returns the highest transaction for each group
(customer).

Mandar Patil
20. What is a self-join, and how is it used?
Definition:
A self-join is a regular join where a table is joined with itself.
It is useful when rows in a table are related to other rows in the same
table.

Example Use Case – Employees and Managers:

Assume:

emp_id name manager_id

1 Alice NULL
2 Bob 1
3 Carol 1
4 David 2

Here, manager_id refers to emp_id of another employee.

Query: Get employee names along with their manager names

SELECT e.name AS employee_name, m.name AS manager_name
FROM employees e
LEFT JOIN employees m
ON e.manager_id = m.emp_id;

Explanation:
• e is an alias for employees (as employee).
• m is another alias for the same table (as manager).
• The join links an employee to their manager using manager_id =
emp_id.

Mandar Patil
Data Analysis/Scenario-Based Questions

21. How would you design a database to store credit

card transaction data?
To store credit card transaction data, we need to normalize the structure
while keeping it scalable, secure, and query-efficient.

Suggested Schema Design:

1. Customers Table customer_id (PK), name, email, phone, address
2. Cards Table card_id (PK), customer_id (FK), card_number (masked),
card_type, status, issued_date
3. Merchants Table merchant_id (PK), name, category, location
4. Transactions Table transaction_id (PK), card_id (FK), merchant_id (FK),
transaction_date, amount, currency, status, location

Best Practices:
• Mask sensitive fields (like card numbers).
• Store card_number as encrypted or tokenized.
• Use partitioning on date fields for faster querying.
• Add indexes on card_id, merchant_id, transaction_date.

22. Write a query to identify the most profitable

regions based on transaction data.
Assume a transactions table:
(transaction_id, customer_id, amount, region, transaction_date)

Query to find top 3 profitable regions:

SELECT region, SUM(amount) AS total_revenue
FROM transactions
GROUP BY region
ORDER BY total_revenue DESC

Mandar Patil
LIMIT 3;

Explanation:
• Aggregates transaction amounts per region.
• Orders regions by total revenue.
• Retrieves top 3 using LIMIT.
Optional: You could also calculate profit by subtracting costs (if a cost
column is present).
23. How would you analyze customer churn using
SQL?
Step-by-step SQL approach:
Step 1: Define churn
Let’s say a churned customer is one who hasn’t transacted in the last 6
months.

Step 2: Sample schema

• customers(customer_id, name, signup_date)
• transactions(customer_id, transaction_date, amount)

Step 3: Query to identify churned customers

SELECT c.customer_id, c.name

FROM customers c
LEFT JOIN transactions t
ON c.customer_id = t.customer_id
AND t.transaction_date >= CURRENT_DATE - INTERVAL '6 months'
WHERE t.transaction_id IS NULL;

Step 4: Analyze churn metrics

You could extend this analysis by calculating:
• Churn rate = (Churned Customers / Total Customers) * 100
• Monthly churn trend
• Compare churned vs. active customers in terms of average spend

Mandar Patil
24. Explain the difference between OLAP and OLTP
databases.
OLTP(Online Transaction OLAP (Online Analytical
Feature Processing) Processing)
Purpose Handles real-time Used for analytical/reporting
transactional queries queries

Operations INSERT, UPDATE, DELETE SELECT (aggregate, group,

slice, dice)
Data Structure Highly normalized (3NF) De-normalized
(star/snowflake schema)
Speed Fast for read/write of Fast for complex analytical
single rows queries
Examples Banking systems, e- Business intelligence,
commerce, order dashboards, sales trends
processing
Users Clerks, DBAs Analysts, Data Scientists
Backup/Recovery Essential and frequent Less frequent

In short:
• OLTP = operational, fast, real-time transactions.
• OLAP = analytical, slow-changing, historical data.

25. How would you determine the Average Revenue

Per User (ARPU) from transaction data?

ARPU = Total Revenue / Total Number of Users

Assume a transactions table:
(transaction_id, customer_id, amount, transaction_date)

SQL Query:
SELECT
SUM(amount) * 1.0 / COUNT(DISTINCT customer_id) AS ARPU
FROM transactions;

Mandar Patil
Explanation:
• SUM(amount) gets total revenue.
• COUNT(DISTINCT customer_id) counts unique users.
• Multiply by 1.0 to ensure float division.

You can also compute monthly ARPU by grouping by month.

SELECT
DATE_TRUNC('month', transaction_date) AS month,
SUM(amount) * 1.0 / COUNT(DISTINCT customer_id) AS monthly_arpu
FROM transactions
GROUP BY month
ORDER BY month;

26. Describe a scenario where you would use a LEFT

JOIN instead of an INNER JOIN.
Use LEFT JOIN when:
You want all records from the left table, even if there's no matching record
in the right table.

Real-life Scenario:
Question: List all customers and their transactions — even if they haven't
made any.
Query:
SELECT c.customer_id, c.name, t.transaction_id, t.amount
FROM customers c
LEFT JOIN transactions t
ON c.customer_id = t.customer_id;

Why LEFT JOIN?

• Shows all customers, including those with no transactions (returns
NULLs for those).
• Using INNER JOIN would exclude customers with zero activity.

Mandar Patil
27. Write a query to calculate YoY (Year-over-Year)
growth for a set of transactions.

Assume a table named transactions with:

(customer_id, transaction_date, amount)

Step 1: Extract year-wise revenue

SELECT
EXTRACT(YEAR FROM transaction_date) AS year,
SUM(amount) AS total_revenue
FROM transactions
GROUP BY EXTRACT(YEAR FROM transaction_date);

Step 2: Calculate YoY Growth using a CTE and Self-Join

WITH yearly_revenue AS (
SELECT
EXTRACT(YEAR FROM transaction_date) AS year,
SUM(amount) AS total_revenue
FROM transactions
GROUP BY EXTRACT(YEAR FROM transaction_date)
)
SELECT
curr.year AS current_year,
curr.total_revenue,
prev.total_revenue AS previous_year_revenue,
ROUND(((curr.total_revenue - prev.total_revenue) /
prev.total_revenue) * 100, 2) AS yoy_growth_percent
FROM yearly_revenue curr
LEFT JOIN yearly_revenue prev
ON curr.year = prev.year + 1;

Mandar Patil
Explanation:
• Joins each year to its previous year.
• Computes YoY growth as a percentage.

28. How would you implement fraud detection using

transactional data?
Fraud detection typically involves pattern recognition, anomaly detection,
and rule-based filtering.

Possible SQL-Based Checks:

Type Rule
Unusual Amounts Flag transactions > 3x average amount of that
user
Rapid Repeats Detect multiple transactions from same user
within seconds
Location Mismatch Transactions from different countries within
a short time
Card Sharing Same card used by different customers or IPs

Example Query – Unusual high amount per user:

WITH avg_txn AS (
SELECT customer_id, AVG(amount) AS avg_amount
FROM transactions
GROUP BY customer_id
)
SELECT t.*
FROM transactions t
JOIN avg_txn a
ON t.customer_id = a.customer_id
WHERE t.amount > 3 * a.avg_amount;

Mandar Patil
29. Write a query to find customers who have used
more than 2 credit cards for transactions in a given
month.
Assume a transactions table:
(customer_id, card_id, transaction_date)

Query:
SELECT customer_id,
TO_CHAR(transaction_date, 'YYYY-MM') AS txn_month,
COUNT(DISTINCT card_id) AS cards_used
FROM transactions
GROUP BY customer_id, TO_CHAR(transaction_date, 'YYYY-MM')
HAVING COUNT(DISTINCT card_id) > 2;

Explanation:
• Groups by customer_id and month.
• Counts distinct card_id used.
• Filters where more than 2 cards were used in a month.

30. How would you approach a business problem

where you need to analyze the spending patterns of
premium customers?
Step-by-Step Structured Approach:

Step 1: Understand the Objective

• Clarify with stakeholders what "spending pattern" means.
 Is it frequency, amount, category, channel, or timing?

• Define “premium customer”:

 Based on credit score, card tier (e.g., Platinum, Centurion),
monthly spend threshold, etc.

Mandar Patil
Step 2: Data Collection
• Gather relevant datasets:
 Customer table (ID, tier, demographics)
 Transactions table (amount, date, category, location)
 Cards table (card_type, limits, activation)

Step 3: Data Cleaning & Preparation

• Handle missing values and outliers.
• Filter only premium customers using defined criteria.
• Enrich data (e.g., categorize merchant types or locations).

Step 4: Exploratory Data Analysis (EDA)

Use SQL/Python/Power BI to derive insights like:

Focus Area Example Analysis

Spend Amount Average monthly/yearly spend
Time Trends Seasonality or weekly spending behavior
Categories Where they spend most (Travel, Dining,
Shopping)
Geography City or region-wise behavior
Trends Is their spend increasing/decreasing YoY?

Step 5: Segmentation
• Use clustering or thresholds to group premium customers into:
 High spenders
 Frequent spenders
 Category loyalists (e.g., only travel)
• Identify anomalies or subgroups with unique patterns.

Step 6: Business Recommendations

• Personalize rewards or offers based on their dominant categories.
• Enhance retention strategies for segments showing decline.
• Promote premium card upgrades based on usage patterns.

Mandar Patil
Bonus: Sample SQL Query
Get top 3 spending categories of premium customers monthly:

SELECT customer_id,
DATE_TRUNC('month', transaction_date) AS txn_month,
category,
SUM(amount) AS total_spend
FROM transactions
WHERE customer_id IN (
SELECT customer_id FROM customers WHERE tier = 'Premium'
)
GROUP BY customer_id, txn_month, category
ORDER BY customer_id, txn_month, total_spend DESC;

Mandar Patil
Want to Learn more
about Data
Analytics Free?
Do Join the Community
Channel Link is in the Bio and
Comments section

FOLLOW MANDAR PATIL FOR

MORE DATA ANALYTICS TIPS!

Mandar Patil

What Every Engineer Should Know About
0% (1)
What Every Engineer Should Know About
217 pages
FullStack - Grade 10 - ICT - மீட்டல் வினாத்தாள்-1
83% (6)
FullStack - Grade 10 - ICT - மீட்டல் வினாத்தாள்-1
14 pages
Jee Main Maths 27 January Shift 1 Byjus
No ratings yet
Jee Main Maths 27 January Shift 1 Byjus
6 pages
Ömer Aydan (Author) - Rock Mechanics and Rock Engineering-Volume 1 - Fundamentals of Rock Mechanics (2019, CRC Press) - Libgen - Li
100% (4)
Ömer Aydan (Author) - Rock Mechanics and Rock Engineering-Volume 1 - Fundamentals of Rock Mechanics (2019, CRC Press) - Libgen - Li
392 pages
SQL Exam Questoins
No ratings yet
SQL Exam Questoins
6 pages
Aaaaaa
No ratings yet
Aaaaaa
15 pages
SQL Questions
No ratings yet
SQL Questions
14 pages
SQL Interview Guide For Three Previous Posts
No ratings yet
SQL Interview Guide For Three Previous Posts
15 pages
Interview Preparation Data Collection-01
No ratings yet
Interview Preparation Data Collection-01
14 pages
KPMG Data Analyst Interview Questions
No ratings yet
KPMG Data Analyst Interview Questions
30 pages
100 SQL Questions With Real Examples-2
No ratings yet
100 SQL Questions With Real Examples-2
16 pages
SQL Short Notes Top 10 Questions 1748266007
No ratings yet
SQL Short Notes Top 10 Questions 1748266007
8 pages
SQL Questions Guide For QA S 1738694053
No ratings yet
SQL Questions Guide For QA S 1738694053
24 pages
Tech Mahindra Data Analyst Interview Questions
No ratings yet
Tech Mahindra Data Analyst Interview Questions
11 pages
Detailed SQL Interview Questions
No ratings yet
Detailed SQL Interview Questions
4 pages
TCS Data Analyst Interview Questions
No ratings yet
TCS Data Analyst Interview Questions
8 pages
SQL 1732644814
No ratings yet
SQL 1732644814
7 pages
SQL Interview Questions
No ratings yet
SQL Interview Questions
4 pages
Data Engineer (3-5 Years of Experience.) PDF
No ratings yet
Data Engineer (3-5 Years of Experience.) PDF
7 pages
Interview - 7 - IMP
No ratings yet
Interview - 7 - IMP
26 pages
Top Advanced SQL Interview Questions & Answers
No ratings yet
Top Advanced SQL Interview Questions & Answers
6 pages
Myntra SQL
No ratings yet
Myntra SQL
34 pages
Tech Mahindra SQL Interview Questions For Data Engineer
No ratings yet
Tech Mahindra SQL Interview Questions For Data Engineer
6 pages
Basic Select, Where, Distinct
No ratings yet
Basic Select, Where, Distinct
11 pages
Day 9 1733668828
No ratings yet
Day 9 1733668828
8 pages
Basic SQL Interview Questions
No ratings yet
Basic SQL Interview Questions
18 pages
Ade 1737191501
No ratings yet
Ade 1737191501
29 pages
Most Asked SQL Queries in Interview
No ratings yet
Most Asked SQL Queries in Interview
6 pages
SQL Part1 (Basics)
No ratings yet
SQL Part1 (Basics)
6 pages
SQL Query Questions Commonly Asked in Interviews 1732271818
No ratings yet
SQL Query Questions Commonly Asked in Interviews 1732271818
7 pages
SQL Interview Questions
No ratings yet
SQL Interview Questions
8 pages
SQL Info
No ratings yet
SQL Info
12 pages
Unit 3
No ratings yet
Unit 3
4 pages
SQL Interview Questions
No ratings yet
SQL Interview Questions
5 pages
SQL 2
No ratings yet
SQL 2
15 pages
Quiz No.3
No ratings yet
Quiz No.3
13 pages
SQL Interview Questions Top 100
No ratings yet
SQL Interview Questions Top 100
18 pages
Top 80 SQL Questions 1738906719
No ratings yet
Top 80 SQL Questions 1738906719
10 pages
Important SQL Q&A
No ratings yet
Important SQL Q&A
18 pages
Wipro Data Analyst Interview Questions
No ratings yet
Wipro Data Analyst Interview Questions
29 pages
SQL Interview Questions 1728270256
No ratings yet
SQL Interview Questions 1728270256
19 pages
SQL Interview Questions and Answers
No ratings yet
SQL Interview Questions and Answers
5 pages
Questions For SQL Part 1
No ratings yet
Questions For SQL Part 1
17 pages
Test Questions
No ratings yet
Test Questions
10 pages
SQL Latest
No ratings yet
SQL Latest
7 pages
Database Nest Quiz
No ratings yet
Database Nest Quiz
22 pages
SQL Question Basic to Advanced
No ratings yet
SQL Question Basic to Advanced
21 pages
Books Isbn - N o Title Publisher - I D Year
No ratings yet
Books Isbn - N o Title Publisher - I D Year
29 pages
IP XII Quick Notes - Querying in MYSQL
No ratings yet
IP XII Quick Notes - Querying in MYSQL
11 pages
University of Technology Sydney SQL-Exam-Notes 2019 Database Fundamental
No ratings yet
University of Technology Sydney SQL-Exam-Notes 2019 Database Fundamental
2 pages
MySQL Practical Lab Manual-1
No ratings yet
MySQL Practical Lab Manual-1
16 pages
SQL Scenario-Based Interview Questions & Answers: Nitya Cloudtech PVT LTD
No ratings yet
SQL Scenario-Based Interview Questions & Answers: Nitya Cloudtech PVT LTD
14 pages
Relational Database Interview Questions With Sections
No ratings yet
Relational Database Interview Questions With Sections
5 pages
CC6001ES ADSD SQL Exam Model Paper 1 Answers
No ratings yet
CC6001ES ADSD SQL Exam Model Paper 1 Answers
12 pages
100+ SQL Interview Questions
No ratings yet
100+ SQL Interview Questions
20 pages
Unit 5 Bcomcardbms 2024
No ratings yet
Unit 5 Bcomcardbms 2024
31 pages
SQL Interview Queries Reference
No ratings yet
SQL Interview Queries Reference
6 pages
PostgreSQL Mastery Cheat Sheet
No ratings yet
PostgreSQL Mastery Cheat Sheet
1 page
DBMS File Section B
No ratings yet
DBMS File Section B
22 pages
Basic SQL For Data Analyst Interview Questions
No ratings yet
Basic SQL For Data Analyst Interview Questions
10 pages
SQL Scenario Based Questions-1
No ratings yet
SQL Scenario Based Questions-1
25 pages
SQL Doc
No ratings yet
SQL Doc
39 pages
Panasonic KX HDV130 User Guide v1.0
No ratings yet
Panasonic KX HDV130 User Guide v1.0
35 pages
Carbolite Furnace
No ratings yet
Carbolite Furnace
6 pages
Assignment Unit 4
No ratings yet
Assignment Unit 4
13 pages
B B B Boerhavia Diffusa Oerhavia Diffusa Oerhavia Diffusa Oerhavia Diffusa
No ratings yet
B B B Boerhavia Diffusa Oerhavia Diffusa Oerhavia Diffusa Oerhavia Diffusa
5 pages
New Job 1-New Section 1-PCR Report
100% (1)
New Job 1-New Section 1-PCR Report
2 pages
3205-Article Text-23308-1-10-20240703
No ratings yet
3205-Article Text-23308-1-10-20240703
7 pages
Chapter-6 and 7
100% (1)
Chapter-6 and 7
16 pages
Test - Chapter 8 WAVES - Quizlet
No ratings yet
Test - Chapter 8 WAVES - Quizlet
3 pages
Internal Flow
No ratings yet
Internal Flow
6 pages
Open Document Architecture
No ratings yet
Open Document Architecture
15 pages
Exploring Ambiguous Harmony With Triadic Slash Chords: BAJP - Year 4 Written Analysis Project
100% (1)
Exploring Ambiguous Harmony With Triadic Slash Chords: BAJP - Year 4 Written Analysis Project
39 pages
LeChatelier's Principle MiniLab
No ratings yet
LeChatelier's Principle MiniLab
2 pages
Example Test (Copy)
No ratings yet
Example Test (Copy)
26 pages
Cramer's Rule - Wikipedia
No ratings yet
Cramer's Rule - Wikipedia
29 pages
Fiitjee - Phase Test (JEE-Advanced) : Physics, Chemistry & Mathematics
No ratings yet
Fiitjee - Phase Test (JEE-Advanced) : Physics, Chemistry & Mathematics
13 pages
TMGT Aab m1 Cs 001 - Rev 3 HSD Storage Tank - Ifc
No ratings yet
TMGT Aab m1 Cs 001 - Rev 3 HSD Storage Tank - Ifc
59 pages
1756-IF16 (Single Ended Current) With 1492-AIFM16-F-5
No ratings yet
1756-IF16 (Single Ended Current) With 1492-AIFM16-F-5
1 page
TC L42U30 Panasonic
No ratings yet
TC L42U30 Panasonic
82 pages
Desalination Final PaperSubmission-Nannarone
No ratings yet
Desalination Final PaperSubmission-Nannarone
13 pages
GNU Linear Programming Kit Reference Manual
No ratings yet
GNU Linear Programming Kit Reference Manual
181 pages
Basic Derivatives Practice Worksheet
100% (1)
Basic Derivatives Practice Worksheet
2 pages
Chapter 1 The Nature of Probability and Statistics
No ratings yet
Chapter 1 The Nature of Probability and Statistics
9 pages
Drawing Lecture 1
No ratings yet
Drawing Lecture 1
28 pages
Stoichiometry
No ratings yet
Stoichiometry
14 pages
Ericsson_Antenna_80010866_DS
No ratings yet
Ericsson_Antenna_80010866_DS
6 pages

SQL Intervie Data Analyst

Uploaded by

SQL Intervie Data Analyst

Uploaded by

American Express Data Analyst

1. Write a query to find duplicate rows in a table.

SELECT e.name, d.department_name

RIGHT OUTER JOIN:

FULL OUTER JOIN:

Option 1: Using DISTINCT, ORDER BY, and LIMIT (MySQL/PostgreSQL)

SELECT DISTINCT salary

Option 2: Using subquery (Generic SQL)

4. How do you use GROUP BY and HAVING together?

Use GROUP BY to group data and HAVING to filter aggregated results

Assume the table employees has:

SELECT e.name AS employee_name, e.salary, m.name AS manager_name,

6. What is a window function in SQL? Provide examples of

SELECT name, department, salary,

7. Write a query to fetch the top 3 performing products

Assume table sales_data has:

SELECT product_id, product_name, total_sales

Alternate using RANK() (if ties matter):

SELECT city FROM customers

9. How do you use a CASE statement in SQL? Provide an

CASE lets you write conditional logic in SQL (similar to IF/ELSE).

SELECT order_date, product_id, sales_amount,

11. What is a CTE (Common Table Expression), and how

12. Write a query to identify customers who have

Assume transactions table has:

SELECT customer_id, COUNT(*) AS high_value_txns

14. How do you optimize SQL queries for better

2. Create proper indexes

3. Avoid functions on indexed columns

-- Slower (cannot use index)

5. Avoid unnecessary joins or nested subqueries

6. Use appropriate data types and avoid implicit conversions

7. Analyze execution plans (EXPLAIN or EXPLAIN ANALYZE)

15. Write a query to find all customers who have not

SELECT c.customer_id, c.name

2. Replace NULL using COALESCE() or IFNULL() (MySQL):

3. Handling NULLs in aggregation (e.g., AVG, SUM):

17. Write a query to transpose rows into columns.

2. Using PIVOT (SQL Server or Oracle syntax):

18. Explain indexing and how it improves query

How indexing helps:

19. Write a query to fetch the maximum transaction

Assume a transactions table:

Example Use Case – Employees and Managers:

emp_id name manager_id

Here, manager_id refers to emp_id of another employee.

Query: Get employee names along with their manager names

21. How would you design a database to store credit

Suggested Schema Design:

22. Write a query to identify the most profitable

Query to find top 3 profitable regions:

Step 2: Sample schema

Step 3: Query to identify churned customers

SELECT c.customer_id, c.name

Step 4: Analyze churn metrics

Operations INSERT, UPDATE, DELETE SELECT (aggregate, group,

25. How would you determine the Average Revenue

ARPU = Total Revenue / Total Number of Users

You can also compute monthly ARPU by grouping by month.

26. Describe a scenario where you would use a LEFT

Why LEFT JOIN?

Assume a table named transactions with:

Step 1: Extract year-wise revenue

Step 2: Calculate YoY Growth using a CTE and Self-Join

28. How would you implement fraud detection using

Possible SQL-Based Checks:

Example Query – Unusual high amount per user:

30. How would you approach a business problem

Step 1: Understand the Objective

• Define “premium customer”:

Step 3: Data Cleaning & Preparation

Step 4: Exploratory Data Analysis (EDA)

Focus Area Example Analysis

Step 6: Business Recommendations

FOLLOW MANDAR PATIL FOR

You might also like