0% found this document useful (0 votes)
30 views2 pages

Naan Mudhalvan Questions

Uploaded by

bhavadharanimcr
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views2 pages

Naan Mudhalvan Questions

Uploaded by

bhavadharanimcr
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Naan Mudhalvan Questions

Git & Version Control

1. Explain the role of Git in version control and collaboration. Describe key operations such as
cloning, pushing, committing, and the purpose of branches.

2. Describe the one-time Git setup process. Include the purpose of commands like git config, git init,
git add, and git commit with examples.

Python & Pandas

3. Why is Python widely used for Data Science? Discuss its features as a high-level interpreted
language and its relevance in this field.

4. Explain how Pandas is used in data manipulation. Demonstrate the use of important functions like
dropna(), fillna(), and how it handles missing or duplicate data.

Cloud Computing & Oracle Cloud Infrastructure (OCI)

5. Why is cloud computing preferred for data science projects? Discuss advantages such as scalability,
cost-efficiency, data integration, and collaboration.

6. Define 'Tenancy' in OCI. Explain how compartments, users, groups, and policies are structured and
secured. Include an example of a policy rule.

7. What is the role of OCI Resource Manager? Describe how Terraform is used to create and deploy
infrastructure using Resource Manager.

8. Describe the key networking concepts in OCI such as VCN, Subnets, and Gateways (Internet, NAT,
Service). How do they contribute to secure connectivity?

9. Discuss the significance of authentication in OCI. Differentiate between public and private keys and
explain how they are used.

10. What is ADS (Accelerated Data Science) SDK in OCI? Mention any four use cases like fraud
detection, healthcare analytics, or predictive maintenance.

Data Science Basics

11. What are the core requirements for a successful Data Science project? Explain the roles of
mathematics, statistics, domain knowledge, and computer science.
12. What is the significance of platforms like Kaggle in Data Science? How do they support learning,
collaboration, and dataset access?

Machine Learning & Data Preparation

13. Explain the complete Machine Learning Lifecycle. Include steps such as data access, exploration,
preparation, modeling, and deployment.

14. What are the common challenges in real-world data preprocessing? Explain how missing data
and outliers are handled.

15. Describe feature scaling and dimensionality reduction. Explain the difference between
normalization and standardization.

16. What is the purpose of splitting data into training, validation, and test sets? Explain a typical split
ratio and its importance in model evaluation.

Data Visualization

17. Differentiate between Matplotlib and Seaborn libraries in Python. Highlight their use cases, key
features, and relationship with each other.

18. Explain any three Seaborn visualizations such as pairplot, boxplot, and heatmap. Mention
suitable use cases for each.

Supervised & Unsupervised Learning

19. Compare Supervised and Unsupervised Learning with examples. Describe typical algorithms
under each category and their applications.

20. Explain Linear and Logistic Regression. Include formulas, types of data used, the meaning of
slope/intercept, and differences in use-cases.

You might also like