100% found this document useful (1 vote)
504 views1 page

ChatGPT For Data Science Cheat Sheet KDnuggets

The document covers requesting assistance on topics like data analysis, research, generating and cleaning data

Uploaded by

Ali Karar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
100% found this document useful (1 vote)
504 views1 page

ChatGPT For Data Science Cheat Sheet KDnuggets

The document covers requesting assistance on topics like data analysis, research, generating and cleaning data

Uploaded by

Ali Karar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Visit KDnuggets.

com for more


cheatsheets and additional
learning resources.

Unit Test Dummy Data


Imbalance Data
>>> Write a unit test for train function. The test cases are: x >>> Generate the dummy data for me to use as
>>> I have an imbalanced dataset with target column
should not be null value and y should be a numerical value. placeholders in my spreadsheet.
species. In python, how do I oversample and/or

ChatGPT
undersample my data?
Code Explanation Tips

>>> Can you explain what the code is doing? `[code- >>> Give me some tips on how to improve the efficiency of
Explain the Model
snippet]` my spreadsheet?
>>> I have trained a LightGBM model. Write a Python code

Cheat Sheet

to explain the output using a series of plots with Shap.


Optimize Code
>>> Can you improve the time complexity of the code?
`[code-snippet]`
Data Analysis
Research
ChatGPT is a large language conversational AI built by
OpenAI. It was trained using Reinforcement Learning from
Human Feedback, similar to InstructGPT. ChatGPT SQL Generate Data
understands the prompt and provides detailed repose that >>> Generate a fake data with 100 rows and 4 columns:
can help you with research, coding, and various data [id,name,grade,subject] Explain the Concept
science tasks.
>>> Explain t-test to an undergraduate as a data science
Data Cleaning instructor.
>>> I have a text classification dataset. Write Python code

SQL Formatting
for data cleaning. Stakeholders
>>> Format the following SQL code and convert all reserved

>>> Tips on explaining data science reports to a business
Ideas keywords to uppercase. `[code-snippet]`
Data Exploration stakeholder.

>>> I have a dataset of 100 rows and four columns:[id,

Translate Between DBMS


name, grade, subject]. Write R code for data visualization Summarize the paper
>>> What is the equivalent of PostgreSQL's DATE_TRUNC for
and exploration. >>> Please summarize the paper “Adding Conditional
MySQL?

Control to Text-to-Image Diffusion Models” in simple terms
Dataset Suggestion

Data Visualization in one paragraph.


>>> I want to build a predictive model for image classifiers. Calculate Average
>>> I have a dataset with 100 rows columns [id, name,

Can you please suggest the five most relevant datasets for >>> Write the SQL code that works for PostgreSQL 14. I have
grade, subject]. Create a matplotlib bar chart of subject vs. Writing Blog
my use case? a table with two columns [date, temp]. I would like to
grade. >>> Write an outline for a blog "Python lists".

calculate an average temp.


Suggest Resources

Research History
>>> I would like to learn about deep learning. Please Calculate Runway
>>> Can you research the history of the graph neural
suggest 3 best specific resources. >>> Write SQL to calculate my runway.
network?

AB Testing
Machine Learning
>>> [case-study]. Please design an A/B test for this purpose.
Please include the concrete steps on which statistical test
I should run.
Spreadsheets

Train Regression Model


Subscribe to KDnuggets News
Career Coaching
>>> I am looking for a role as a data engineer. My >>> You are a data scientist, write Python code for me. I
background is management. What should I do in 6 months have a dataset with columns [model,hp,speed]. Please
to get a job? build a machine learning model that predicts speed.

Spreadsheets Formula Hyperparameter Tuning


>>> Create a spreadsheet formula to calculate the sum of >>> I have a logistic regression model, write Python code to
cells B1 to B20? tune hyperparameters.
Coding
Abid Ali Awan | 2023

You might also like