Tesla

tesla dataset

Uploaded by

ahmedsamer6788
Copyright
© © All Rights Reserved

Big Data Project Fall 2024

Tesla
Your task is to run PySpark queries and data processing on the Tesla dataset. The dataset records Tesla's stock
exchange performance and pricing trends over a period beginning in 2019.
Instructions:
• Carefully examine the dataset to understand its structure and contents.
• Perform the necessary data queries and processing tasks using PySpark.
• Use the information obtained to answer the following questions accurately and comprehensively.

1. What are the column names?
2. What does the schema look like?
3. Display the first 4 rows
4. Describe the dataset in statistical measurements
5. Reshape the results of the previous question into a suitable format
6. Add a new column that displays the ratio of Open to Volume
7. What day had the peak closing price?
8. What is the average opening price?
9. What are the maximum and the variance of Volume?
10. How many days was the opening lower than 100 dollars?
11. What percentage of the time was the closing price greater than 200 dollars?
12. What is the sample covariance between High and Low?
13. What is the average opening price per month?
14. What is the lowest high per year?

➢ The data is in the file “TSLA-Copy1.csv”.

Best of luck
