Lab5 - Prep.
Lab5 - Prep.
We have dealt a lot with different types of data files and analyzing it so far. But we haven’t dealt
with question regarding data quality and data manipulation yet. Data quality is perhaps one of the
most important aspects in a data project. Ensuring your data is in the right format is key to
performing all your analysis. If the data isn’t right, the analysis most certainly won’t be.
In our previous labs, we have always had clean, good data where we assumed we had no issues, and
the data was usually in a good format, ready to be analyzed and used. This may not always be the
case. In real world scenarios, you may not get a pretty and clean workbook to start your work.
Instead, you will have to filter out relevant data, clean it to make sure there are no issues and
combine it from multiple sources to get the right dataset before you can even start your work.
So let’s spend some time practicing these concepts in Excel within this Lab.
Lock Formulas in a Cell, so that another user does not change it. A reference video on how to
achieve this is given here: https://trumpexcel.com/lock-formulas-excel/
Text Functions
Check out ‘remove duplicates’ function from the data tab ribbon of excel.
‘text to column’ function can be found under the Data tab ribbon of excel.
Find a text related excel function that makes the first letters of the name capital. E.g. muhammad
ahmed -> Muhammad Ahmed.
Find a text function to make all the letters upper case.
Find a text related function to display only the last 4 letters of a word.
Find function to concatenate multiple words from different columns into one word.
Use random functions (see Excel help for different types) to generate random numbers.
Checkout Data Validation options in the Data Tab.
Order Date should only be a date for January 2024. Add an Error Message saying: “Please
enter dates for January 2024 only”
Customer Name should be text only. Add appropriate Error Message.
How to view only unique values.
2. Locking cells