Data Preprocessing Automation is a Python-based GUI application designed to simplify and automate data preprocessing tasks. It allows users to upload Excel files, automatically handle missing values, remove duplicates, and detect and remove outliers using statistical methods. The application provides data visualization tools, including box plots for distribution analysis and scatter plots for exploring relationships between variables. Users can download the processed data for further analysis. Built with Tkinter, Pandas, Matplotlib, and Seaborn, it ensures an intuitive interface and efficient performance. Additionally, it features a custom logo, a clean UI with a green-blue theme, and options for licensing and public release. This tool is ideal for data analysts, researchers, and professionals looking to automate preprocessing without coding. 🚀
Features
- Upload Excel Files
- Automated Data Preprocessing: Removes duplicates, fills missing values, and cleans data.
- Outlier Removal: Identifies and removes outliers using the IQR method.
- Data Visualization: Boxplot: Displays data distribution and detects anomalies. Scatter Plot: Shows relationships between numerical variables.
- Processed Data Download: Save the cleaned data in Excel format.
License
MIT LicenseFollow Data Preprocessing Automate
User Reviews
-
very good