Code With Dates HARDCODED

This document contains code to process missing data in Excel sheets. It imports necessary libraries, reads in an XML file to get sheet names and missing data rules. It then reads each sheet, cleans the data, generates a datetime index, and iterates through columns applying missing data rules to fill in missing values. The cleaned sheets are then written to a new Excel file.

Uploaded by

Bhanu Suravarapu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views2 pages

Code With Dates HARDCODED

Uploaded by

Bhanu Suravarapu

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 2

import pandas as pd

import random
import math
import numpy as np

import xml.etree.ElementTree as ET
result_list = []
tree = ET.parse('DM SHORT.xml')
root = tree.getroot()
missing_data = root.find('MissingData')
for sheet in root.findall('Sheet'):
sheet_name = sheet.find('Name').text
print('Sheet:', sheet_name)
# Read the excel file
df = pd.read_excel('/content/dummy.xlsx', sheet_name=sheet_name, header=2)

df.replace(["NW","Nw","-"], np.nan, inplace=True)

df['DATE'].fillna(method='ffill', inplace=True)
dateTime = pd.date_range(start='2021.12.31', end='01.08.2022', freq='1T')
result = pd.DataFrame({'DateTime': dateTime})

columns = sheet.find('Columns')
include_columns = columns.find('IncludeColumns')
for column in include_columns.findall('Column'):
col_name = column.find('Name').text

print(col_name)
missing_data = column.find('MissingData')
if missing_data is not None:
limit = missing_data.find('Limit')
if limit is not None:
print("Missing Data Limit for Column:", limit.text)
last_known_value = missing_data.find('LastKnownValue')
if last_known_value is not None:
print("Last Known Value for Column:", last_known_value.text)
result[col_name] = None

for i in result.index:
try:
target_date = result.at[i, 'DateTime'].strftime('%d.%m.%Y')
target_hour = result.at[i, 'DateTime'].strftime('%H:00')

filtered_df = df[(df.DATE == target_date) & (df.Time ==

target_hour)]
if not filtered_df.empty:
hour_val = filtered_df.iloc[0][col_name]
if not (isinstance(hour_val, float) and math.isnan(hour_val)):
result.at[i, col_name] = hour_val
except KeyError:
pass

if result.at[i, col_name] is None:

if limit is None:
# Get the previous non-null value for this column
prev_vals = result[col_name].loc[:i-1]
[result[col_name].notnull()]
if not prev_vals.empty:
prev_val = prev_vals.iloc[-1]
try:
prev_val = float(prev_val)
except ValueError:
prev_val = np.nan
if not math.isnan(prev_val):
result.at[i, col_name] = prev_val
else:
limit_value = float(limit.text)
random_val = random.uniform(0, limit_value)
result.at[i, col_name] = random_val

result.set_index('DateTime', inplace=True)
result.index = result.index.strftime('%d.%m.%Y %H:%M:%S')
result_list.append((sheet_name, result))

print(result_list)
# Write the results to an Excel file
with pd.ExcelWriter('RES1.xlsx') as writer:
for sheet_name, result in result_list:
result.to_excel(writer, sheet_name=sheet_name)

The Manual For The Quality Management of Educational Programmes in Myanmar
100% (1)
The Manual For The Quality Management of Educational Programmes in Myanmar
160 pages
Turning Great Strategy Into Great Performance
100% (3)
Turning Great Strategy Into Great Performance
22 pages
Dev Lab Record
No ratings yet
Dev Lab Record
21 pages
T.E. (Computer Science I & II)
100% (6)
T.E. (Computer Science I & II)
20 pages
AI Practical 2025
No ratings yet
AI Practical 2025
14 pages
Cambridge Advanced Practice Tests 2015
0% (1)
Cambridge Advanced Practice Tests 2015
17 pages
Natural Dyeing of Cotton Fabric
100% (2)
Natural Dyeing of Cotton Fabric
35 pages
DAV Previous Year
No ratings yet
DAV Previous Year
7 pages
Apr 2023
No ratings yet
Apr 2023
32 pages
Dissertation Plan Par Opposition
100% (2)
Dissertation Plan Par Opposition
4 pages
Unit 4
No ratings yet
Unit 4
25 pages
Data Cleaning
No ratings yet
Data Cleaning
22 pages
Part A Assignment 6
No ratings yet
Part A Assignment 6
28 pages
ML Ex2
No ratings yet
ML Ex2
7 pages
Niact 2
No ratings yet
Niact 2
25 pages
Solution
No ratings yet
Solution
8 pages
QP DAV 3rd Sem Dec 2023
No ratings yet
QP DAV 3rd Sem Dec 2023
12 pages
Group 10A - GA2
No ratings yet
Group 10A - GA2
10 pages
Cleaning Data in Python
No ratings yet
Cleaning Data in Python
8 pages
Unit3 - Cleaning - Preparing - Data - Jupyter Notebook
No ratings yet
Unit3 - Cleaning - Preparing - Data - Jupyter Notebook
10 pages
Geo Python Doc (1) 7,8 Bavesh
No ratings yet
Geo Python Doc (1) 7,8 Bavesh
9 pages
Fda E0323040 20 12 24
No ratings yet
Fda E0323040 20 12 24
4 pages
AI ML
No ratings yet
AI ML
8 pages
LAB FILE-Shelly Sharma
No ratings yet
LAB FILE-Shelly Sharma
47 pages
Bank Loan Case Study
No ratings yet
Bank Loan Case Study
71 pages
Victoria Code of Practice For Using Concrete Pump
0% (1)
Victoria Code of Practice For Using Concrete Pump
56 pages
Wa0012.
No ratings yet
Wa0012.
30 pages
Project Prog
No ratings yet
Project Prog
6 pages
Data Wrangling 2
No ratings yet
Data Wrangling 2
4 pages
Report
No ratings yet
Report
25 pages
Record PDF
No ratings yet
Record PDF
5 pages
Python Assignment-2
No ratings yet
Python Assignment-2
3 pages
Lab 1
No ratings yet
Lab 1
12 pages
LDB MP2020 FRMWRK
No ratings yet
LDB MP2020 FRMWRK
77 pages
What Is Defensive Driving?
No ratings yet
What Is Defensive Driving?
3 pages
Module 9: Social and Resources Mobilization: An Approach in The Implementation of Civic Welfare and Training Services
No ratings yet
Module 9: Social and Resources Mobilization: An Approach in The Implementation of Civic Welfare and Training Services
25 pages
Week 2
No ratings yet
Week 2
2 pages
AP Chemistry Solubility Rules Equations Sheet
100% (1)
AP Chemistry Solubility Rules Equations Sheet
8 pages
To 15a8-4-10-3 Navair 03-30ak-103
No ratings yet
To 15a8-4-10-3 Navair 03-30ak-103
42 pages
Absenteeism Module
No ratings yet
Absenteeism Module
2 pages
Pandas Revision1
No ratings yet
Pandas Revision1
2 pages
Numpy Boolean Indexing: Filter
No ratings yet
Numpy Boolean Indexing: Filter
39 pages
Cs Sem III Dav Upc 2343012002 Sl. No. Qp. 1673 Dec '23
No ratings yet
Cs Sem III Dav Upc 2343012002 Sl. No. Qp. 1673 Dec '23
12 pages
Code
No ratings yet
Code
2 pages
cdp201 10 11 2023
No ratings yet
cdp201 10 11 2023
17 pages
Mycbseguide: Class 12 - Accountancy Sample Paper 07
No ratings yet
Mycbseguide: Class 12 - Accountancy Sample Paper 07
15 pages
RSPile Tutorials - 1 - Axially Loaded Piles
No ratings yet
RSPile Tutorials - 1 - Axially Loaded Piles
14 pages
B. Sc. H Computer S FkQNyBB
No ratings yet
B. Sc. H Computer S FkQNyBB
6 pages
SOLUTION, SUSPENSION and COLLOID Activity Sheet
67% (3)
SOLUTION, SUSPENSION and COLLOID Activity Sheet
1 page
Co Digit Ooo
No ratings yet
Co Digit Ooo
15 pages
Cefasabal Underland - 2011 - CAM Reviews Serenoa Repens For Benign Prostatic Hyperplasia-2
No ratings yet
Cefasabal Underland - 2011 - CAM Reviews Serenoa Repens For Benign Prostatic Hyperplasia-2
2 pages
Detailed Lesson Plan
No ratings yet
Detailed Lesson Plan
6 pages
Unit3 - 3) Pandas - Ipynb - Colab
No ratings yet
Unit3 - 3) Pandas - Ipynb - Colab
11 pages
Start Date
No ratings yet
Start Date
2 pages
Sowmi DS
No ratings yet
Sowmi DS
27 pages
Individual Performance Commitment and Review Form: Tabuk City Division Tuga National High School Tuga, Tabuk City
No ratings yet
Individual Performance Commitment and Review Form: Tabuk City Division Tuga National High School Tuga, Tabuk City
10 pages
Cardiologie MANUAL
50% (12)
Cardiologie MANUAL
15 pages
Practical Questions
No ratings yet
Practical Questions
7 pages
Data Preprocessing 1
No ratings yet
Data Preprocessing 1
6 pages
Oracle Database 12c: OR1, 5 Tage
No ratings yet
Oracle Database 12c: OR1, 5 Tage
1 page
Project Intern - Jupyter Notebook
No ratings yet
Project Intern - Jupyter Notebook
16 pages
ML Lab Manual Final
No ratings yet
ML Lab Manual Final
36 pages
English - Question - Paper (HW-1)
No ratings yet
English - Question - Paper (HW-1)
1 page
Exp-12 Iaiml
No ratings yet
Exp-12 Iaiml
13 pages
10) Merging Dataframes: # Detecting Duplicates
No ratings yet
10) Merging Dataframes: # Detecting Duplicates
7 pages
DA Lab
No ratings yet
DA Lab
27 pages
Pandas Introduction: What Is Python Pandas Used For?
No ratings yet
Pandas Introduction: What Is Python Pandas Used For?
28 pages
Pro Proctor User Guide
No ratings yet
Pro Proctor User Guide
24 pages
Lesson 3 - Week 1
No ratings yet
Lesson 3 - Week 1
28 pages
Data Cleaning in Python
No ratings yet
Data Cleaning in Python
6 pages
Dav 2024 Pyq
No ratings yet
Dav 2024 Pyq
7 pages
Exp 3
No ratings yet
Exp 3
10 pages
What Can You Do With Dataframes Using Pandas?: Pandas Is A High-Level Data Manipulation Tool Developed by Wes Mckinney
No ratings yet
What Can You Do With Dataframes Using Pandas?: Pandas Is A High-Level Data Manipulation Tool Developed by Wes Mckinney
10 pages
Avinash DA 6
No ratings yet
Avinash DA 6
3 pages
The Star Weaver
No ratings yet
The Star Weaver
2 pages
Code Explanation For Date Types
No ratings yet
Code Explanation For Date Types
8 pages
ASI Show Orlando 2025 Exhibitor List
No ratings yet
ASI Show Orlando 2025 Exhibitor List
16 pages
Pandas Syntax Revision For ML
No ratings yet
Pandas Syntax Revision For ML
10 pages
DataAnalytics Lab Manual
No ratings yet
DataAnalytics Lab Manual
35 pages
Group A Assignment No2 Writeup
No ratings yet
Group A Assignment No2 Writeup
9 pages
Important Pandas Operations 1697910759
No ratings yet
Important Pandas Operations 1697910759
6 pages
Pandas Data Manipulation Extended CheatSheet 1731972219
No ratings yet
Pandas Data Manipulation Extended CheatSheet 1731972219
9 pages
Paidout Policies
No ratings yet
Paidout Policies
2 pages
Assignment 2 Ds
No ratings yet
Assignment 2 Ds
8 pages
ML LAB Manual-1
No ratings yet
ML LAB Manual-1
33 pages
Step-by-Step Explanation of Python Data Preprocessing Script
No ratings yet
Step-by-Step Explanation of Python Data Preprocessing Script
9 pages
Practice Questions2
No ratings yet
Practice Questions2
2 pages
Information Technology British English Teacher B2 C1
No ratings yet
Information Technology British English Teacher B2 C1
13 pages
The Trade - Offs of ChatGPT To Filipino Freelance Content Writers A Diffusion of Innovation Theory Perspective
No ratings yet
The Trade - Offs of ChatGPT To Filipino Freelance Content Writers A Diffusion of Innovation Theory Perspective
7 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet

Code With Dates HARDCODED

Uploaded by

Code With Dates HARDCODED

Uploaded by

import pandas as pd

df.replace(["NW","Nw","-"], np.nan, inplace=True)

filtered_df = df[(df.DATE == target_date) & (df.Time ==

if result.at[i, col_name] is None:

You might also like