Python Developer Assignment
Python Developer Assignment
Datasets
Consider the following datasets
Transactions: https://websdk-assets.s3.ap-south-
1.amazonaws.com/public/txns+(13).csv
This dataset represents “loans” booked. “transactionId” is a uniqueId for a
loan. “day” is the date on which the loan was confirmed.
“downPaymentAmount” is the amount given by the customer upfront for the
loan (in paise). “emiPrincipal” is the principal component of emi that the user
is expected to pay per month (in paise). “emiInterest” is the interest
component of the emi that the user is expected to pay per month (in paise).
“tenure” is the loan tenure in months (number of emis).
Repayments:https://websdk-assets.s3.ap-south-
1.amazonaws.com/public/repayments+(2).csv
This dataset represents the payments made by customers towards their
loans (inlcuding down payments and emi payments). “transactionId”
represents the loan towards which payment was made. "repaymentDay”
represents date on which the payment was made. “amount” is the amount
paid towards the loan.
Queries
1. For every loan what is the current outstanding (unpaid) principal amount (in
rupees). Expected output columns “transactionId”, “outstandingAmount’.
Submission instructions
The submission should be a python notebook file (ipynb) along with the
requirements as requirements.txt or as a Pipfile. Running all cells should:
a. The variables holding the URLs should be declared in the first cell so
that it is convenient to replace them with other datasets that we want to
test your code with. The code should automatically fetch through
internet / local filesystem basis the URL protocol used (https/http/file).
2. render pandas dataframes for all the queries above (no need to write to
CSVs etc.).