Case Studies
Case Studies
Now a days all enterprises, especially financial firms and banks, need to use
big data and Hadoop technologies to their fullest potential, particularly
with the overwhelming amount of data and transactions amassed on a daily
basis. In order to remain competitive and maintain current customers
while attracting new ones, banks have started planning to utilize big data
technologies. A particular bank maintains records in various text files.
Users can have more than one loan account in the bank.
As and when user pays the installment, data is inserted in
Loan_Info.txt
Loan_id, user_id, closure_installment_date, first_installment_date,
installation_amount remain the same for a given user’s loan account.
Frequency of installment is monthly for any loan account.
Write Hadoop map reduce code in java for following analysis. As a data
analyst you could also suggest any change in data gathering in all 3 files.
State yourassumptions/changes clearly if any. Add comments in your code
wherever required.
(1) Bank is interested in finding loyal customers who have a good loan
payment history. Loyal customers always pay the exact installation
amount and should not have missed any installation as on date. Find the
list of such loyal customers. (4M)
(2) Find out the list of users who have a healthy credit card but outstanding
loan amount. (4M)
(3) Whenever there is any change in share price, the details are inserted in
the Share_Info.txt file. For every share and every date, find the maximum
profit one could have made on the share on that date. Bare in mind that if
the share prices fall throughout the day max possible profit can be
negative. Opening price is the buying price of the share. Whenever there is
any change in share price, profit will change. You are expected to find max
profit on that date. (6M)