0% found this document useful (0 votes)
21 views2 pages

Case Studies

The case study discusses the necessity for financial firms and banks to leverage big data and Hadoop technologies to manage the vast amounts of data generated daily. It outlines the structure of three text files containing loan, credit card, and share information, and presents analysis tasks to identify loyal customers, users with healthy credit cards but outstanding loans, and to calculate maximum profit from share price changes. The document also requests Hadoop MapReduce code in Java for these analyses and suggests improvements in data gathering for the files.

Uploaded by

Farin Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views2 pages

Case Studies

The case study discusses the necessity for financial firms and banks to leverage big data and Hadoop technologies to manage the vast amounts of data generated daily. It outlines the structure of three text files containing loan, credit card, and share information, and presents analysis tasks to identify loyal customers, users with healthy credit cards but outstanding loans, and to calculate maximum profit from share price changes. The document also requests Hadoop MapReduce code in Java for these analyses and suggests improvements in data gathering for the files.

Uploaded by

Farin Khan
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

CASE STUDY

Now a days all enterprises, especially financial firms and banks, need to use
big data and Hadoop technologies to their fullest potential, particularly
with the overwhelming amount of data and transactions amassed on a daily
basis. In order to remain competitive and maintain current customers
while attracting new ones, banks have started planning to utilize big data
technologies. A particular bank maintains records in various text files.

These file details are as follows.


Loan_Info.txt file has Loan_id, user_id, closure_installment_date,
first_installment_date, installation_amount,
last_installment_received_date,
actual_ installation_amount_received.

Users can have more than one loan account in the bank.
As and when user pays the installment, data is inserted in
Loan_Info.txt
Loan_id, user_id, closure_installment_date, first_installment_date,
installation_amount remain the same for a given user’s loan account.
Frequency of installment is monthly for any loan account.

CreditCard_Info.txt has creditCard_no, user_id, max_credit,


outstanding_balance, due_date.
Share_Info.txt has share_id, company_name, Timestamp_Of_sharePrice,
sharePrice_value.

Write Hadoop map reduce code in java for following analysis. As a data
analyst you could also suggest any change in data gathering in all 3 files.
State yourassumptions/changes clearly if any. Add comments in your code
wherever required.
(1) Bank is interested in finding loyal customers who have a good loan
payment history. Loyal customers always pay the exact installation
amount and should not have missed any installation as on date. Find the
list of such loyal customers. (4M)

(2) Find out the list of users who have a healthy credit card but outstanding
loan amount. (4M)

(3) Whenever there is any change in share price, the details are inserted in
the Share_Info.txt file. For every share and every date, find the maximum
profit one could have made on the share on that date. Bare in mind that if
the share prices fall throughout the day max possible profit can be
negative. Opening price is the buying price of the share. Whenever there is
any change in share price, profit will change. You are expected to find max
profit on that date. (6M)

You might also like