CIS 5500 Database

Uploaded by

tanishk


CIS 5500 Project Milestone 2

Section I - Motivation
We chose this project because we saw a need for a streamlined way to find healthcare providers
by specific characteristics: proximity, specialty, cost, and ratings. Currently, a Google search for
nearby healthcare providers returns a page cluttered with ads and sponsored websites, so you do
not always find the best or most appropriate provider for your specific needs. Our website is
meant to fill that gap. Additionally, because this search already requires personal information,
we thought it would be beneficial to tell people which conditions they might be predisposed to
and educate them on what can be done. People often do not know which conditions they are
predisposed to or what steps they need to take to avoid potentially life-altering conditions.

Section II - Features that WILL be implemented

● Health Risk Assessment

○ When the user loads the website, a pop-up survey asks for important personal
information: weight, age, height, town/county, exercise habits, nutritional habits,
substance usage (alcohol, smoking, drugs, etc.), and vaccinations
(*potentially insurance plan via API Find a Provider)
○ Based on the user’s demographic and geographic information, the application
calculates potential health risks and identifies conditions common in that area
and demographic.

● Healthcare provider matching

○ On a separate page, a search bar with sliders, dropdowns, and checkboxes for
filtering healthcare providers by the criteria below
○ Criteria: radius (from town), provider type, cost, quality (*whether they accept the
user’s insurance, via API Find a Provider)
○ Output: a list of providers that meet the requirements
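
The radius criterion above could be sketched as follows. This is a minimal illustration, not the project's implementation: it assumes each provider record carries latitude and longitude, and the `Provider` fields and the `match_providers` helper are hypothetical names. Distance uses the standard haversine great-circle formula.

```python
import math
from dataclasses import dataclass

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


@dataclass
class Provider:
    # Hypothetical fields for illustration; not the project's actual schema.
    name: str
    specialty: str
    lat: float
    lon: float


def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance between two (lat, lon) points, in kilometres."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlam = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlam / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def match_providers(providers, town_lat, town_lon, radius_km, specialty=None):
    """Return (distance_km, provider) pairs within the radius, nearest first."""
    hits = []
    for p in providers:
        # Optional exact-match filter on specialty.
        if specialty is not None and p.specialty != specialty:
            continue
        d = haversine_km(town_lat, town_lon, p.lat, p.lon)
        if d <= radius_km:
            hits.append((d, p))
    # Sort on distance only, since Provider objects are not orderable.
    hits.sort(key=lambda pair: pair[0])
    return hits
```

Cost, quality, and insurance filters would slot in as further conditions inside the loop once those fields exist in the data.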

Section III - Features that MIGHT be implemented

● Appointment helper
○ If the user likes a certain healthcare provider, they can click a button such as
“Make an appointment”, which will then use something like Google Assistant to
help set up an appointment
● Auth0
○ Auth0 implementation to secure user data (entered personal details). We still
need to decide whether to delete a user’s data completely or store it so they can
use it again in the future

Section IV - List of Pages

● Home page
○ This page will serve as a home base, with an overview of our application and
buttons that bring you to the subpages.
● Health Risk Assessment
○ When this page is opened for the first time with a new account (*see possible
features with storing data), a pop-up survey is shown and the user can choose
which data to submit (most of the inputs should be optional so users can opt
out of some questions)
○ After the survey is completed, the page displays the conditions the user is
predisposed to, along with websites about each condition and early
prevention steps
● Healthcare provider matching
○ A simple search page, similar to the HW3 songs page, where the user can select
multiple options (outlined above) and then run a query on healthcare providers
based on the given criteria
● Credits
○ A simple page with acknowledgments to anything and everything used
(technologies, TA help, etc.) as well as credits to us as the authors
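
Because most survey inputs on the Health Risk Assessment page are meant to be optional, the survey result could be modeled with every field defaulting to "not answered". The sketch below is only an assumption about how that might look: the class name, the field names, and the units (kilograms, centimetres) are hypothetical. The one derived value, body mass index, uses the standard definition of weight in kilograms divided by height in metres squared.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SurveyResponse:
    # Every field is optional so users can skip questions.
    # Field names and units are assumptions for illustration.
    age: Optional[int] = None
    weight_kg: Optional[float] = None
    height_cm: Optional[float] = None
    town: Optional[str] = None
    smoker: Optional[bool] = None

    def bmi(self) -> Optional[float]:
        """Body mass index (kg / m^2), or None if weight or height was skipped."""
        if self.weight_kg is None or self.height_cm is None or self.height_cm <= 0:
            return None
        height_m = self.height_cm / 100
        return self.weight_kg / (height_m ** 2)

    def answered_fields(self):
        """Names of the questions the user actually answered."""
        return [name for name, value in vars(self).items() if value is not None]
```

Returning `None` instead of raising keeps downstream risk logic simple: any check that depends on a skipped answer can just be omitted from the results.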

Section V - Relational Schema ER Diagram

Section VI - DDL

User Table
CREATE TABLE Users (
    UserID INT PRIMARY KEY,
    DemographicInfo VARCHAR(255),
    GeographicInformation VARCHAR(255)
);

Insurance Table
CREATE TABLE Insurance (
    InsuranceID INT PRIMARY KEY,
    GeographicArea VARCHAR(255),
    PlanName VARCHAR(255),
    PlanBenefits TEXT,
    APILink VARCHAR(255)
);

Subscriber Relationship Table
CREATE TABLE Subscriber (
    UserID INT,
    InsuranceID INT,
    -- Composite key: each user/plan pair is stored at most once
    PRIMARY KEY (UserID, InsuranceID),
    FOREIGN KEY (UserID) REFERENCES Users(UserID),
    FOREIGN KEY (InsuranceID) REFERENCES Insurance(InsuranceID)
);

Healthcare Provider Table
CREATE TABLE HealthcareProvider (
    ProviderID INT PRIMARY KEY,
    Name VARCHAR(255),
    MedicalLicenseNo VARCHAR(50),
    GeographicInformation VARCHAR(255),
    InstitutionAffiliation VARCHAR(255),
    EducationalCredentials TEXT,
    PracticingSpecialty VARCHAR(255)
);

Health Condition Table
CREATE TABLE HealthCondition (
    ConditionID INT PRIMARY KEY,
    MostAffectedDemographic VARCHAR(255),
    MostAffectedGeography VARCHAR(255),
    ProviderCareSpeciality VARCHAR(255),
    PreventiveMeasures TEXT,
    RiskFactors TEXT,
    LinkToWHO VARCHAR(255)
);

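Disease Relationship Table
CREATE TABLE Disease (
    UserID INT,
    ConditionID INT,
    -- Composite key: each user/condition pair is stored at most once
    PRIMARY KEY (UserID, ConditionID),
    FOREIGN KEY (UserID) REFERENCES Users(UserID),
    FOREIGN KEY (ConditionID) REFERENCES HealthCondition(ConditionID)
);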

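Condition-Provider Relationship Table
CREATE TABLE ConditionProvider (
    ProviderID INT,
    ConditionID INT,
    -- Composite key: each provider/condition pair is stored at most once
    PRIMARY KEY (ProviderID, ConditionID),
    FOREIGN KEY (ProviderID) REFERENCES HealthcareProvider(ProviderID),
    FOREIGN KEY (ConditionID) REFERENCES HealthCondition(ConditionID)
);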
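
To show how the relationship tables above might be queried together, here is a small sketch that finds providers who treat the conditions linked to a given user. It makes several assumptions: Python's built-in `sqlite3` stands in for MySQL so the example is self-contained, only the columns needed for the join are created, all sample rows are made up, and `providers_for_user` is a hypothetical helper name.

```python
import sqlite3

# In-memory SQLite database as a stand-in for the project's MySQL instance.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE Users (UserID INT PRIMARY KEY);
CREATE TABLE HealthcareProvider (
    ProviderID INT PRIMARY KEY,
    Name VARCHAR(255),
    PracticingSpecialty VARCHAR(255)
);
CREATE TABLE HealthCondition (
    ConditionID INT PRIMARY KEY,
    ProviderCareSpeciality VARCHAR(255)
);
CREATE TABLE Disease (UserID INT, ConditionID INT);
CREATE TABLE ConditionProvider (ProviderID INT, ConditionID INT);

-- Made-up sample rows, for illustration only
INSERT INTO Users VALUES (1), (2);
INSERT INTO HealthcareProvider VALUES
    (100, 'Dr. A', 'Cardiology'),
    (101, 'Dr. B', 'Dermatology');
INSERT INTO HealthCondition VALUES (10, 'Cardiology'), (20, 'Dermatology');
INSERT INTO Disease VALUES (1, 10);
INSERT INTO ConditionProvider VALUES (100, 10), (101, 20);
""")


def providers_for_user(conn, user_id):
    """Providers treating any condition linked to the user via Disease."""
    rows = conn.execute(
        """
        SELECT DISTINCT hp.Name, hp.PracticingSpecialty
        FROM Disease d
        JOIN ConditionProvider cp ON cp.ConditionID = d.ConditionID
        JOIN HealthcareProvider hp ON hp.ProviderID = cp.ProviderID
        WHERE d.UserID = ?
        ORDER BY hp.Name
        """,
        (user_id,),
    )
    return rows.fetchall()
```

The `?` placeholder keeps the user ID out of the SQL string, which matters once the query is driven by input from the web front end.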

Section VII - Cleaning Explanation

There are several steps we can take to pre-process and clean our data before use.

1. Missing Values: For essential fields that cannot be imputed (e.g., NPI,
Provider Last Name, Provider First Name), we will consider removing rows with missing
values. For non-essential fields, we can fill missing values with a placeholder (e.g.,
"Unknown" for categorical data, or the column's median for numerical data).
2. Deduplication: Identify and remove duplicate entries to avoid redundancy. This is
particularly important for providers of healthcare insurance services that are listed
multiple times with slight variations in their address or other details, since such
duplicates could produce incorrect results.
3. Standardization: Standardize the formatting of key fields such as names, addresses, and
phone numbers to ensure consistency. This might include converting text to title case,
removing extraneous characters from phone numbers, and standardizing address
formats.
4. Data Type Conversions: We will need to ensure that each column is of the appropriate
data type. For example, ZIP codes should be treated as strings to preserve leading
zeros, and graduation years should be integers.
5. Normalization: We also plan to normalize the dataset to ensure that similar data points
are represented uniformly. This might involve unifying similar specialty names or
grouping them into broader categories to facilitate easier analysis and matching.
This matters especially for the dataset of insurance services per region: to match
users with services, records need to be grouped efficiently by distance and
proximity.
6. Feature Engineering: We will create new features that could be useful for our application.
For example, we can extract or compute the provider's years of experience from the
graduation year, or create flags indicating if the provider offers telehealth services based
on the Telehealth field.
7. Specialty Handling: The dataset contains multiple columns for specialties
(pri_spec, sec_spec_1, sec_spec_2, etc.). We can aggregate these into a single column
or a structured format (like a list) associated with each provider to simplify
querying and analysis.
8. Geographic Data: For fields like City/Town, State, and ZIP Code, we will ensure these
are correctly formatted and consider creating a combined location field. This step is
especially important because the application's geolocation features depend on these
fields.
9. Binary/Indicator Fields: For fields like Telehlth, ind_assgn, and grp_assgn, we will ensure
they are consistently coded (e.g., Y/N converted to True/False) to facilitate analysis and
filtering.
10. Lastly, to make sure our data is consistent and correct, we will also have a validity check
for all the geographic information and specialties.
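
Several of the steps above (essential-field checks, ZIP codes as strings, Y/N flags, specialty aggregation, years of experience, and deduplication) can be sketched in plain Python. This is an illustration under assumptions, not our final pipeline: the same logic would likely be written with pandas, the column names `zip` and `grad_year` are hypothetical, and treating (NPI, ZIP) as the duplicate key is just one possible choice.

```python
SPECIALTY_COLUMNS = ("pri_spec", "sec_spec_1", "sec_spec_2")


def normalize_zip(value):
    """Return a 5-digit ZIP string, restoring leading zeros lost to numeric parsing."""
    digits = "".join(ch for ch in str(value) if ch.isdigit())
    if not digits:
        return "Unknown"
    if len(digits) > 5:  # ZIP+4: pad to 9 digits, keep the 5-digit prefix
        return digits.zfill(9)[:5]
    return digits.zfill(5)


def yn_to_bool(value):
    """Map 'Y'/'N' to True/False; anything else becomes None."""
    text = str(value).strip().upper()
    if text == "Y":
        return True
    if text == "N":
        return False
    return None


def collect_specialties(raw):
    """Merge the specialty columns into one title-cased list without repeats."""
    seen = []
    for col in SPECIALTY_COLUMNS:
        spec = str(raw.get(col) or "").strip().title()
        if spec and spec not in seen:
            seen.append(spec)
    return seen


def clean_record(raw, current_year):
    """Return a cleaned copy of one provider row, or None if NPI is missing."""
    npi = str(raw.get("NPI") or "").strip()
    if not npi:
        return None  # essential field that cannot be imputed
    grad = str(raw.get("grad_year") or "").strip()  # hypothetical column name
    grad_year = int(grad) if grad.isdigit() else None
    return {
        "NPI": npi,
        "zip": normalize_zip(raw.get("zip")),  # hypothetical column name
        "specialties": collect_specialties(raw),
        "telehealth": yn_to_bool(raw.get("Telehlth")),
        "grad_year": grad_year,
        "years_experience": (
            current_year - grad_year if grad_year is not None else None
        ),
    }


def clean_all(rows, current_year):
    """Clean every row and drop duplicates keyed on (NPI, ZIP)."""
    cleaned, seen = [], set()
    for raw in rows:
        rec = clean_record(raw, current_year)
        if rec is None:
            continue
        key = (rec["NPI"], rec["zip"])
        if key in seen:
            continue
        seen.add(key)
        cleaned.append(rec)
    return cleaned
```

Normalizing the ZIP before building the duplicate key is deliberate: two rows that differ only in a dropped leading zero then collapse into one.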

Section VIII - Technologies that will most likely be used

● React.js
● Node.js
● MySQL
● AWS
● JavaScript
● HTML
● CSS
● Python
○ pandas (pre-cleaning)
○ numpy (pre-cleaning)
● GitHub
○ GitHub Pages (automatic deployment?)
● Auth0 (potentially)
● Hugo (website templater, potentially?)

Section IX - Responsibilities

1) Tanish Kelkar | [email protected] | GitHub: TanishKelkar
a) Great at UI design.
b) Will work on overall design, clean data and manage integrations
2) Max Mercado | [email protected] | GitHub: maxmerc
a) Best at SQL queries, front-end (HTML, CSS, React)
b) Will work on back-end and creating the provider matching algorithm
3) Ryan Kertzner | [email protected] | GitHub: rkertz
a) Great at front-end and React
b) Will work on developing the pages for the Health Risk Assessment
4) Seher Taneja | [email protected] | GitHub: sehertaneja
a) Great at algorithms and overall optimization
b) Will work on creating the page for healthcare provider matching and survey.
