Ph.D. Research Proposal Doctoral Program in Computer Science
Ph.D. Research Proposal Doctoral Program in Computer Science
Research proposal
Doctoral program in Computer Science
1
RESEARCH BACKGROUND
My goal in Postgraduate (Computer Science) studies is to prepare myself for career in Computer
Science research and teaching.
Later, I joined Indus University, where I chose Computer Science as Postgraduate major core
courses and curriculum have helped me to get sound grip of fundamentals of almost all aspects of
Computer Science, which is reflected by my position as among top ten students in class, I also
discovered intellectual satisfaction of solving difficult problems on my own. I spend some time
working with research scholar in Department of Computer Science that is led to research proposal,
because of this and my success in undergraduate courses.
My two-year Postgraduate in (Computer Science), that I am trying to learned core courses as much
of theory as possible in my Postgraduate program which was Cloud Computing, Data Science and
Research Methodology, as electives in the Two Year and got desired results in algorithms, under
guidance of Assoc. Prof. Dr. Zeeshan Ahmed in theory, where in I have analyzed the data of data
sets on “Home Energy Consumption in data science”. On behalf of theory I own the title of
publishing two research papers one in Data Science ( Analysis of Home Energy Consumption by
K-Mean) and (Using Naïve Bayes Algorithms to Students’ bachelor Academic Performances
Analysis). Currently, my research interests are in Human Collaboration within the field of Big
Data and Data Mining by using Data Processing and Data Modeling.
I look forward to joining your department as PhD scholar. I am confident enough that postgraduate
studies in Indus University will prove to be rewarding task and able to fulfill my career objectives.
2
INTRODUCTION
3
The procedures use in analytical apparatuses essential discovers patterns, trends, and associations
over diversity of period horizons in statistics (Oswal et al., 2013). When examining, these
apparatuses visualize findings in tables, graphs, and spatial diagrams for effectual decision making.
Consequently, big data analysis was serious experiment for various applications because of data
complexity and scalability of fundamental procedures that support such methods (Candela,
2012).Finally, conduct extensive simulation using Hadoop based analysis by considering simulated
scenario of social and technical network. From results and discussion, that proposed system
architecture is feasible for analyzing human behavior using big data (Ahmad, 2017).Data
producers, humans constantly generate huge amount of text data and behavior data. Joint modeling
of such diverse types of data ensures in-depth understanding of humans. On the other hand, as
knowledge consumers, different users interact with various systems to fulfill their idiosyncratic
intents. Traditional static, ad-hoc and passive machine human interactions are inadequate to
optimize such dynamic decision making process; and thus online adaptively learning driven by the
feedback from humans is desired.
Social structure of human behavior is also coming from relationship; objects (owned by a user,
such mobile, laptop, etc.) establish their relationship either by working at the same place. It is
similar to human’s way of life, e.g., human’s share personal experiences in public. These
relationships are established as a location-based application or situation-based application (Atzori,
2011).Figure 2Shows Users are using these technologies for a longer period of time that devices
are properly authorized by users and they are used to exchange data, e.g., contacts, messages,
online purchasing, etc. the behavior can be derived from the interaction of these devices. These
regularities in terms of variations and frequencies for human behavior are daily activities, such
reading/sending emails, messages, Facebook likes or sharing videos or other data, sports, online
shopping, and other work patterns are usually considered to be random.
4
Figure 2. Human activities
OBJECTIVE
LITERATURE REVIEW
Marjani et al. (2016) studies that big amounts of data were produced since past period as
contraction of Internet of things (IoT) expedients increases, however; data was not beneficial
without analytic influence. Various big data, IoT, and analytics results were facilitated persons to
acquire appreciated intuition into bulky data generated through IoT expedients. Conversely,
solutions were still in early period, and domain deficiencies inclusive assessment. It considered up-
to-date study efforts fixed to large IoT data analytics that was proposed new architecture. Finally,
5
exposed research experiments, for example confidentiality, big data mining, visualization, and
integration, were offered.
Landge and Gumaste (2015) investigated that Actual activity recognition with body sensor
networks was significant and challenging assignment. Towards capture human activity and
examine that data, both activity was very important in anomalous activity detection. It presents
automated method to activity tracking that recognizes mutual activity that happens in routine of
individual. It was binary classifier which was used to classified two data’s classes which could
track incidence of systematic actions to supervise useful health and to detect modifications in
routine of human life. It was define data mining’s activity and pursuing it in image processing
method’s arena and validate procedures on data composed from isolated sensors.
Driscoll (2010) describes Big Data as dataset that was greater than sure threshold, for instance
terabyte although others express Big Data that shows structures of big volume, velocity, and
variety. Johnson (2012) also studies very large data sets associated to customer manners, public
web posts, geo tagging, and sensor productivities by Big Data. Researchers well-defined Big Data
through volume, velocity, variety, veracity and value. Chow and Green (2013) represent social
movement in societies to data-driven decision making. Boyd and Crawford (2012) studies big data
was traditional, scientific, and scholarly realism that offered capability to collection of big quantity
of data investigate patterns, recognized financial, public, practical, and high level of extracted.
IBM (2011) torrent of attention in big data conception was quiet of infancy and an application was
extensive and diverse.
Goes (2014) suggested that was no particular big data definition, corporation managers was tried to
take steps around innovative wave, because data was rapidly fetching fresh capital’s form , diverse
currency, and new value’s source.
Maimon and Rokach (2010) studies that data mining exertion was dispersed to numerous collude
negotiator human. Impartial of data mining exertion was to outturn elucidations to attempted
problem which was treated well by certain metric, with esteem to those results that would attained
by distinct, non-collaborate negotiator. Results require assessment, contrast, and methods for
permutation as association needs conversation, and involve societies system. Establishing societies
in efficient way was non- resistant and usually needs unambiguous roles and procedures.
6
Sharma and Wang(2017) suggested that big data analytics was received significant interest in
diversity of field comprised commerce, economics, health care and IoT (Internet of Things) was
measured like significant platform in carrying public, method, statistics and things collectively in
organize to improve excellence of daily life. Still, challenge were how to efficiently extract
functional characteristic from huge amount of heterogeneous data created by source constrained
IoT devices information to offer immediate and reaction to users, and how to exploit data
conscious intellect in attractive performance of IoT networks. While cloud computing for attends
to problem in data analytics, that was individual benefits and boundaries. Researchers offered new
frame for synchronized processing cloud by incorporate reward from platform. It could utilize
knowledge and historical information accessible at cloud middle to lead edge computing
component towards pleasing diverse act necessities of heterogeneous IoT networks.
Charissis and Papanastasiou (2008) investigated innovative plan for self-propelled full-windshield
head-up display interfaces that goal to enhance driver’s dimensional sentience and retort eras with
lower discernibility settings. To accomplish these necessities, contrive and enforced occupied
archetype of human–machine interface. Certain accent was retained on prioritizing and efficacious
appearance of information accessible done with vehicular sensors, this would assistance, devoid of
disrupting, driver in efficaciously traversing vehicle in small visibility situations. Future interface is
built on simple optical appearance of actual substances to render a novel style of cooperative
regulation for motorway environments. Generally, work deliberates strategy contest of human–
machine system, enlarges on interface strategy idea and offerings result of manipulator efforts
which compared efficiency of planned HUD in contradiction of head-down display.
Extraction: Statistics composed was not layout prepared for analysis that was deliberate gathering
of automatic health records in clinic consist of copy out notations from a number of doctors,
structured data from devices and capacities, and image information. It could not sabbatical
information in form and quiet well analyzed which necessitate acts mining method that attractions
available statistics from causal sources and states in structured.
Integration: Torrent of statistics that was not sufficient merely to highest and toss it into
repository. Statistics from range of scientific experiments, repository with data sets, that was
improbable someone would forever be capable to discovery.
Modeling: Approaches aimed at query and mining huge statistics was vital diverse from traditional
analysis on minor samples. However, noisy BD was valued than small samples because overall
8
statistics obtain from repeated patterns and correlation frequently suppress distinct fluxes and
reveal more consistent concealed patterns and knowledge. Additional, interrelated BD forms big
heterogeneous statistics grids, with which data redundancy could be discovered to reward for
missing statistics, to authenticate reliable associations, to reveal intrinsic clusters, and to expose
concealed associations.
Interpretation: ability to analyze data of limited value if users could not comprehend study.
Decision maker, delivered with analysis’s outcome, has toward interpreted outcomes which was
involved investigative all assumptions made and reviewing study. Furthermore, it was various
potential bases of mistake; computer schemes could have bugs, models constantly have
assumptions, and outcomes could be founded on wrong data. Through reviewing how finest to
capture, stock, and query provenance, in aggregation with methods to arrest sufficient meta data,
that could generated substructure to delivered consumers with capability both to interpret analytical
outcomes achieved and to replication analysis with diverse assumptions and data sets.
The education backbone stand greatly of study effort by specific helped discussion group from
(university name) university in addition spouse talent affiliates related to distance learning
methods, feature learning, machine learning, data mining, pattern recognizing, hip addition, an
occupied stay towards two-way societies existing analysis of data assimilation methods then data
analysis modeling, data analysis processing its alteration to inferior leeway areas. The general
approach determination constructed the proceeding plans delineated and according to PhD.
Advisor. management, preprocessing and distribution will be progressive focused on approaches
specified through appropriate architecture.
o Coursework and current the architecture solutions with the help literature review.
o Appraisal allusion software consumer attendants connected through kit and booklets,
10
documents.
o Literature review to use best approach for data modeling and clustering.
o Distance learning and features learning techniques and related systems literature
o Evaluation of features and extracting features from videos with the help of existing different
applications.
o Improvement of system
Formulate
Hypothesis/Conduct
Experimental
Work
Journal Findings,
Papers Publishing
Compilation of
Thesis
REFERENCES
Ahmad.A, 2017, Socio-Cyber Network: The Potential of Cyber-Physical System to Define Human
Behaviors using Big Data Analytics. Future Generation Computer Systems.
Steve Lohr, 2012, The Age of Big Data. New York Times,
http://www.nytimes.com/2012/02/12/sunday-review/big-datas-impact-in-the-world.html
Kambatla, K., et al., 2014, Trends in big data analytics. Journal of Parallel and
DistributedComputing.74 (7):p.2561-2573.
Landge.A, Gumaste. S.V, 2015, Anomalous Human Activity Detection Using SVM and Active
Leaning Based Approach, IJIRCCE, 3(9)
Beyer, M., SolvingG. S, 2011, Big Data Challenge Involves More Than Just Managing Volumes of
Data.
Candela, L., D. Castelli, and P. Pagano, 2012, Managing big data through hybrid data
infrastructures. ERCIM News, 89: p. 37-38.
Atzori, Luigi, Iera. A, and Morabito. G, 2011,"Siot: Giving a social structure to the internet of
things." IEEE communications letters 15, no. 11: 1193-1195.
Oswal, S. and S. Koul. 2013, Big Data Analytic and Visualization On Mobile Devices.
inProceedings of National Conference on New Horizons in IT-NCNHIT.
12
Peter A. Chow. W, Sandy E. Green.J.R, 2013,” Data Mining Difference in the Age of Big Data:
Communication and the Social Shaping of Genome Technologies from 1998 to 2007.”
International Journal of Communication 7, 556–583
David L. Banks and Yasmin H. Said, 2006,” Data Mining in Electronic Commerce.” Statistical
Science, Vol. 21, No. 2, A Special Issue on Statistical Challenges and Opportunities in Electronic
Commerce Research, pp. 234-246.
Kurt. F and Rita.G, 2013, “Big Data: Implications for Financial Managers.” The Journal of
Corporate Accounting &Financ.
BarkerJ. F, Peter A. Chow. W, Charters. A and Dung. H, 2016, “Genomic Big Data and Privacy:
Challenges and Opportunities for Precision Medicine.” Computer Supported Cooperative Work
(CSCW) , Volume 25, Issue 2–3, pp 115–136
Tawny.S and Brian.D. J, 2012, Entertainment in the Age of Big Data.Proceedings of IEEE Vol.
100.
Danah.B and Kate.C, 2012, CRITICAL QUESTIONS FOR BIG DATA, Information,
Communication & Society, 15:5, 662-679.
Marjani. M, Nasaruddin. F, Gani. A, Karim.A, Hashem. I.A. T, Siddiqa. A, Yaqoob. I, 2016, Big
IoT Data Analytics: Architecture, Opportunities, and Open Research Challenges, IEEE
Sharma. S. K, Wang. X, 2017, Live Data Analytics with Collaborative Edge and Cloud Processing
in Wireless IoT Networks, IEEE
13