Modeling Information Diffusion in Online Social Networks with Partial Differential Equations
By Haiyan Wang, Feng Wang and Kuai Xu
()
About this ebook
Haiyan Wang
He completed his doctorate in mathematics, while also earning a master's degree in computer science at Michigan State University in 1997. He worked as a full-time software engineer in industry for almost ten years before joining Arizona State University. Dr. Wang’s research interests include applied mathematics, data science, differential equations, online social networks. He has published numerous articles in scholarly journals and a book entitled, “Modeling Information Diffusion in Online Social Networks with Partial Differential Equations, Springer, 2020. Recently he developed and taught a course, Mathematical Methods in Data Science, at Arizona State University.
Related to Modeling Information Diffusion in Online Social Networks with Partial Differential Equations
Titles in the series (1)
Modeling Information Diffusion in Online Social Networks with Partial Differential Equations Rating: 0 out of 5 stars0 ratings
Related ebooks
Linear and Generalized Linear Mixed Models and Their Applications Rating: 0 out of 5 stars0 ratingsAudit Studies: Behind the Scenes with Theory, Method, and Nuance Rating: 0 out of 5 stars0 ratingsSocial and Economic Networks Rating: 4 out of 5 stars4/5Data and the American Dream: Contemporary Social Controversies and the American Community Survey Rating: 1 out of 5 stars1/5Domain-Specific Knowledge Graph Construction Rating: 0 out of 5 stars0 ratingsEcological Models and Data in R Rating: 5 out of 5 stars5/5Information Geometry and Its Applications Rating: 0 out of 5 stars0 ratingsMathematical Decision Making: Predictive Models (Transcript) Rating: 5 out of 5 stars5/5How We Understand Mathematics: Conceptual Integration in the Language of Mathematical Description Rating: 0 out of 5 stars0 ratingsData Mining for the Social Sciences: An Introduction Rating: 0 out of 5 stars0 ratingsAdoption of Data Analytics in Higher Education Learning and Teaching Rating: 0 out of 5 stars0 ratingsConnectivity Prediction in Mobile Ad Hoc Networks for Real-Time Control Rating: 5 out of 5 stars5/5Understanding Leadership in Complex Systems: A Praxeological Perspective Rating: 0 out of 5 stars0 ratingsNumerical Methods for Stochastic Computations: A Spectral Method Approach Rating: 5 out of 5 stars5/5Robust Estimation and Testing Rating: 3 out of 5 stars3/5Statistical Analysis of Network Data with R Rating: 2 out of 5 stars2/5Cities as Spatial and Social Networks Rating: 0 out of 5 stars0 ratingsBig Data: How Data Analytics Is Transforming the World (Transcript) Rating: 0 out of 5 stars0 ratingsThe NSTA Quick-Reference Guide to the Three Dimensions Rating: 0 out of 5 stars0 ratingsBiometric Recognition: 13th Chinese Conference, CCBR 2018, Urumqi, China, August 11-12, 2018, Proceedings Rating: 0 out of 5 stars0 ratingsIntroduction to Statistics in Metrology Rating: 0 out of 5 stars0 ratingsDeep Learning: Convergence to Big Data Analytics Rating: 0 out of 5 stars0 ratingsCreating Engineering Design Challenges: Success Stories From Teachers Rating: 0 out of 5 stars0 ratingsSpatial Analysis Along Networks: Statistical and Computational Methods Rating: 0 out of 5 stars0 ratingsScience and Engineering Projects Using the Arduino and Raspberry Pi: Explore STEM Concepts with Microcomputers Rating: 0 out of 5 stars0 ratingsEmergent Nested Systems: A Theory of Understanding and Influencing Complex Systems as well as Case Studies in Urban Systems Rating: 0 out of 5 stars0 ratingsModern Mathematical Statistics with Applications Rating: 0 out of 5 stars0 ratingsMathematics for the Life Sciences Rating: 0 out of 5 stars0 ratingsConflict and Complexity: Countering Terrorism, Insurgency, Ethnic and Regional Violence Rating: 0 out of 5 stars0 ratings
Mathematics For You
My Best Mathematical and Logic Puzzles Rating: 4 out of 5 stars4/5Standard Deviations: Flawed Assumptions, Tortured Data, and Other Ways to Lie with Statistics Rating: 4 out of 5 stars4/5Calculus Made Easy Rating: 4 out of 5 stars4/5Real Estate by the Numbers: A Complete Reference Guide to Deal Analysis Rating: 0 out of 5 stars0 ratingsQuantum Physics for Beginners Rating: 4 out of 5 stars4/5What If?: Serious Scientific Answers to Absurd Hypothetical Questions Rating: 5 out of 5 stars5/5Sneaky Math: A Graphic Primer with Projects Rating: 0 out of 5 stars0 ratingsBasic Math & Pre-Algebra For Dummies Rating: 4 out of 5 stars4/5The Little Book of Mathematical Principles, Theories & Things Rating: 3 out of 5 stars3/5Math Magic: How To Master Everyday Math Problems Rating: 3 out of 5 stars3/5The Everything Guide to Algebra: A Step-by-Step Guide to the Basics of Algebra - in Plain English! Rating: 4 out of 5 stars4/5Algebra - The Very Basics Rating: 5 out of 5 stars5/5Algebra II For Dummies Rating: 3 out of 5 stars3/5Pre-Calculus For Dummies Rating: 5 out of 5 stars5/5Relativity: The special and the general theory Rating: 5 out of 5 stars5/5Alan Turing: The Enigma: The Book That Inspired the Film The Imitation Game - Updated Edition Rating: 4 out of 5 stars4/5Mental Math Secrets - How To Be a Human Calculator Rating: 5 out of 5 stars5/5Logicomix: An epic search for truth Rating: 4 out of 5 stars4/5Algebra I Workbook For Dummies Rating: 3 out of 5 stars3/5The Golden Ratio: The Divine Beauty of Mathematics Rating: 5 out of 5 stars5/5How to Solve It: A New Aspect of Mathematical Method Rating: 4 out of 5 stars4/5Summary of The Black Swan: by Nassim Nicholas Taleb | Includes Analysis Rating: 5 out of 5 stars5/5Statistics: a QuickStudy Laminated Reference Guide Rating: 0 out of 5 stars0 ratingsHow Math Explains the World: A Guide to the Power of Numbers, from Car Repair to Modern Physics Rating: 3 out of 5 stars3/5Flatland Rating: 4 out of 5 stars4/5Calculus For Dummies Rating: 4 out of 5 stars4/5
Reviews for Modeling Information Diffusion in Online Social Networks with Partial Differential Equations
0 ratings0 reviews
Book preview
Modeling Information Diffusion in Online Social Networks with Partial Differential Equations - Haiyan Wang
© Springer Nature Switzerland AG 2020
H. Wang et al.Modeling Information Diffusion in Online Social Networks with Partial Differential EquationsSurveys and Tutorials in the Applied Mathematical Sciences7https://doi.org/10.1007/978-3-030-38852-2_1
1. Introduction
Haiyan Wang¹ , Feng Wang¹ and Kuai Xu¹
(1)
School of Mathematical & Natural Sciences, Arizona State University, Phoenix, AZ, USA
Abstract
Online social networks (OSNs) such as Twitter and Facebook, emerging as the model organism
of Big Data, have gained tremendous popularity for the platforms they provided for information exchange. Much of prior work on information diffusion over online social networks has been based on empirical and statistical approaches. The majority of dynamical models arising from information diffusion over online social networks are ordinary differential equations (ODEs). Recently, the authors proposed to use partial differential equations (PDEs) to model information diffusion in online social networks and introduced a new transdisciplinary architecture for modeling information diffusion. These studies demonstrate fascinating connections between advanced mathematics and online social networks.
Online social networks (OSNs) such as Twitter and Facebook, emerging as the model organism
of Big Data, have gained tremendous popularity for the platforms they provided for information exchange. Much of prior work on information diffusion over online social networks has been based on empirical and statistical approaches. The majority of dynamical models arising from information diffusion over online social networks are ordinary differential equations (ODEs). Recently, the authors proposed to use partial differential equations (PDEs) to model information diffusion in online social networks and introduced a new transdisciplinary architecture for modeling information diffusion. These studies demonstrate fascinating connections between advanced mathematics and online social networks.
A significant body of research about online social networks has focused on analysis of such networks with empirical approaches that use data mining and statistical modeling schemes [9, 18, 19, 25, 29, 34, 35, 41, 43, 47, 50, 55, 60–62, 68, 69, 72, 73, 85, 88, 89, 105, 111, 120, 121, 138–141, 144]. Mathematical models have played a significant role in understanding and predicting information diffusion in online social networks over time. In particular, epidemiological models have influenced the research on information diffusion [8, 51, 80, 89, 90, 124, 145, 146]. However, the deterministic models proposed for online social networks in the literature are largely based on ordinary differential equations (ODEs) that deal with collective social processes over time.
In a paper [128], the authors proposed to use partial differential equations (PDEs) built on intuitive cyber-distance among online users to study both temporal and spatial patterns of information diffusion process in social media. One of the simple, yet fundamental questions that the models address is this: for a piece of given information m initiated from a particular user called source s, what is the density of influenced users at network distance x from the source at any time t. We validate the models with real datasets collected from two popular social media sites, Twitter and Digg. The experiment results show that the models can achieve over 90% accuracy and effectively predict the density of influenced users.
This paper [128] is the first attempt to propose a PDE-based model for characterizing and predicting the temporal and spatial patterns of information diffusion over online social networks, which is also indicated in a survey by Zhang et al. [146]. In Guille et al.’s survey [38] on information diffusion over online social networks, the PDE model in [128] is reported as one of the three non-graph predictive models: epidemiological models, linear influence model (LIM), and PDE approach. The LIM approach developed in [140] focuses on predicting the temporal dynamics of information diffusion through solving non-negative least squares problems. The PDE-based models are dynamic systems that take into account the influence of the underlying network structure as well as information contents for predicting information diffusion over both temporal and spatial dimensions. We shall propose a number of spatio-temporal epidemiological models to describe information diffusion in online social networks in this book.
The book lies at the interface of mathematics, social media analysis, network science, and data science. A key challenge of this interdisciplinary research is to integrate big data from online social networks into the framework of partial differential equations. We use clustering analysis from data mining to aggregate big data for the validation of the PDE models. There are many clustering methods such as k-means clustering. We focus on spectral clustering analysis in this book as our data are graph-structured. The book integrates the research efforts carried out collaboratively by mathematicians specialized in partial differential equations, computer scientists focused on network theory and data mining, and researchers in social media. The extension of partial differential equations into online social networks presents new opportunities and challenges for mathematicians as well as computer scientists and researchers in social media.
References
8.
Barrat, A., Barthelemy, M., Vespignani, A.: Dynamical Processes on Complex Networks. Cambridge University Press, New York (2008)
9.
Benevenuto, F., Rodrigues, T., Cha, M., Almeida, V.: Characterizing user behavior in online social networks. In: Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement Conference, pp. 49–62. ACM, New York (2009)
18.
Cha, M., Mislove, A., Adams, B., Gummadi, K.: Characterizing social cascades in flickr. In: Proceedings of the First Workshop on Online Social Networks, pp. 13–18. ACM, New York (2008)
19.
Cha, M., Mislove, A., Gummadi, K.: A measurement-driven analysis of information propagation in the flickr social network. In: Proceedings of the 18th International Conference on World Wide Web, pp. 721–730. ACM, New York (2009)
25.
Dietz, L.: Inferring shared interests from social networks. In: Proceedings of Neural Information Processing Systems Workshop on Computational Social Science and the Wisdom of Crowds (2010)
29.
Easley, D., Kleinberg, J.: Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University Press, New York (2010)
34.
Ghosh, R., Lerman, K.: A framework for quantitative analysis of cascades on networks. In: Proceedings of the 4th ACM International Conference on Web Search and Data Mining, pp. 665–674. ACM, New York (2011)
35.
Girvan, M., Newman, M.: Community structure in social and biological networks. Proc. Natl. Acad. Sci. 99, 7821–7826 (2002)
38.
Guille, A., Hacid, H., Favre, C., Zighed, D.: Information diffusion in online social networks: a survey. SIGMOD Rec. 42, 17–28 (2013)
41.
Hajibagheri, A., Alvari, H., Hamzeh, A., Hashemi, S.: Community detection in social networks using information diffusion. In: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012), pp. 702–703. IEEE Computer Society, Washington (2012)
43.
Hernandez-Campos, F., Nobel, A.B., Smith, F.D., Jeffay, K.: Statistical clustering of Internet communication patterns. In: Proceedings of Symposium on the Interface of Computing Science and Statistics (2003)
47.
Ikeda, Y., Hasegawa, T., Nemoto, K.: Cascade dynamics on clustered network. J. Phys. Conf. Ser. 221, 012005 (2010)
50.
Jiang, J., Wilson, C., Wang, X., Huang, P., Sha, W., Dai, Y., Zhao, B.: Understanding latent interactions in online social networks. ACM Trans. Web 7, 18 (2013)
51.
Jin, F., Dougherty, E., Saraf, P., Cao, Y., Ramakrishnan, N.: Epidemiological modeling of news and rumors on Twitter. In: Proceedings of the 7th Workshop on Social Network Mining and Analysis, p. 8. ACM, New York (2013)
55.
Kawamoto, T.: A stochastic model of tweet diffusion on the Twitter network. Phys. A 392, 3470–3475(2013)
60.
Krishnamurthy, B., Wang, J.: On network-aware clustering of web clients. In: Proceedings of the Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, pp. 97–110. ACM, New York (2000)
62.
Kwak, H., Choi, Y., Eom, Y.-H., Jeong, H., Moon, S.: Mining communiteis in networks: a solution for consistency and its evaluation. In: Proceedings of the 9th SIGCOMM Conference on Internet Measurement Conference, pp. 301–314. ACM, New York (2009)
68.
Lerman, K., Ghosh, R.: Information contagion: an empirical study of spread of news on Digg and Twitter social networks. In: Proceedings of International Conference on Weblogs and Social Media (ICWSM) (2010)
69.
Leskovec, J., Mcglohon, M., Faloutsos, C., Glance, N., Hurst, M.: Cascading behavior in large blog graphs. In: SIAM International Conference on Data Mining (SDM), pp. 551–556 (2007)
72.
Liu, J., Aggarwal, C., Han, J.: On integrating network and community discovery. In: Proceedings of International Conference on Web Search and Data Mining (WSDM), pp. 117–126. ACM, New York (2015)
73.
Livne, A., Simmons, M., Adar, E., Adamic, L.: The party is over here: structure and content in the 2010 election. In: Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, pp. 17–21 (2011)
80.
Mena-Lorca, J., Hethcote, H.W.: Dynamic models of infectious diseases as regulators of population sizes. J. Math. Biol. 30, 693–716 (1992)
85.
Nazir, A., Raza, S., Chuah, C.-N.: Unveiling facebook: a measurement study of social network based applications. In: Proceedings of the 8th ACM SIGCOMM Conference on Internet Measurement, pp. 43–56. ACM, New York (2008)
88.
Nematzadeh, A., Ferrara, E., Flammini, A., Ahn, Y.: Optimal network modularity for information diffusion. Phys. Rev. Lett. 113, 088701 (2014)
89.
Newman, M.: The structure and function of complex networks. SIAM Rev. 45, 167–256 (2003)
90.
Newman, M.: Networks: An Introdution. Oxford University Press, Oxford (2010)
105.
Romero, D., Meeder, B., Kleinberg, J.: Differences in the Mechanics of Information Diffusion Across Topics: Idioms, Political Hashtags, and Complex Contagion on Twitter. In: Proceedings of 20th International World Wide Web Conference (2011)
111.
Schneider, F., Feldmann, A., Krishnamurthy, B., Willinger, W.: Understanding online social network usage from a network perspective. In: Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement Conference, pp. 35–48. ACM, New York (2009)
120.
Tang, L., Liu, H.: Community Detection and Mining in Social Media. Morgan & Claypool, San Rafael (2010)
121.
Tang, S., Blenn, N., Doerr, C., Van Mieghem, P.: Digging in the Digg social news website. IEEE Trans. Multimedia 13, 1163–1175 (2011)
124.
Ver Steeg, G., Ghosh, R., Lerman, K.: What stops social epidemics? In: Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM) (2011)
128.
Wang, F., Wang, H., Xu, K.: Diffusive logistic model towards predicting information diffusion in online social networks. In: 2012 32nd International Conference on Distributed Computing Systems Workshops (ICDCSW), pp. 133–139. IEEE, Piscataway (2012). https://doi.org/10.1109/ICDCSW.2012.16
138.
Xu, K., Wang, F., Jia, X., Wang, H.: The impact of sampling on big data analysis of social media: a case study on Flu and Ebola. In: IEEE Global Communications Conference (GLOBECOM), pp. 1–6 (2015)
140.
Yang, J., Leskovec, J.: Modeling information diffusion in implicit networks. In: 2010 10th International Conference on Data Mining (ICDM), pp. 599–608. IEEE, Piscataway (2010)
141.
Yu, B., Fei, H.: Modeling social cascade in the Flickr social network. In: Proceedings of International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), vol. 7, pp. 566–570 (2009)
144.
Zhang, L., Zhong, X., Wan, L.: Modeling structure evolution of online social networks. In: 2012 8th International Conference on Information Science and Digital Content Technology (ICIDT), pp. 15–19. IEEE, Piscataway (2012)
145.
Zhang, X., Sun, G.-Q., Zhu, Y.-X., Ma, J., Jin, Z.: Epidemic dynamics on semi-directed complex networks. Math. Biosci. 246, 242–251 (2013)
146.
Zhang, Z., Liu, C., Zhan, X., Lu, X., Zhang, C., Zhang, Y.: Dynamics of information diffusion and its applications on complex networks. Phys. Rep. 651, 1–34 (2016)
© Springer Nature Switzerland AG 2020
H. Wang et al.Modeling Information Diffusion in Online Social Networks with Partial Differential EquationsSurveys and Tutorials in the Applied Mathematical Sciences7https://doi.org/10.1007/978-3-030-38852-2_2
2. Ordinary Differential Equation Models on Social Networks
Haiyan Wang¹ , Feng Wang¹ and Kuai Xu¹
(1)
School of Mathematical & Natural Sciences, Arizona State University, Phoenix, AZ, USA
Abstract
In this chapter we consider a number of ordinary differential equation models for diffusion of innovation and epidemiological models. We discuss the classical theory on diffusion of innovation, emphasizing online social networks and analyzing several ordinary differential equation models for innovation diffusion. We also present a number of basic compartment epidemiological models and their applications in online social networks, and finally we discuss SIR models and their extensions when the total population is not constant.
2.1 Introduction
Information diffusion over online social networks has become a fast growing research domain encompassing techniques from a plethora of sciences, among them mathematics, computer science, communications and marketing, etc. In addition, the method for studying the spread of infectious diseases among a population has been applied to understanding information spreading patterns. Information diffusion has become a subject with disparate views of what is an information diffusion process. We focus here on a particular case in which information diffuses in online social networks, and we define information diffusion as the process by which a piece of information (knowledge) spreads and reaches individuals through interactions in a network.
The theory of information diffusion can be traced back to the research on the diffusion of innovation over a population, advanced by a pioneering mass communication scholar, E. M. Rogers. Information diffusion in online social networks becomes a premier example to revisit innovation diffusion. In this chapter, we will focus on ODE models for diffusion of innovation and epidemics. Both diffusion of innovations and epidemic models provide a global view of how an innovation (e.g., news, a product) or a type of disease spreads through a population even when interactions among individuals are unavailable. We will extend innovation diffusion and epidemic models to spatial models where both local and global information between clusters in a network are available.
2.2 Diffusion of Innovations
Diffusion of innovations is a theory that seeks to explain how new ideas and technologies spread through cultures. E. M. Rogers was a professor of communication studies who popularized the theory with his book Diffusion of Innovations
[104]. The origins of the diffusion of innovations theory span various disciplines including anthropology, early sociology, rural sociology, education, industrial sociology, and medical sociology. Rogers [104] theorized that information diffusion is the social process through which an innovation is communicated through certain channels over time among the participants in a social system. He identified four key elements that influence diffusion of a new idea: the innovation itself, communication channels, time, and a social system. The theory of diffusion of innovation has been applied to numerous contexts, including marketing, communications, health promotion, organizational studies, and complexity studies.
The rise of social media has provided a new platform to study diffusion of innovation. Rogers [104] defined an innovation as an idea, practice, or object if it is perceived as novel by an individual or other unit of adoption. A piece of news being reposted many times in online social networks is a typical example of innovations diffusing across online social networks. The theory of diffusion of innovations can be applied to online social networks to answer why and how information spreads fast and reaches a broad audience. It also can be used to reveal some key characteristics of information diffusion such as its motivations. With appropriate models, it can be used to further predict the rate at which ideas spread. In this section, we review some key characteristics of innovation diffusion that are important for the PDE modeling of information diffusion in online social networks. Finally, we present mathematical models that can be used to describe the process of innovation diffusion.
2.2.1 Characteristics of Innovation Diffusion
Rogers [104] explored many characteristics of innovations, individual adopters, and organizations. The nature of networks and the roles opinion leaders play determine the likelihood of innovation adoption. We here identify several characteristics that are related to mathematical modeling of information diffusion in online social networks. Our goal is to use mathematical models to identify key factors that influence the spread of information, and then explain how and why users adopt certain information.
2.2.1.1 Logistic Adoption Curve
In [104] Rogers argued that the diffusion process, depending on a combination of the four key elements, has a point at which an innovation reaches critical mass. He then studied the various stages and introduced an adopter category as a classification of individuals within a social system on the basis of innovativeness. Based on the order in which they adopt the innovations, it has been found that the five different types of adopters are: (1) Innovators (top 2.5%), (2) Early Adopters (13.5%), (3) Early Majority (34%), (4) Late Majority (34%), and (5) Laggards (16%). Numerous studies have shown that different types of adopters behave in significantly different ways in various