Discover millions of ebooks, audiobooks, and so much more with a free trial

From $11.99/month after trial. Cancel anytime.

Modeling Information Diffusion in Online Social Networks with Partial Differential Equations
Modeling Information Diffusion in Online Social Networks with Partial Differential Equations
Modeling Information Diffusion in Online Social Networks with Partial Differential Equations
Ebook325 pages2 hours

Modeling Information Diffusion in Online Social Networks with Partial Differential Equations

Rating: 0 out of 5 stars

()

Read preview

About this ebook

The book lies at the interface of mathematics, social media analysis, and data science. Its authors aim to introduce a new dynamic modeling approach to the use of partial differential equations for describing information diffusion over online social networks. The eigenvalues and eigenvectors of the Laplacian matrix for the underlying social network are used to find communities (clusters) of online users. Once these clusters are embedded in a Euclidean space, the mathematical models, which are reaction-diffusion equations, are developed based on intuitive social distances between clusters within the Euclidean space. The models are validated with data from major social media such as Twitter. In addition, mathematical analysis of these models is applied, revealing insights into information flow on social media. Two applications with geocoded Twitter data are included in the book: one describing the social movement in Twitter during the Egyptian revolution in 2011 and another predicting influenza prevalence. The new approach advocates a paradigm shift for modeling information diffusion in online social networks and lays the theoretical groundwork for many spatio-temporal modeling problems in the big-data era.
LanguageEnglish
PublisherSpringer
Release dateMar 16, 2020
ISBN9783030388522
Modeling Information Diffusion in Online Social Networks with Partial Differential Equations
Author

Haiyan Wang

He completed his doctorate in mathematics, while also earning a master's degree in computer science at Michigan State University in 1997. He worked as a full-time software engineer in industry for almost ten years before joining Arizona State University. Dr. Wang’s research interests include applied mathematics, data science, differential equations, online social networks. He has published numerous articles in scholarly journals and a book entitled, “Modeling Information Diffusion in Online Social Networks with Partial Differential Equations”, Springer, 2020. Recently he developed and taught a course, Mathematical Methods in Data Science, at Arizona State University.

Related to Modeling Information Diffusion in Online Social Networks with Partial Differential Equations

Titles in the series (1)

View More

Related ebooks

Mathematics For You

View More

Related articles

Reviews for Modeling Information Diffusion in Online Social Networks with Partial Differential Equations

Rating: 0 out of 5 stars
0 ratings

0 ratings0 reviews

What did you think?

Tap to rate

Review must be at least 10 words

    Book preview

    Modeling Information Diffusion in Online Social Networks with Partial Differential Equations - Haiyan Wang

    © Springer Nature Switzerland AG 2020

    H. Wang et al.Modeling Information Diffusion in Online Social Networks with Partial Differential EquationsSurveys and Tutorials in the Applied Mathematical Sciences7https://doi.org/10.1007/978-3-030-38852-2_1

    1. Introduction

    Haiyan Wang¹ , Feng Wang¹ and Kuai Xu¹

    (1)

    School of Mathematical & Natural Sciences, Arizona State University, Phoenix, AZ, USA

    Abstract

    Online social networks (OSNs) such as Twitter and Facebook, emerging as the model organism of Big Data, have gained tremendous popularity for the platforms they provided for information exchange. Much of prior work on information diffusion over online social networks has been based on empirical and statistical approaches. The majority of dynamical models arising from information diffusion over online social networks are ordinary differential equations (ODEs). Recently, the authors proposed to use partial differential equations (PDEs) to model information diffusion in online social networks and introduced a new transdisciplinary architecture for modeling information diffusion. These studies demonstrate fascinating connections between advanced mathematics and online social networks.

    Online social networks (OSNs) such as Twitter and Facebook, emerging as the model organism of Big Data, have gained tremendous popularity for the platforms they provided for information exchange. Much of prior work on information diffusion over online social networks has been based on empirical and statistical approaches. The majority of dynamical models arising from information diffusion over online social networks are ordinary differential equations (ODEs). Recently, the authors proposed to use partial differential equations (PDEs) to model information diffusion in online social networks and introduced a new transdisciplinary architecture for modeling information diffusion. These studies demonstrate fascinating connections between advanced mathematics and online social networks.

    A significant body of research about online social networks has focused on analysis of such networks with empirical approaches that use data mining and statistical modeling schemes [9, 18, 19, 25, 29, 34, 35, 41, 43, 47, 50, 55, 60–62, 68, 69, 72, 73, 85, 88, 89, 105, 111, 120, 121, 138–141, 144]. Mathematical models have played a significant role in understanding and predicting information diffusion in online social networks over time. In particular, epidemiological models have influenced the research on information diffusion [8, 51, 80, 89, 90, 124, 145, 146]. However, the deterministic models proposed for online social networks in the literature are largely based on ordinary differential equations (ODEs) that deal with collective social processes over time.

    In a paper [128], the authors proposed to use partial differential equations (PDEs) built on intuitive cyber-distance among online users to study both temporal and spatial patterns of information diffusion process in social media. One of the simple, yet fundamental questions that the models address is this: for a piece of given information m initiated from a particular user called source s, what is the density of influenced users at network distance x from the source at any time t. We validate the models with real datasets collected from two popular social media sites, Twitter and Digg. The experiment results show that the models can achieve over 90% accuracy and effectively predict the density of influenced users.

    This paper [128] is the first attempt to propose a PDE-based model for characterizing and predicting the temporal and spatial patterns of information diffusion over online social networks, which is also indicated in a survey by Zhang et al. [146]. In Guille et al.’s survey [38] on information diffusion over online social networks, the PDE model in [128] is reported as one of the three non-graph predictive models: epidemiological models, linear influence model (LIM), and PDE approach. The LIM approach developed in [140] focuses on predicting the temporal dynamics of information diffusion through solving non-negative least squares problems. The PDE-based models are dynamic systems that take into account the influence of the underlying network structure as well as information contents for predicting information diffusion over both temporal and spatial dimensions. We shall propose a number of spatio-temporal epidemiological models to describe information diffusion in online social networks in this book.

    The book lies at the interface of mathematics, social media analysis, network science, and data science. A key challenge of this interdisciplinary research is to integrate big data from online social networks into the framework of partial differential equations. We use clustering analysis from data mining to aggregate big data for the validation of the PDE models. There are many clustering methods such as k-means clustering. We focus on spectral clustering analysis in this book as our data are graph-structured. The book integrates the research efforts carried out collaboratively by mathematicians specialized in partial differential equations, computer scientists focused on network theory and data mining, and researchers in social media. The extension of partial differential equations into online social networks presents new opportunities and challenges for mathematicians as well as computer scientists and researchers in social media.

    References

    8.

    Barrat, A., Barthelemy, M., Vespignani, A.: Dynamical Processes on Complex Networks. Cambridge University Press, New York (2008)

    9.

    Benevenuto, F., Rodrigues, T., Cha, M., Almeida, V.: Characterizing user behavior in online social networks. In: Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement Conference, pp. 49–62. ACM, New York (2009)

    18.

    Cha, M., Mislove, A., Adams, B., Gummadi, K.: Characterizing social cascades in flickr. In: Proceedings of the First Workshop on Online Social Networks, pp. 13–18. ACM, New York (2008)

    19.

    Cha, M., Mislove, A., Gummadi, K.: A measurement-driven analysis of information propagation in the flickr social network. In: Proceedings of the 18th International Conference on World Wide Web, pp. 721–730. ACM, New York (2009)

    25.

    Dietz, L.: Inferring shared interests from social networks. In: Proceedings of Neural Information Processing Systems Workshop on Computational Social Science and the Wisdom of Crowds (2010)

    29.

    Easley, D., Kleinberg, J.: Networks, Crowds, and Markets: Reasoning about a Highly Connected World. Cambridge University Press, New York (2010)

    34.

    Ghosh, R., Lerman, K.: A framework for quantitative analysis of cascades on networks. In: Proceedings of the 4th ACM International Conference on Web Search and Data Mining, pp. 665–674. ACM, New York (2011)

    35.

    Girvan, M., Newman, M.: Community structure in social and biological networks. Proc. Natl. Acad. Sci. 99, 7821–7826 (2002)

    38.

    Guille, A., Hacid, H., Favre, C., Zighed, D.: Information diffusion in online social networks: a survey. SIGMOD Rec. 42, 17–28 (2013)

    41.

    Hajibagheri, A., Alvari, H., Hamzeh, A., Hashemi, S.: Community detection in social networks using information diffusion. In: Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012), pp. 702–703. IEEE Computer Society, Washington (2012)

    43.

    Hernandez-Campos, F., Nobel, A.B., Smith, F.D., Jeffay, K.: Statistical clustering of Internet communication patterns. In: Proceedings of Symposium on the Interface of Computing Science and Statistics (2003)

    47.

    Ikeda, Y., Hasegawa, T., Nemoto, K.: Cascade dynamics on clustered network. J. Phys. Conf. Ser. 221, 012005 (2010)

    50.

    Jiang, J., Wilson, C., Wang, X., Huang, P., Sha, W., Dai, Y., Zhao, B.: Understanding latent interactions in online social networks. ACM Trans. Web 7, 18 (2013)

    51.

    Jin, F., Dougherty, E., Saraf, P., Cao, Y., Ramakrishnan, N.: Epidemiological modeling of news and rumors on Twitter. In: Proceedings of the 7th Workshop on Social Network Mining and Analysis, p. 8. ACM, New York (2013)

    55.

    Kawamoto, T.: A stochastic model of tweet diffusion on the Twitter network. Phys. A 392, 3470–3475(2013)

    60.

    Krishnamurthy, B., Wang, J.: On network-aware clustering of web clients. In: Proceedings of the Conference on Applications, Technologies, Architectures, and Protocols for Computer Communication, pp. 97–110. ACM, New York (2000)

    62.

    Kwak, H., Choi, Y., Eom, Y.-H., Jeong, H., Moon, S.: Mining communiteis in networks: a solution for consistency and its evaluation. In: Proceedings of the 9th SIGCOMM Conference on Internet Measurement Conference, pp. 301–314. ACM, New York (2009)

    68.

    Lerman, K., Ghosh, R.: Information contagion: an empirical study of spread of news on Digg and Twitter social networks. In: Proceedings of International Conference on Weblogs and Social Media (ICWSM) (2010)

    69.

    Leskovec, J., Mcglohon, M., Faloutsos, C., Glance, N., Hurst, M.: Cascading behavior in large blog graphs. In: SIAM International Conference on Data Mining (SDM), pp. 551–556 (2007)

    72.

    Liu, J., Aggarwal, C., Han, J.: On integrating network and community discovery. In: Proceedings of International Conference on Web Search and Data Mining (WSDM), pp. 117–126. ACM, New York (2015)

    73.

    Livne, A., Simmons, M., Adar, E., Adamic, L.: The party is over here: structure and content in the 2010 election. In: Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, pp. 17–21 (2011)

    80.

    Mena-Lorca, J., Hethcote, H.W.: Dynamic models of infectious diseases as regulators of population sizes. J. Math. Biol. 30, 693–716 (1992)

    85.

    Nazir, A., Raza, S., Chuah, C.-N.: Unveiling facebook: a measurement study of social network based applications. In: Proceedings of the 8th ACM SIGCOMM Conference on Internet Measurement, pp. 43–56. ACM, New York (2008)

    88.

    Nematzadeh, A., Ferrara, E., Flammini, A., Ahn, Y.: Optimal network modularity for information diffusion. Phys. Rev. Lett. 113, 088701 (2014)

    89.

    Newman, M.: The structure and function of complex networks. SIAM Rev. 45, 167–256 (2003)

    90.

    Newman, M.: Networks: An Introdution. Oxford University Press, Oxford (2010)

    105.

    Romero, D., Meeder, B., Kleinberg, J.: Differences in the Mechanics of Information Diffusion Across Topics: Idioms, Political Hashtags, and Complex Contagion on Twitter. In: Proceedings of 20th International World Wide Web Conference (2011)

    111.

    Schneider, F., Feldmann, A., Krishnamurthy, B., Willinger, W.: Understanding online social network usage from a network perspective. In: Proceedings of the 9th ACM SIGCOMM Conference on Internet Measurement Conference, pp. 35–48. ACM, New York (2009)

    120.

    Tang, L., Liu, H.: Community Detection and Mining in Social Media. Morgan & Claypool, San Rafael (2010)

    121.

    Tang, S., Blenn, N., Doerr, C., Van Mieghem, P.: Digging in the Digg social news website. IEEE Trans. Multimedia 13, 1163–1175 (2011)

    124.

    Ver Steeg, G., Ghosh, R., Lerman, K.: What stops social epidemics? In: Proceedings of the 5th International AAAI Conference on Weblogs and Social Media (ICWSM) (2011)

    128.

    Wang, F., Wang, H., Xu, K.: Diffusive logistic model towards predicting information diffusion in online social networks. In: 2012 32nd International Conference on Distributed Computing Systems Workshops (ICDCSW), pp. 133–139. IEEE, Piscataway (2012). https://​doi.​org/​10.​1109/​ICDCSW.​2012.​16

    138.

    Xu, K., Wang, F., Jia, X., Wang, H.: The impact of sampling on big data analysis of social media: a case study on Flu and Ebola. In: IEEE Global Communications Conference (GLOBECOM), pp. 1–6 (2015)

    140.

    Yang, J., Leskovec, J.: Modeling information diffusion in implicit networks. In: 2010 10th International Conference on Data Mining (ICDM), pp. 599–608. IEEE, Piscataway (2010)

    141.

    Yu, B., Fei, H.: Modeling social cascade in the Flickr social network. In: Proceedings of International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), vol. 7, pp. 566–570 (2009)

    144.

    Zhang, L., Zhong, X., Wan, L.: Modeling structure evolution of online social networks. In: 2012 8th International Conference on Information Science and Digital Content Technology (ICIDT), pp. 15–19. IEEE, Piscataway (2012)

    145.

    Zhang, X., Sun, G.-Q., Zhu, Y.-X., Ma, J., Jin, Z.: Epidemic dynamics on semi-directed complex networks. Math. Biosci. 246, 242–251 (2013)

    146.

    Zhang, Z., Liu, C., Zhan, X., Lu, X., Zhang, C., Zhang, Y.: Dynamics of information diffusion and its applications on complex networks. Phys. Rep. 651, 1–34 (2016)

    © Springer Nature Switzerland AG 2020

    H. Wang et al.Modeling Information Diffusion in Online Social Networks with Partial Differential EquationsSurveys and Tutorials in the Applied Mathematical Sciences7https://doi.org/10.1007/978-3-030-38852-2_2

    2. Ordinary Differential Equation Models on Social Networks

    Haiyan Wang¹ , Feng Wang¹ and Kuai Xu¹

    (1)

    School of Mathematical & Natural Sciences, Arizona State University, Phoenix, AZ, USA

    Abstract

    In this chapter we consider a number of ordinary differential equation models for diffusion of innovation and epidemiological models. We discuss the classical theory on diffusion of innovation, emphasizing online social networks and analyzing several ordinary differential equation models for innovation diffusion. We also present a number of basic compartment epidemiological models and their applications in online social networks, and finally we discuss SIR models and their extensions when the total population is not constant.

    2.1 Introduction

    Information diffusion over online social networks has become a fast growing research domain encompassing techniques from a plethora of sciences, among them mathematics, computer science, communications and marketing, etc. In addition, the method for studying the spread of infectious diseases among a population has been applied to understanding information spreading patterns. Information diffusion has become a subject with disparate views of what is an information diffusion process. We focus here on a particular case in which information diffuses in online social networks, and we define information diffusion as the process by which a piece of information (knowledge) spreads and reaches individuals through interactions in a network.

    The theory of information diffusion can be traced back to the research on the diffusion of innovation over a population, advanced by a pioneering mass communication scholar, E. M. Rogers. Information diffusion in online social networks becomes a premier example to revisit innovation diffusion. In this chapter, we will focus on ODE models for diffusion of innovation and epidemics. Both diffusion of innovations and epidemic models provide a global view of how an innovation (e.g., news, a product) or a type of disease spreads through a population even when interactions among individuals are unavailable. We will extend innovation diffusion and epidemic models to spatial models where both local and global information between clusters in a network are available.

    2.2 Diffusion of Innovations

    Diffusion of innovations is a theory that seeks to explain how new ideas and technologies spread through cultures. E. M. Rogers was a professor of communication studies who popularized the theory with his book Diffusion of Innovations [104]. The origins of the diffusion of innovations theory span various disciplines including anthropology, early sociology, rural sociology, education, industrial sociology, and medical sociology. Rogers [104] theorized that information diffusion is the social process through which an innovation is communicated through certain channels over time among the participants in a social system. He identified four key elements that influence diffusion of a new idea: the innovation itself, communication channels, time, and a social system. The theory of diffusion of innovation has been applied to numerous contexts, including marketing, communications, health promotion, organizational studies, and complexity studies.

    The rise of social media has provided a new platform to study diffusion of innovation. Rogers [104] defined an innovation as an idea, practice, or object if it is perceived as novel by an individual or other unit of adoption. A piece of news being reposted many times in online social networks is a typical example of innovations diffusing across online social networks. The theory of diffusion of innovations can be applied to online social networks to answer why and how information spreads fast and reaches a broad audience. It also can be used to reveal some key characteristics of information diffusion such as its motivations. With appropriate models, it can be used to further predict the rate at which ideas spread. In this section, we review some key characteristics of innovation diffusion that are important for the PDE modeling of information diffusion in online social networks. Finally, we present mathematical models that can be used to describe the process of innovation diffusion.

    2.2.1 Characteristics of Innovation Diffusion

    Rogers [104] explored many characteristics of innovations, individual adopters, and organizations. The nature of networks and the roles opinion leaders play determine the likelihood of innovation adoption. We here identify several characteristics that are related to mathematical modeling of information diffusion in online social networks. Our goal is to use mathematical models to identify key factors that influence the spread of information, and then explain how and why users adopt certain information.

    2.2.1.1 Logistic Adoption Curve

    In [104] Rogers argued that the diffusion process, depending on a combination of the four key elements, has a point at which an innovation reaches critical mass. He then studied the various stages and introduced an adopter category as a classification of individuals within a social system on the basis of innovativeness. Based on the order in which they adopt the innovations, it has been found that the five different types of adopters are: (1) Innovators (top 2.5%), (2) Early Adopters (13.5%), (3) Early Majority (34%), (4) Late Majority (34%), and (5) Laggards (16%). Numerous studies have shown that different types of adopters behave in significantly different ways in various

    Enjoying the preview?
    Page 1 of 1