Big Data
Big Data
💡 Example:
● Facebook → Nodes = Users, Edges = Friendships
🔹 Types of Centrality:
✅ Degree Centrality → Counts direct connections (e.g., most followed person on Twitter).
✅ Betweenness Centrality → Measures influence in connecting others (e.g., news aggregators).
✅ Closeness Centrality → Identifies people who can quickly reach everyone.
✅ PageRank → Google's algorithm for ranking websites and influencers.
💡 Example:
● Twitter ranks influential users based on followers & retweets.
B. Community Detection (Finding Groups) 👥
Identifies clusters of tightly connected users.
🔹 Popular Algorithms:
✅ Modularity Maximization → Finds groups with dense internal connections.
✅ Louvain Algorithm → Fast, scalable method for large graphs.
✅ Label Propagation → Spreads labels to detect communities.
💡 Example:
● Facebook suggests Groups based on detected communities.
🔹 Methods:
✅ Jaccard Coefficient → Measures similarity between two users.
✅ Adamic/Adar Score → Favors users with common neighbors.
✅ Graph Embedding (Node2Vec, DeepWalk) → Uses machine learning to recommend links.
💡 Example:
● LinkedIn’s "People You May Know" feature uses link prediction.
🔹 Models:
✅ Independent Cascade Model → Users influence their neighbors probabilistically.
✅ Linear Threshold Model → Users adopt behavior based on peer pressure.
💡 Example:
● Twitter’s trending hashtags spread based on retweets & mentions.
🔹 Techniques:
✅ Graph-based anomaly detection → Identifies outlier nodes.
✅ Behavioral analysis → Detects unusual activity patterns.
✅ Bot detection algorithms → Finds automated accounts.
💡 Example:
● Facebook & Twitter use AI to detect fake accounts & spam bots.
Social Network Mining is the process of analyzing relationships, interactions, and patterns in social
networks using graph theory, machine learning, and data analytics. It helps in understanding how people
or entities are connected and how information spreads across a network.
💡 Example:
● Facebook → Mining user friendships to suggest new connections.
💡 Example:
● Instagram & YouTube → Brands collaborate with influencers for promotions.
💡 Example:
● LinkedIn’s "People You May Know" → Suggests professional connections.
💡 Example:
● Twitter Trending Topics → Analyzing popular discussions.
💡 Example:
● Facebook & Twitter → AI models detect and block fake accounts.
💡 Example:
● Netflix & Spotify → Suggest movies & music based on friends' preferences.
💡 Example:
● COVID-19 tracking → Contact tracing using social graphs.
💡 Example:
● Twitter analysis in elections → Predicting candidate popularity.
● Social activism → Tracking global movements (e.g., #MeToo, Black Lives Matter).
💡 Example:
● FBI & Interpol → Monitor social media for suspicious activities.
💡 Example:
● If Alice is friends with Bob, the connection is mutual.
💡 Example:
● If Alice follows Bob on Twitter, Bob may not follow Alice back.
💡 Example:
● Strong ties (family, close friends) vs. Weak ties (acquaintances, followers).
4️⃣ Bipartite Graph (Two Different Node Types) 🔗
● Two types of nodes with edges only between different types.
💡 Example:
● Users ↔ Movies (Netflix) → User A watches Movie X.
💡 Example:
● Most followed user on Twitter has high degree centrality.
💡 Example:
● Facebook suggests groups based on detected communities.
📌 Link Prediction (Friend & Connection Recommendations)
● Predicts future relationships based on current network structure.
● Used in LinkedIn ("People You May Know") and Facebook friend suggestions.
💡 Example:
● If Alice & Bob share many mutual friends, they may connect soon.
💡 Example:
● Twitter trends spread via retweets (Viral Marketing).
● Focus: Connecting with friends, family, and other individuals. These platforms are often used for
sharing updates, photos, videos, and engaging in conversations.
● Examples: Facebook, X (formerly Twitter), LinkedIn, Instagram, Threads, Bluesky, Mastodon.
3. Discussion Forums:
● Focus: Facilitating conversations and discussions around specific topics or interests. Users ask
questions, share opinions, and engage in debates.
● Examples: Reddit, Quora, Discord, Digg, Clubhouse.
● Focus: Publishing and sharing blog posts and articles. These platforms allow individuals and
businesses to share their thoughts, expertise, and stories.
● Examples: Medium, Tumblr, WordPress.com, Blogger, LiveJournal.
● Focus: Connecting with professionals for career development, networking, and business
opportunities.
● Examples: LinkedIn, Xing.
6. Review Networks:
7. Interest-Based Networks:
8. Dating Networks:
10. Educational Networks: * Focus: Connecting students, educators, and institutions for learning and
knowledge sharing. * Examples: Academia.edu, ResearchGate.
11. Mobile Messaging Apps (with social features): * Focus: Primarily for private messaging, but often
include features for group chats, sharing media, and social interaction. * Examples: WhatsApp,
Telegram, Facebook Messenger, WeChat.
12. Niche Social Networks: * Focus: Catering to specific communities or demographics, such as
networks for parents (CafeMom), travelers (CouchSurfing), or specific cultural groups.
Or
💡 Examples:
● Facebook → Friendships, groups, and personal connections.
🔹 Key Features: Friend requests, messaging, content feeds, and live updates.
🔹 Key Features: Resume sharing, job searches, endorsements, and business networking.
💡 Examples:
● Twitter (X) → Text-based updates, retweets, and hashtags.
💡 Examples:
● YouTube → Video sharing and streaming.
💡 Examples:
● Reddit → Communities (subreddits) for discussions.
💡 Examples:
● Tinder → Swipe-based dating.
💡 Examples:
● Slack → Business communication via channels.
💡 Examples:
● ResearchGate → Academic networking for researchers.
💡 Examples:
● Twitch → Live game streaming.
💡 Examples:
● Mastodon → Open-source and decentralized social networking.
● LinkedIn Network → Professional circles like colleagues, alumni, and industry groups.
📌 2. Recommendation Systems
🔹 Suggests friends, content, and products based on social clusters.
🔹 Improves Netflix, Amazon, Spotify recommendations.
💡 Example: Netflix recommends movies liked by your community.
💡 Examples:
● Netflix → Suggests movies & shows based on watch history.
● Amazon → Recommends products based on past purchases.
💡 Example:
● If you watch action movies, Netflix will suggest more action films.
💡 Example:
● If User A & User B like similar products, A gets recommendations from B’s choices.
🔹 Types:
✅ User-Based Collaborative Filtering → Finds similar users.
✅ Item-Based Collaborative Filtering → Finds similar items.
🔹 Algorithms: Pearson Correlation, Matrix Factorization (SVD)
💡 Example:
● Netflix uses both user preferences & similar user behavior to recommend shows.
💡 Example:
● Travel booking sites recommend places based on user preferences (beach, adventure, budget).