The document discusses performance evaluation of social network analysis algorithms using Apache Spark. It analyzes the performance of algorithms like PageRank, connected components, triangle counting and K-means clustering on different social network datasets. The results show that GraphX PageRank performs faster than the naive implementation in Spark. Connected components execution time grows super linearly initially and then fluctuates. Triangle counting time grows linearly with size. K-means clustering is tested using both naive and MLlib implementations in Spark.
Related topics: