Please wait a minute...
文章检索
复杂系统与复杂性科学  2023, Vol. 20 Issue (2): 1-9    DOI: 10.13306/j.1672-3813.2023.02.001
  本期目录 | 过刊浏览 | 高级检索 |
网络直播大数据:统计特征与时序规律挖掘
郭淑慧, 吕欣
国防科技大学系统工程学院, 长沙 410073
Data Mining of Live Streaming Platforms: Statistical Characteristics and Temporal Pattern
GUO Shuhui, LÜ Xin
College of Systems Engineering, National University of Defense Technology, Changsha 410073, China
全文: PDF(3503 KB)  
输出: BibTeX | EndNote (RIS)      
摘要 为挖掘网络直播领域数百万主播与数亿计观众的活跃互动下大规模人群行为学特征,以斗鱼和虎牙直播平台为例,统计分析了连续123天、涉及240多万名主播、超过7.26亿条的直播数据,总结了直播平台的负载时序规律和用户行为特征。发现直播负载存在明显的日内效应和周内效应,不同直播模式的主播在观众数、粉丝数等统计特征上存在显著的组间差异,主播生存期和直播间观众数量符合幂律分布,随着平台发展,主播和观众数量呈现较强的线性相关性,但其波动性也逐步增大,体现出系统越来越强的异质性和非均匀性。对理解网络直播复杂系统中的用户行为模式、挖掘用户分布规律及变化趋势、设计商业模式如个性化推荐等方面具有重要意义。
服务
把本文推荐给朋友
加入引用管理器
E-mail Alert
RSS
作者相关文章
郭淑慧
吕欣
郭淑慧
吕欣
关键词 网络直播直播平台大数据流量分析行为动力学    
Abstract:To explore the behavioral characteristics of massive crowds under the active interaction of millions of streamers and viewers in the field of live streaming, this paper summarized the temporal patterns of live streaming workload and user behavior characteristics of the live streaming platform, taking Douyu and Huya live streaming platforms as examples, a statistical analysis of 123 consecutive days, involving more than 2.4 million anchors, and more than 726 million live streaming data. The live streaming workload has obvious intra-day and intra-week effect. Different live streaming modes have significant differences in live streaming characteristics such as the average number of viewers and followers. The lifetime of streamers and the number of viewers conform to a power law distribution. With the development of the platform, there is a strong linear correlation between the number of streamers and viewers, but its volatility is gradually increasing, reflecting the increasingly strong heterogeneity and non-uniformity of the system. It is of great significance for understanding user behavior patterns in complex systems of live streaming, mining user distribution laws and changing trends, and designing business models such as personalized recommendations.
Key wordslive streaming    live streaming platform    big data    workload analysis    behavioral dynamics
收稿日期: 2021-09-06      出版日期: 2023-07-21
:  TP391  
  G358  
基金资助:国家杰出青年科学基金(72025405);国家自然科学基金重大研究计划(91846301);国家社科基金重大项目(22ZDA102)
通讯作者: 吕欣(1984-),男,博士,教授,主要研究方向为大数据挖掘、复杂网络、应急管理、人类行为动力学。   
作者简介: 郭淑慧(1996-),女,博士研究生,主要研究方向为社交媒体大数据分析挖掘。
引用本文:   
郭淑慧, 吕欣. 网络直播大数据:统计特征与时序规律挖掘[J]. 复杂系统与复杂性科学, 2023, 20(2): 1-9.
GUO Shuhui, LÜ Xin. Data Mining of Live Streaming Platforms: Statistical Characteristics and Temporal Pattern[J]. Complex Systems and Complexity Science, 2023, 20(2): 1-9.
链接本文:  
https://fzkx.qdu.edu.cn/CN/10.13306/j.1672-3813.2023.02.001      或      https://fzkx.qdu.edu.cn/CN/Y2023/V20/I2/1
[1] CHEN X, CHEN S, WANG X, et al. " I was afraid, but now I enjoy being a streamer!" understanding the challenges and prospects of using live streaming for online education[J]. Proceedings of the ACM on Human-Computer Interaction, 2021, 4(CSCW3): 1-32.
[2] LIU L, AREMU E O, YOO D. Brand marketing strategy of live streaming in mobile era: a case study of tmall platform[J]. Journal of East Asia Management, 2020, 1(1): 65-87.
[3] LU Z, ANNETT M, FAN M, et al. " I feel it is my responsibility to stream" streaming and engaging with intangible cultural heritage through livestreaming[C]// BREWSTER S, FITZPATRICK G, COX A, et al. Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems. Scotland, UK: ACM, 2019: 1-14.
[4] FAN H, LEE F L F. Judicial visibility under responsive authoritarianism: a study of the live broadcasting of court trials in China[J]. Media, Culture & Society, 2019, 41(8): 1088-1106.
[5] 中国互联网网络信息中心. 第49次中国互联网络发展状况统计报告[EB/OL]. [2022-07-04]. http://www.cnnic.net.cn/hlwfzyj/hlwxzbg/hlwtjbg/202202/P020220311493378715650.pdf.
CENTER C I N I. The 49th statistical report on internet development in China[EB/OL]. [2022-07-04].http://www.cnnic.net.cn/hlwfzyj/hlwxzbg/hlwtjbg/202202/P020220311493378715650.pdf.
[6] 艾媒咨询. 2021Q3中国在线直播行业研究报告[EB/OL]. [2022-07-04]. https://www.iimedia.cn/c400/81868.html.
CONSULTING I R. 2021Q3 China online live streaming industry research report[EB/OL]. [2022-07-04].https://www.iimedia.cn/c400/81868.html.
[7] CLAYPOOL M, FARRINGTON D, MUESCH N. Measurement-based analysis of the video characteristics of twitch. tv[C]// BERRY J, BERTOZZI E, FIELLIN L, et al. 2015 IEEE Games Entertainment Media Conference (GEM). Toronto, Canada: IEEE, 2015: 1-4.
[8] PIRES K, SIMON G. YouTube live and twitch: a tour of user-generated live streaming systems[C]// OOI W T, FENG W-C, LIU F. Proceedings of the 6th ACM Multimedia Systems Conference. Oregon, USA: ACM, 2015: 225-230.
[9] ZHU Z H, YANG Z, DAI Y F. Understanding the gift-sending interaction on live-streaming video websites[C]// MEISELWITZ G. International Conference on Social Computing and Social Media. Vancouver, Canada: Springer, 2017: 274-285.
[10] NASCIMENTO G, RIBEIRO M, CERF L, et al. Modeling and analyzing the video game live-streaming community[C]// BAEZA-YATES R. 2014 9th Latin American Web Congress. Minas Gerais, Brazil: IEEE, 2014: 1-9.
[11] ZHAO J, MA M, GONG W, et al. Social media stickiness in mobile personal livestreaming service[C]// LAB C. 2017 IEEE/ACM 25th International Symposium on Quality of Service (IWQoS). Vilanova i la Geltrú, Spain: IEEE, 2017: 1-2.
[12] PIRES K, SIMON G. Dash in twitch: adaptive bitrate streaming in live game streaming platforms[C]// HASSAN M, BEGEN A C, TIMMERER C. Proceedings of the 2014 Workshop on Design, Quality and Deployment of Adaptive Video Streaming. Sydney, Australia: ACM, 2014: 13-18.
[13] ZHANG C, LIU J. On crowdsourced interactive live streaming: a twitch. tv-based measurement study[C]// FENG W-C, ZINK M. Proceedings of the 25th ACM Workshop on Network and Operating Systems Support for Digital Audio and Video. Oregon, USA: ACM, 2015: 55-60.
[14] HAMILTON W A, GARRETSON O, KERNE A. Streaming on twitch: fostering participatory communities of play within live mixed media[C]// JONES M, PALANQUE P, SCHMIDT A, et al. Proceedings of the 32nd annual ACM Conference on Human Factors in Computing Systems. Toronto, Canada: ACM, 2014: 1315-1324.
[15] LYKOUSAS N, GóMEZ V, PATSAKIS C. Adult content in social live streaming services: characterizing deviant users and relationships[C]// BRANDES U, REDDY C, TAGARELLI A. 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). Barcelona, Spain: IEEE, 2018: 375-382.
[16] 郭淑慧, 吕欣. 网络直播平台数据挖掘与行为分析综述[J]. 物理学报, 2020, 69(8): 117-126.
GUO S, LU X. Live streaming: data mining and behavior analysis[J].Acta Physica Sinica, 2020, 69(8): 117-126.
[17] BORGES A, GOMES P, NACIF J, et al. Characterizing sopcast client behavior[J]. Computer Communications, 2012, 35(8): 1004-1016.
[18] VELOSO E, ALMEIDA V, MEIRA W, et al. A hierarchical characterization of a live streaming media workload[C]// KüHLEWIND M, KUTSCHER D. Proceedings of the 2nd ACM SIGCOMM Workshop on Internet measurment. Marseille France: ACM, 2002: 117-130.
[19] DENG J, CUADRADO F, TYSON G, et al. Behind the game: exploring the twitch streaming platform[C]// NETGAMES. 2015 International Workshop on Network and Systems Support for Games (NetGames). Zagreb, Croatia: IEEE, 2015: 1-6.
[20] JIA A L, SHEN S, EPEMA D H, et al. When game becomes life: the creators and spectators of online game replays and live streaming[J]. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2016, 12(4): 47.
[21] FALLICA B, LU Y, KUIPERS F, et al. On the quality of experience of SopCast[C]// AT-BEGAIN K, CUEVAS A. 2008 The Second International Conference on Next Generation Mobile Applications, Services, and Technologies. Cardiff, Hnited Kingdom: IEEE, 2008: 501-506.
[22] 中国信息通信研究院. 2018下半年中国网络直播行业景气指数及短视频报告[EB/OL]. [2022-07-04]. http://www.caict.ac.cn/kxyj/qwfb/ztbg/201907/P020190711347399467992.pdf.
TECHNOLOGY C A O I A C. China's online live streaming industry prosperity index and short video report in the second half of 2018[EB/OL]. [2022-07-04].http://www.caict.ac.cn/kxyj/qwfb/ztbg/201907/P020190711347399467992.pdf.
[23] KHADEMALOMOOM S, NARAYAN P K. Intraday effects of the currency market[J]. Journal of International Financial Markets, Institutions and Money, 2019, 58(1): 65-77.
[24] PINK D H. When: The Scientific Secrets of Perfect Timing[M]. New York: Penguin Press, 2019: 15-20.
[25] HINES C B. Time-of-day effects on human performance[J]. Journal of Catholic Education, 2004, 7(3): 390-413.
[26] BERNARD T, GIACOMONI M, GAVARRY O, et al. Time-of-day effects in maximal anaerobic leg exercise[J]. European Journal of Applied Physiology and Occupational Physiology, 1997, 77(1-2): 133-138.
[27] MÜLLER U A, DACOROGNA M M, OLSEN R B, et al. Statistical study of foreign exchange rates, empirical evidence of a price change scaling law, and intraday analysis[J]. Journal of Banking & Finance, 1990, 14(6): 1189-1208.
[28] STOHR D, LI T, WILK S, et al. An analysis of the YouNow live streaming platform[C]// KANHERE S, TöLLE J, CHERKAOUI S. 2015 IEEE 40th Local Computer Networks Conference Workshops (LCN Workshops). Florida, USA: IEEE, 2015: 673-679.
[29] GUPTA S, HANSSENS D, HARDIE B, et al. Modeling customer lifetime value[J]. Journal of service research, 2006, 9(2): 139-155.
[30] SRIPANIDKULCHAI K, MAGGS B, ZHANG H. An analysis of live streaming workloads on the internet[C]// LOMBARDO A, KUROSE J. Proceedings of the 4th ACM SIGCOMM conference on Internet measurement. Sicily, Italy: ACM, 2004: 41-54.
[31] 樊超, 郭进利, 韩筱璞, 等. 人类行为动力学研究综述[J]. 复杂系统与复杂性科学, 2011, 8(2): 1-17.
FAN C, GUO J, HAN X, et al. A review of research on human dynamics[J]. Complex Systems and Complexity Science, 2011, 8(2): 1-17.
[32] 李爽, 陈亚荣. 网络直播环境下人际互动对用户行为意愿的影响研究[J]. 中国市场, 2018, 1(7): 18-20.
LI S, CHEN Y. Research on the influence of interpersonal interaction on user behavior intention in the environment of online live streaming[J]. China Market, 2018, 1(7): 18-20.
[33] LILJEROS F, EDLING C R, AMARAL L A, et al. The web of human sexual contacts[J]. Nature, 2001, 411(6840): 907-8.
[34] BARÁBASI A L, ALBERT R. Emergence of scaling in random networks[J]. Science, 1999, 286(5439): 509-12.
[35] REDNER S. How popular is your paper? An empirical study of the citation distribution[J]. European Physical Journal B Condensed Matter Physics,1998,4(2): 131-134.
[36] REPETOWICZ P, HUTZLER S, RICHMOND P. Dynamics of money and income distributions[J]. Physica A: Statistical Mechanics and Its Applications, 2005, 356(2-4): 641-654.
[37] ALMEIDA J M, KRUEGER J, EAGER D L, et al. Analysis of educational media server workloads[C]// NIEH J. Proceedings of the 11th International Workshop on Network and Operating Systems Support for Digital Audio and Video. New York, USA: ACM, 2001: 21-30.
[38] DA SILVA D F C, NETO R D M S. Population dynamics and spatial dependence: evidence from Brazilian cities[J]. Review of Regional Studies, 2019, 49(3): 454-473.
[39] GUO Q, GAO L. Distribution of individual incomes in China between 1992 and 2009[J]. Physica A: Statistical Mechanics and Its Applications, 2012, 391(21): 5139-5145.
[1] 聂廷远, 王艳伟, 聂晶晶, 刘鹏飞. 基于注意力机制和复杂网络的FPGA可布性预测[J]. 复杂系统与复杂性科学, 2026, 23(1): 53-59.
[2] 潘文祥, 李东艳, 孙思翔, 佟宁. 一种基于社团外围节点的网络鲁棒性优化策略[J]. 复杂系统与复杂性科学, 2026, 23(1): 70-78.
[3] 刘博航, 赵强, 唐政林, 唐英龙, 李业琪. 自动驾驶场景下的高效多任务视觉感知模型[J]. 复杂系统与复杂性科学, 2026, 23(1): 130-137.
[4] 黄锦钿. 基于改进文化基因算法的设备混合批动态调度[J]. 复杂系统与复杂性科学, 2025, 22(4): 71-77.
[5] 章浩淳, 寇博潇, 张泰杰, 唐智慧. 基于Granger Causality的滑坡机理网络客观权值确定方法[J]. 复杂系统与复杂性科学, 2025, 22(4): 63-70.
[6] 韩世翔, 闫光辉, 裴华艳. 复杂网络上双向免疫对传染病传播的影响[J]. 复杂系统与复杂性科学, 2025, 22(4): 55-62.
[7] 霍宣蓉, 肖玉芝, 韩佳新, 黄涛, 胡泽宇. 基于节点特征增强的信息溯源模型[J]. 复杂系统与复杂性科学, 2025, 22(3): 1-10.
[8] 焦然, 许小可. 呼吸道传染病聚集性疫情的传播网络分析[J]. 复杂系统与复杂性科学, 2025, 22(3): 11-16.
[9] 赵光哲, 金铭, 邱爽, 王雪平, 闫飞虎. 文本驱动的人体运动生成综述[J]. 复杂系统与复杂性科学, 2025, 22(2): 64-72.
[10] 张元东, 张先杰, 张若楠, 张海峰. 基于多层超图卷积神经网络的故障诊断方法[J]. 复杂系统与复杂性科学, 2025, 22(1): 131-137.
[11] 李寒, 安新磊, 刘思洋, 王越. 基于忆阻自激振荡系统的图像加密算法[J]. 复杂系统与复杂性科学, 2025, 22(1): 154-160.
[12] 高天, 许小可. 基于社团结构的抑制校园新冠传播研究[J]. 复杂系统与复杂性科学, 2024, 21(3): 9-16.
[13] 刘思洋, 安新磊, 施倩倩, 王越. 一类多涡卷Chua系统及其在图像加密中的应用[J]. 复杂系统与复杂性科学, 2024, 21(3): 85-92.
[14] 田梦龙, 张纪会. 跨层四向穿梭车仓库复合作业路径优化[J]. 复杂系统与复杂性科学, 2024, 21(3): 100-107.
[15] 高峰. 复杂网络深度重叠结构的发现[J]. 复杂系统与复杂性科学, 2024, 21(2): 15-21.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed