当前位置: X-MOL 学术Big Data › 论文详情
Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)
Predicting Sociodemographic Attributes from Mobile Usage Patterns: Applications and Privacy Implications.
Big Data ( IF 4.6 ) Pub Date : 2023-08-14 , DOI: 10.1089/big.2022.0182
Rouzbeh Razavi 1 , Guisen Xue 1 , Ikpe Justice Akpan 2
Affiliation  

When users interact with their mobile devices, they leave behind unique digital footprints that can be viewed as predictive proxies that reveal an array of users' characteristics, including their demographics. Predicting users' demographics based on mobile usage can provide significant benefits for service providers and users, including improving customer targeting, service personalization, and market research efforts. This study uses machine learning algorithms and mobile usage data from 235 demographically diverse users to examine the accuracy of predicting their sociodemographic attributes (age, gender, income, and education) from mobile usage metadata, filling the gap in the current literature by quantifying the predictive power of each attribute and discussing the practical applications and privacy implications. According to the results, gender can be most accurately predicted (balanced accuracy = 0.862) from mobile usage footprints, whereas predicting users' education level is more challenging (balanced accuracy = 0.719). Moreover, the classification models were able to classify users based on whether their age or income was above or below a certain threshold with acceptable accuracy. The study also presents the practical applications of inferring demographic attributes from mobile usage data and discusses the implications of the findings, such as privacy and discrimination risks, from the perspectives of different stakeholders.

中文翻译:

从移动使用模式预测社会人口特征:应用程序和隐私影响。

当用户与移动设备交互时,他们会留下独特的数字足迹,这些足迹可以被视为揭示一系列用户特征(包括人口统计数据)的预测代理。根据移动使用情况预测用户的人口统计数据可以为服务提供商和用户带来显着的好处,包括改进客户定位、服务个性化和市场研究工作。本研究使用机器学习算法和来自 235 个人口结构不同的用户的移动使用数据来检查从移动使用元数据预测其社会人口统计属性(年龄、性别、收入和教育)的准确性,通过量化预测值来填补当前文献中的空白每个属性的力量并讨论实际应用和隐私影响。根据结果​​,通过移动使用足迹可以最准确地预测性别(平衡准确度 = 0.862),而预测用户的教育水平则更具挑战性(平衡准确度 = 0.719)。此外,分类模型能够根据用户的年龄或收入是否高于或低于某个阈值,以可接受的精度对用户进行分类。该研究还介绍了从移动使用数据推断人口统计属性的实际应用,并从不同利益相关者的角度讨论了研究结果的影响,例如隐私和歧视风险。
更新日期:2023-08-14
down
wechat
bug