research-article

Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System

Authors:
Derun Song

Northeastern University, Shenyang, China

Northeastern University, Shenyang, China

0009-0005-1127-3623
View Profile

,
Enneng Yang

Northeastern University, Shenyang, China

Northeastern University, Shenyang, China

0000-0001-5419-5286
View Profile

,
Guibing Guo

Northeastern University, Shenyang, China

Northeastern University, Shenyang, China

0000-0002-1709-5056
View Profile

,
Li Shen

JD Explore Academy, Beijing, China

JD Explore Academy, Beijing, China

0000-0001-5659-3464
View Profile

,
Linying Jiang

Northeastern University, Shenyang, China

Northeastern University, Shenyang, China

0000-0001-7492-0473
View Profile

,
Xingwei Wang

Northeastern University, Shenyang, China

Northeastern University, Shenyang, China

0000-0003-2856-4716
View Profile

ACM Transactions on Knowledge Discovery from Data Volume 18 Issue 6Article No.: 142pp 1–20https://doi.org/10.1145/3651312

Published:12 April 2024Publication History

ACM Transactions on Knowledge Discovery from Data

Abstract

Multi-scenario and multi-task recommendation can use various feedback behaviors of users in different scenarios to learn users’ preferences and then make recommendations, which has attracted attention. However, the existing work ignores feature interactions and the fact that a pair of feature interactions will have differing levels of importance under different scenario-task pairs, leading to sub-optimal user preference learning. In this article, we propose a Multi-scenario and Multi-task aware Feature Interaction model, dubbed MMFI, to explicitly model feature interactions and learn the importance of feature interaction pairs in different scenarios and tasks. Specifically, MMFI first incorporates a pairwise feature interaction unit and a scenario-task interaction unit to effectively capture the interaction of feature pairs and scenario-task pairs. Then MMFI designs a scenario-task aware attention layer for learning the importance of feature interactions from coarse-grained to fine-grained, improving the model’s performance on various scenario-task pairs. More specifically, this attention layer consists of three modules: a fully shared bottom module, a partially shared middle module, and a specific output module. Finally, MMFI adapts two sparsity-aware functions to remove some useless feature interactions. Extensive experiments on two public datasets demonstrate the superiority of the proposed method over the existing multi-task recommendation, multi-scenario recommendation, and multi-scenario & multi-task recommendation models.

REFERENCES

[1] Bengio Yoshua, Louradour Jérôme, Collobert Ronan, and Weston Jason. 2009. Curriculum learning. In Proceedings of the ICML. 41–48.Google ScholarDigital Library
[2] Mathieu Blondel, Akinori Fujino, Naonori Ueda, and Masakazu Ishihata. 2016. Higher-order factorization machines. 566 In Proceedings of the NeurIPS.Google Scholar
[3] Caruana R. 1993. Multitask learning: A knowledge-based source of inductive bias1. In Proceedings of the ICML. Citeseer, 41–48.Google ScholarCross Ref
[4] Jianxin Chang, Chenbin Zhang, Yiqun Hui, Dewei Leng, Yanan Niu, Yang Song, and Kun Gai. 2023. Pepnet: Parameter and embedding personalized network for infusing with personalized prior information. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3795–3804.Google Scholar
[5] Chen Yuting, Wang Yanshi, Ni Yabo, Zeng An-Xiang, and Lin Lanfen. 2020. Scenario-aware and Mutual-based approach for multi-scenario recommendation in e-commerce. In Proceedings of the ICDMW. IEEE, 127–135.Google ScholarCross Ref
[6] Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, Rohan Anil, Zakaria Haque, Lichan Hong, Vihan Jain, Xiaobing Liu, and Hemal Shah. 2016. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. 7–10.Google Scholar
[7] Chu Yabo, Yang Enneng, Liu Qiang, Liu Yuting, Jiang Linying, and Guo Guibing. 2022. Bi-directional contrastive distillation for multi-behavior recommendation. In Proceedings of the ECML-PKDD. Springer, 491–507.Google Scholar
[8] Covington Paul, Adams Jay, and Sargin Emre. 2016. Deep neural networks for Youtube recommendations. In Proceedings of the RecSys. 191–198.Google ScholarDigital Library
[9] Dang Yizhou, Yang Enneng, Guo Guibing, Jiang Linying, Wang Xingwei, Xu Xiaoxiao, Sun Qinghui, and Liu Hong. 2023. Uniform sequence better: Time interval aware data augmentation for sequential recommendation. In Proceedings of the AAAI, Vol. 37. 4225–4232.Google ScholarDigital Library
[10] Deng Yang, Zhang Wenxuan, Xu Weiwen, Lei Wenqiang, Chua Tat-Seng, and Lam Wai. 2023. A unified multi-task learning framework for multi-goal conversational recommender systems. TOIS 41, 3 (2023), 1–25.Google ScholarDigital Library
[11] Ke Ding, Xin Dong, Yong He, Lei Cheng, Chilin Fu, Zhaoxin Huan, Hai Li, Tan Yan, Liang Zhang, Xiaolu Zhang, and Linjian Mo. 2021. MSSM: a multiple-level sparse sharing model for efficient multi-task learning. In SIGIR. 2237–2241.Google Scholar
[12] Duchi John C., Hazan Elad, and Singer Yoram. 2011. Adaptive subgradient methods for online learning and stochastic optimization. JMLR 12, 7 (2011), 2121–2159.Google ScholarDigital Library
[13] Fawcett Tom. 2006. An introduction to ROC analysis. Pattern Recognition Letters 27, 8 (2006), 861–874.Google ScholarDigital Library
[14] Gao Min, Li Jian-Yu, Chen Chun-Hua, Li Yun, Zhang Jun, and Zhan Zhi-Hui. 2023. Enhanced multi-task learning and knowledge graph-based recommender system. TKDE 35, 7 (2023), 10281–10294.Google Scholar
[15] Guo Huifeng, Tang Ruiming, Ye Yunming, Li Zhenguo, and He Xiuqiang. 2017. DeepFM: A factorization-machine based neural network for CTR prediction. In Proceedings of the IJCAI. 1725–1731.Google ScholarCross Ref
[16] He Xiangnan and Chua Tat-Seng. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the SIGIR. 355–364.Google ScholarDigital Library
[17] He Xiangnan, Liao Lizi, Zhang Hanwang, Nie Liqiang, Hu Xia, and Chua Tat-Seng. 2017. Neural collaborative filtering. In Proceedings of the WWW. 173–182.Google ScholarDigital Library
[18] He Yun, Feng Xue, Cheng Cheng, Ji Geng, Guo Yunsong, and Caverlee James. 2022. MetaBalance: Improving multi-task recommendations via adapting gradient magnitudes of auxiliary tasks. In Proceedings of the WWW. 2205–2215.Google Scholar
[19] Hu Yuzheng, Xian Ruicheng, Wu Qilong, Fan Qiuling, Yin Lang, and Zhao Han. 2023. Revisiting scalarization in multi-task learning: A theoretical perspective. In Proceedings of the NeurIPS.Google Scholar
[20] Junguang Jiang, Baixu Chen, Junwei Pan, Ximei Wang, Dapeng Liu, Jie Jiang, and Mingsheng Long. 2023. ForkMerge: Mitigating negative transfer in auxiliary-task learning. In NeurIPS.Google Scholar
[21] Juan Yuchin, Zhuang Yong, Chin Wei-Sheng, and Lin Chih-Jen. 2016. Field-aware factorization machines for CTR prediction. In Proceedings of the RecSys. 43–50.Google ScholarDigital Library
[22] Kingma Diederik P. and Ba Jimmy. 2015. Adam: A method for stochastic optimization. In Proceedings of the ICLR.Google Scholar
[23] Koren Yehuda, Bell Robert, and Volinsky Chris. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 30–37.Google ScholarDigital Library
[24] Li Pengcheng, Li Runze, Da Qing, Zeng An-Xiang, and Zhang Lijun. 2020. Improving multi-scenario learning to rank in e-commerce by exploiting task relationships in the label space. In Proceedings of the CIKM. 2605–2612.Google ScholarDigital Library
[25] Lian Jianxun, Zhou Xiaohuan, Zhang Fuzheng, Chen Zhongxia, Xie Xing, and Sun Guangzhong. 2018. xDeepFM: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of the KDD. 1754–1763.Google ScholarDigital Library
[26] Liu Bin, Zhu Chenxu, Li Guilin, Zhang Weinan, Lai Jincai, Tang Ruiming, He Xiuqiang, Li Zhenguo, and Yu Yong. 2020. Autofis: Automatic feature interaction selection in factorization models for click-through rate prediction. In Proceedings of the SIGKDD. 2636–2645.Google ScholarDigital Library
[27] Liu Shikun, Johns Edward, and Davison Andrew J.. 2019. End-to-end multi-task learning with attention. In Proceedings of the CVPR. 1871–1880.Google ScholarCross Ref
[28] Ma Jiaqi, Zhao Zhe, Yi Xinyang, Chen Jilin, Hong Lichan, and Chi Ed H.. 2018. Modeling task relationships in multi-task learning with multi-gate mixture-of-experts. In Proceedings of the SIGKDD. 1930–1939.Google ScholarDigital Library
[29] Ma Xiao, Zhao Liqin, Huang Guan, Wang Zhi, Hu Zelin, Zhu Xiaoqiang, and Gai Kun. 2018. Entire space multi-task model: An effective approach for estimating post-click conversion rate. In Proceedings of the SIGIR. ACM, 1137–1140.Google ScholarDigital Library
[30] Misra Ishan, Shrivastava Abhinav, Gupta Abhinav, and Hebert Martial. 2016. Cross-stitch networks for multi-task learning. In Proceedings of the CVPR. 3994–4003.Google ScholarCross Ref
[31] Pan Junwei, Mao Yizhi, Ruiz Alfonso Lobos, Sun Yu, and Flores Aaron. 2019. Predicting different types of conversions with multi-task learning in online advertising. In Proceedings of the SIGKDD. 2689–2697.Google ScholarDigital Library
[32] Pan Junwei, Xu Jian, Ruiz Alfonso Lobos, Zhao Wenliang, Pan Shengjun, Sun Yu, and Lu Quan. 2018. Field-weighted factorization machines for click-through rate prediction in display advertising. In Proceedings of the WWW. 1349–1357.Google ScholarDigital Library
[33] Reddi Sashank J., Kale Satyen, and Kumar Sanjiv. 2018. On the Convergence of Adam and Beyond. In Proceedings of the ICLR.Google Scholar
[34] Rendle Steffen. 2010. Factorization machines. In Proceedings of the ICDM. IEEE, 995–1000.Google ScholarDigital Library
[35] Ruder Sebastian, Bingel Joachim, Augenstein Isabelle, and Søgaard Anders. 2019. Latent multi-task architecture learning. In Proceedings of the AAAI. AAAI Press, 4822–4829.Google ScholarDigital Library
[36] Sener Ozan and Koltun Vladlen. 2018. Multi-task learning as multi-objective optimization. InProceedings of the NeurIPS.Google Scholar
[37] Shazeer Noam, Mirhoseini Azalia, Maziarz Krzysztof, Davis Andy, Le Quoc, Hinton Geoffrey, and Dean Jeff. 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. In Proceedings of the ICLR.Google Scholar
[38] Shelke Mayuri S., Deshmukh Prashant R., and Shandilya Vijaya K.. 2017. A review on imbalanced data handling using undersampling and oversampling technique. IJRTER 3, 4 (2017), 444–449.Google ScholarCross Ref
[39] Shen Qijie, Tao Wanjie, Zhang Jing, Wen Hong, Chen Zulong, and Lu Quan. 2021. SAR-Net: A scenario-aware ranking network for personalized fair recommendation in hundreds of travel scenarios. In Proceedings of the CIKM. 4094–4103.Google ScholarDigital Library
[40] Xiang-Rong Sheng, Liqin Zhao, Guorui Zhou, Xinyao Ding, Binding Dai, Qiang Luo, Siran Yang, Jingshan Lv, Chi Zhang, Hongbo Deng, and Xiaoqiang Zhu. 2021. One model to serve all: Star topology adaptive recommender for multi-domain ctr prediction. In CIKM. 4104–4113.Google Scholar
[41] Sun Tianxiang, Shao Yunfan, Li Xiaonan, Liu Pengfei, Yan Hang, Qiu Xipeng, and Huang Xuanjing. 2020. Learning sparse sharing architectures for multiple tasks. In Proceedings of the AAAI, Vol. 34. 8936–8943.Google ScholarCross Ref
[42] Sun Yang, Pan Junwei, Zhang Alex, and Flores Aaron. 2021. FM2: Field-matrixed factorization machines for recommender systems. In Proceedings of the WWW. 2828–2837.Google ScholarDigital Library
[43] Tang Hongyan, Liu Junning, Zhao Ming, and Gong Xudong. 2020. Progressive layered extraction (PLE): A novel multi-task learning (MTL) model for personalized recommendations. In Proceedings of the RecSys. 269–278.Google ScholarDigital Library
[44] Oord Aaron Van den, Dieleman Sander, and Schrauwen Benjamin. 2013. Deep content-based music recommendation. In Proceedings of the NeurIPS.Google Scholar
[45] Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Łukasz, and Polosukhin Illia. 2017. Attention is all you need. In Proceedings of the NeurIPS.Google Scholar
[46] Wang Dongjing, Zhang Xin, Yin Yuyu, Yu Dongjin, Xu Guandong, and Deng Shuiguang. 2023. Multi-view enhanced graph attention network for session-based music recommendation. TOIS 42, 10 (2023), 1–30.Google Scholar
[47] Wang Ruoxi, Fu Bin, Fu Gang, and Wang Mingliang. 2017. Deep & cross network for ad click predictions. In Proceedings of the ADKDD’17. 1–7.Google ScholarDigital Library
[48] Wang Ruoxi, Shivanna Rakesh, Cheng Derek, Jain Sagar, Lin Dong, Hong Lichan, and Chi Ed. 2021. Dcn v2: Improved deep & cross network and practical lessons for web-scale learning to rank systems. In Proceedings of the WWW. 1785–1797.Google ScholarDigital Library
[49] Wang Xiang, He Xiangnan, Wang Meng, Feng Fuli, and Chua Tat-Seng. 2019. Neural graph collaborative filtering. In Proceedings of the SIGIR. 165–174.Google ScholarDigital Library
[50] Wen Hong, Zhang Jing, Wang Yuan, Lv Fuyu, Bao Wentian, Lin Quan, and Yang Keping. 2020. Entire space multi-task modeling via post-click behavior decomposition for conversion rate prediction. In Proceedings of the SIGIR. ACM, 2377–2386.Google ScholarDigital Library
[51] Xi Dongbo, Chen Zhen, Yan Peng, Zhang Yinger, Zhu Yongchun, Zhuang Fuzhen, and Chen Yu. 2021. Modeling the sequential dependence among audience multi-step conversions with multi-task learning in targeted display advertising. In Proceedings of the KDD.Google Scholar
[52] Xiao Jun, Ye Hao, He Xiangnan, Zhang Hanwang, Wu Fei, and Chua Tat-Seng. 2017. Attentional factorization machines: Learning the weight of feature interactions via attention networks. In Proceedings of the IJCAI. 3119–3125.Google ScholarCross Ref
[53] Xie Ruobing, Liu Yanlei, Zhang Shaoliang, Wang Rui, Xia Feng, and Lin Leyu. 2021. Personalized approximate pareto-efficient recommendation. In Proceedings of the WWW. ACM / IW3C2, 3839–3849.Google ScholarDigital Library
[54] Xu En, Yu Zhiwen, Guo Bin, and Cui Helei. 2021. Core interest network for click-through rate prediction. TKDD 15, 2 (2021), 1–16.Google ScholarDigital Library
[55] Yang Enneng, Pan Junwei, Wang Ximei, Yu Haibin, Shen Li, Chen Xihua, Xiao Lei, Jiang Jie, and Guo Guibing. 2023. AdaTask: A task-aware adaptive learning rate approach to multi-task learning. In Proceedings of the AAAI, Vol. 37. 10745–10753.Google ScholarDigital Library
[56] Yu Dongjin, Wang Xingliang, Xiong Yu, Shen Xudong, Wu Runze, Wang Dongjing, Zou Zhene, and Xu Guandong. 2023. MHANER: A multi-source heterogeneous graph attention network for explainable recommendation in online games. TIST (2023).Google ScholarDigital Library
[57] Yuan Guanghu, Yuan Fajie, Li Yudong, Kong Beibei, Li Shujie, Chen Lei, Yang Min, YU Chenyun, Hu Bo, Li Zang, Xu Yu, and Qie Xiaohu. 2022. Tenrec: A large-scale multipurpose benchmark dataset for recommender systems. In Proceedings of the NeurIPS Datasets and Benchmarks Track.Google Scholar
[58] Zhang Qianqian, Liao Xinru, Liu Quan, Xu Jian, and Zheng Bo. 2022. Leaving no one behind: A multi-scenario multi-task meta learning approach for advertiser modeling. In Proceedings of the WSDM. 1368–1376.Google ScholarDigital Library
[59] Zhou Guorui, Zhu Xiaoqiang, Song Chenru, Fan Ying, Zhu Han, Ma Xiao, Yan Yanghui, Jin Junqi, Li Han, and Gai Kun. 2018. Deep interest network for click-through rate prediction. In Proceedings of the SIGKDD. 1059–1068.Google ScholarDigital Library
[60] Jie Zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, and Qian Yu. 2023. Hinet: novel multi-scenario & multi-task learning with hierarchical information extraction. In 2023 IEEE 39th International Conference on Data Engineering (ICDE). IEEE, 2969–2975.Google Scholar
[61] Zhu Jieming, Liu Jinyang, Yang Shuai, Zhang Qi, and He Xiuqiang. 2021. Open benchmarking for click-through rate prediction. In Proceedings of the CIKM. ACM, 2759–2769.Google ScholarDigital Library
[62] Zou Fangyu, Shen Li, Jie Zequn, Zhang Weizhong, and Liu Wei. 2019. A sufficient condition for convergences of Adam and RMSProp. In Proceedings of the CVPR. 11127–11135.Google ScholarCross Ref
[63] Zou Xinyu, Hu Zhi, Zhao Yiming, Ding Xuchu, Liu Zhongyi, Li Chenliang, and Sun Aixin. 2022. Automatic expert selection for multi-scenario and multi-task search. In Proceedings of the SIGIR. 1535–1544.Google ScholarDigital Library

Index Terms

Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System
1. Computer systems organization
  1. Dependable and fault-tolerant systems and networks
    1. Redundancy
  2. Embedded and cyber-physical systems
    1. Embedded systems
    2. Robotics

Recommendations

Using a trust network to improve top-N recommendation
RecSys '09: Proceedings of the third ACM conference on Recommender systems

Top-N item recommendation is one of the important tasks of recommenders. Collaborative filtering is the most popular approach to building recommender systems which can predict ratings for a given user and item. Collaborative filtering can be extended ...
Read More
An Effective Implicit Multi-interest Interaction Network for Recommendation
Neural Information Processing
Abstract
Data features in real industrial recommendation scenarios are high-dimensional, diverse and sparse. Rich feature interaction can improve the model effect and bring practical benefits. Factorization machines (FMs) can perform explicit second-order ...
Read More
Typicality-Based Collaborative Filtering Recommendation

Collaborative filtering (CF) is an important and popular technology for recommender systems. However, current CF methods suffer from such problems as data sparsity, recommendation inaccuracy, and big-error in predictions. In this paper, we borrow ideas ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in

ACM Transactions on Knowledge Discovery from Data Volume 18, Issue 6
July 2024
535 pages
ISSN:1556-4681
EISSN:1556-472X
DOI:10.1145/3613684
Issue’s Table of Contents

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 12 April 2024
- Online AM: 6 March 2024
- Accepted: 27 February 2024
- Revised: 26 December 2023
- Received: 28 July 2023
Published in tkdd Volume 18, Issue 6

Check for updates
Author Tags
Recommendation
interaction
Qualifiers
- research-article
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 233
  Total Downloads
- Downloads (Last 12 months)233
- Downloads (Last 6 weeks)160
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Full Text

View this article in Full Text.

View Full Text

Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System

ACM Transactions on Knowledge Discovery from Data

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Using a trust network to improve top-N recommendation

An Effective Implicit Multi-interest Interaction Network for Recommendation

Typicality-Based Collaborative Filtering Recommendation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Full Text

Caption

Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System

ACM Transactions on Knowledge Discovery from Data

Abstract

REFERENCES

Cited By

Index Terms

Recommendations

Using a trust network to improve top-N recommendation

An Effective Implicit Multi-interest Interaction Network for Recommendation

Typicality-Based Collaborative Filtering Recommendation

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Full Text

Share this Publication link

Share on Social Media