skip to main content
research-article

Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System

Published:12 April 2024Publication History
Skip Abstract Section

Abstract

Multi-scenario and multi-task recommendation can use various feedback behaviors of users in different scenarios to learn users’ preferences and then make recommendations, which has attracted attention. However, the existing work ignores feature interactions and the fact that a pair of feature interactions will have differing levels of importance under different scenario-task pairs, leading to sub-optimal user preference learning. In this article, we propose a Multi-scenario and Multi-task aware Feature Interaction model, dubbed MMFI, to explicitly model feature interactions and learn the importance of feature interaction pairs in different scenarios and tasks. Specifically, MMFI first incorporates a pairwise feature interaction unit and a scenario-task interaction unit to effectively capture the interaction of feature pairs and scenario-task pairs. Then MMFI designs a scenario-task aware attention layer for learning the importance of feature interactions from coarse-grained to fine-grained, improving the model’s performance on various scenario-task pairs. More specifically, this attention layer consists of three modules: a fully shared bottom module, a partially shared middle module, and a specific output module. Finally, MMFI adapts two sparsity-aware functions to remove some useless feature interactions. Extensive experiments on two public datasets demonstrate the superiority of the proposed method over the existing multi-task recommendation, multi-scenario recommendation, and multi-scenario & multi-task recommendation models.

REFERENCES

  1. [1] Bengio Yoshua, Louradour Jérôme, Collobert Ronan, and Weston Jason. 2009. Curriculum learning. In Proceedings of the ICML. 4148.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. [2] Mathieu Blondel, Akinori Fujino, Naonori Ueda, and Masakazu Ishihata. 2016. Higher-order factorization machines. 566 In Proceedings of the NeurIPS.Google ScholarGoogle Scholar
  3. [3] Caruana R. 1993. Multitask learning: A knowledge-based source of inductive bias1. In Proceedings of the ICML. Citeseer, 4148.Google ScholarGoogle ScholarCross RefCross Ref
  4. [4] Jianxin Chang, Chenbin Zhang, Yiqun Hui, Dewei Leng, Yanan Niu, Yang Song, and Kun Gai. 2023. Pepnet: Parameter and embedding personalized network for infusing with personalized prior information. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3795–3804.Google ScholarGoogle Scholar
  5. [5] Chen Yuting, Wang Yanshi, Ni Yabo, Zeng An-Xiang, and Lin Lanfen. 2020. Scenario-aware and Mutual-based approach for multi-scenario recommendation in e-commerce. In Proceedings of the ICDMW. IEEE, 127135.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Heng-Tze Cheng, Levent Koc, Jeremiah Harmsen, Tal Shaked, Tushar Chandra, Hrishi Aradhye, Glen Anderson, Greg Corrado, Wei Chai, Mustafa Ispir, Rohan Anil, Zakaria Haque, Lichan Hong, Vihan Jain, Xiaobing Liu, and Hemal Shah. 2016. Wide & deep learning for recommender systems. In Proceedings of the 1st Workshop on Deep Learning for Recommender Systems. 7–10.Google ScholarGoogle Scholar
  7. [7] Chu Yabo, Yang Enneng, Liu Qiang, Liu Yuting, Jiang Linying, and Guo Guibing. 2022. Bi-directional contrastive distillation for multi-behavior recommendation. In Proceedings of the ECML-PKDD. Springer, 491507.Google ScholarGoogle Scholar
  8. [8] Covington Paul, Adams Jay, and Sargin Emre. 2016. Deep neural networks for Youtube recommendations. In Proceedings of the RecSys. 191198.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Dang Yizhou, Yang Enneng, Guo Guibing, Jiang Linying, Wang Xingwei, Xu Xiaoxiao, Sun Qinghui, and Liu Hong. 2023. Uniform sequence better: Time interval aware data augmentation for sequential recommendation. In Proceedings of the AAAI, Vol. 37. 42254232.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Deng Yang, Zhang Wenxuan, Xu Weiwen, Lei Wenqiang, Chua Tat-Seng, and Lam Wai. 2023. A unified multi-task learning framework for multi-goal conversational recommender systems. TOIS 41, 3 (2023), 125.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Ke Ding, Xin Dong, Yong He, Lei Cheng, Chilin Fu, Zhaoxin Huan, Hai Li, Tan Yan, Liang Zhang, Xiaolu Zhang, and Linjian Mo. 2021. MSSM: a multiple-level sparse sharing model for efficient multi-task learning. In SIGIR. 2237–2241.Google ScholarGoogle Scholar
  12. [12] Duchi John C., Hazan Elad, and Singer Yoram. 2011. Adaptive subgradient methods for online learning and stochastic optimization. JMLR 12, 7 (2011), 21212159.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Fawcett Tom. 2006. An introduction to ROC analysis. Pattern Recognition Letters 27, 8 (2006), 861874.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Gao Min, Li Jian-Yu, Chen Chun-Hua, Li Yun, Zhang Jun, and Zhan Zhi-Hui. 2023. Enhanced multi-task learning and knowledge graph-based recommender system. TKDE 35, 7 (2023), 10281–10294.Google ScholarGoogle Scholar
  15. [15] Guo Huifeng, Tang Ruiming, Ye Yunming, Li Zhenguo, and He Xiuqiang. 2017. DeepFM: A factorization-machine based neural network for CTR prediction. In Proceedings of the IJCAI. 17251731.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] He Xiangnan and Chua Tat-Seng. 2017. Neural factorization machines for sparse predictive analytics. In Proceedings of the SIGIR. 355364.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] He Xiangnan, Liao Lizi, Zhang Hanwang, Nie Liqiang, Hu Xia, and Chua Tat-Seng. 2017. Neural collaborative filtering. In Proceedings of the WWW. 173182.Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. [18] He Yun, Feng Xue, Cheng Cheng, Ji Geng, Guo Yunsong, and Caverlee James. 2022. MetaBalance: Improving multi-task recommendations via adapting gradient magnitudes of auxiliary tasks. In Proceedings of the WWW. 22052215.Google ScholarGoogle Scholar
  19. [19] Hu Yuzheng, Xian Ruicheng, Wu Qilong, Fan Qiuling, Yin Lang, and Zhao Han. 2023. Revisiting scalarization in multi-task learning: A theoretical perspective. In Proceedings of the NeurIPS.Google ScholarGoogle Scholar
  20. [20] Junguang Jiang, Baixu Chen, Junwei Pan, Ximei Wang, Dapeng Liu, Jie Jiang, and Mingsheng Long. 2023. ForkMerge: Mitigating negative transfer in auxiliary-task learning. In NeurIPS.Google ScholarGoogle Scholar
  21. [21] Juan Yuchin, Zhuang Yong, Chin Wei-Sheng, and Lin Chih-Jen. 2016. Field-aware factorization machines for CTR prediction. In Proceedings of the RecSys. 4350.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Kingma Diederik P. and Ba Jimmy. 2015. Adam: A method for stochastic optimization. In Proceedings of the ICLR.Google ScholarGoogle Scholar
  23. [23] Koren Yehuda, Bell Robert, and Volinsky Chris. 2009. Matrix factorization techniques for recommender systems. Computer 42, 8 (2009), 3037.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Li Pengcheng, Li Runze, Da Qing, Zeng An-Xiang, and Zhang Lijun. 2020. Improving multi-scenario learning to rank in e-commerce by exploiting task relationships in the label space. In Proceedings of the CIKM. 26052612.Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. [25] Lian Jianxun, Zhou Xiaohuan, Zhang Fuzheng, Chen Zhongxia, Xie Xing, and Sun Guangzhong. 2018. xDeepFM: Combining explicit and implicit feature interactions for recommender systems. In Proceedings of the KDD. 17541763.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Liu Bin, Zhu Chenxu, Li Guilin, Zhang Weinan, Lai Jincai, Tang Ruiming, He Xiuqiang, Li Zhenguo, and Yu Yong. 2020. Autofis: Automatic feature interaction selection in factorization models for click-through rate prediction. In Proceedings of the SIGKDD. 26362645.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Liu Shikun, Johns Edward, and Davison Andrew J.. 2019. End-to-end multi-task learning with attention. In Proceedings of the CVPR. 18711880.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Ma Jiaqi, Zhao Zhe, Yi Xinyang, Chen Jilin, Hong Lichan, and Chi Ed H.. 2018. Modeling task relationships in multi-task learning with multi-gate mixture-of-experts. In Proceedings of the SIGKDD. 19301939.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Ma Xiao, Zhao Liqin, Huang Guan, Wang Zhi, Hu Zelin, Zhu Xiaoqiang, and Gai Kun. 2018. Entire space multi-task model: An effective approach for estimating post-click conversion rate. In Proceedings of the SIGIR. ACM, 11371140.Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. [30] Misra Ishan, Shrivastava Abhinav, Gupta Abhinav, and Hebert Martial. 2016. Cross-stitch networks for multi-task learning. In Proceedings of the CVPR. 39944003.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Pan Junwei, Mao Yizhi, Ruiz Alfonso Lobos, Sun Yu, and Flores Aaron. 2019. Predicting different types of conversions with multi-task learning in online advertising. In Proceedings of the SIGKDD. 26892697.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Pan Junwei, Xu Jian, Ruiz Alfonso Lobos, Zhao Wenliang, Pan Shengjun, Sun Yu, and Lu Quan. 2018. Field-weighted factorization machines for click-through rate prediction in display advertising. In Proceedings of the WWW. 13491357.Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. [33] Reddi Sashank J., Kale Satyen, and Kumar Sanjiv. 2018. On the Convergence of Adam and Beyond. In Proceedings of the ICLR.Google ScholarGoogle Scholar
  34. [34] Rendle Steffen. 2010. Factorization machines. In Proceedings of the ICDM. IEEE, 9951000.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Ruder Sebastian, Bingel Joachim, Augenstein Isabelle, and Søgaard Anders. 2019. Latent multi-task architecture learning. In Proceedings of the AAAI. AAAI Press, 48224829.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Sener Ozan and Koltun Vladlen. 2018. Multi-task learning as multi-objective optimization. InProceedings of the NeurIPS.Google ScholarGoogle Scholar
  37. [37] Shazeer Noam, Mirhoseini Azalia, Maziarz Krzysztof, Davis Andy, Le Quoc, Hinton Geoffrey, and Dean Jeff. 2017. Outrageously large neural networks: The sparsely-gated mixture-of-experts layer. In Proceedings of the ICLR.Google ScholarGoogle Scholar
  38. [38] Shelke Mayuri S., Deshmukh Prashant R., and Shandilya Vijaya K.. 2017. A review on imbalanced data handling using undersampling and oversampling technique. IJRTER 3, 4 (2017), 444449.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Shen Qijie, Tao Wanjie, Zhang Jing, Wen Hong, Chen Zulong, and Lu Quan. 2021. SAR-Net: A scenario-aware ranking network for personalized fair recommendation in hundreds of travel scenarios. In Proceedings of the CIKM. 40944103.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Xiang-Rong Sheng, Liqin Zhao, Guorui Zhou, Xinyao Ding, Binding Dai, Qiang Luo, Siran Yang, Jingshan Lv, Chi Zhang, Hongbo Deng, and Xiaoqiang Zhu. 2021. One model to serve all: Star topology adaptive recommender for multi-domain ctr prediction. In CIKM. 4104–4113.Google ScholarGoogle Scholar
  41. [41] Sun Tianxiang, Shao Yunfan, Li Xiaonan, Liu Pengfei, Yan Hang, Qiu Xipeng, and Huang Xuanjing. 2020. Learning sparse sharing architectures for multiple tasks. In Proceedings of the AAAI, Vol. 34. 89368943.Google ScholarGoogle ScholarCross RefCross Ref
  42. [42] Sun Yang, Pan Junwei, Zhang Alex, and Flores Aaron. 2021. FM2: Field-matrixed factorization machines for recommender systems. In Proceedings of the WWW. 28282837.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Tang Hongyan, Liu Junning, Zhao Ming, and Gong Xudong. 2020. Progressive layered extraction (PLE): A novel multi-task learning (MTL) model for personalized recommendations. In Proceedings of the RecSys. 269278.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] Oord Aaron Van den, Dieleman Sander, and Schrauwen Benjamin. 2013. Deep content-based music recommendation. In Proceedings of the NeurIPS.Google ScholarGoogle Scholar
  45. [45] Vaswani Ashish, Shazeer Noam, Parmar Niki, Uszkoreit Jakob, Jones Llion, Gomez Aidan N., Kaiser Łukasz, and Polosukhin Illia. 2017. Attention is all you need. In Proceedings of the NeurIPS.Google ScholarGoogle Scholar
  46. [46] Wang Dongjing, Zhang Xin, Yin Yuyu, Yu Dongjin, Xu Guandong, and Deng Shuiguang. 2023. Multi-view enhanced graph attention network for session-based music recommendation. TOIS 42, 10 (2023), 1–30.Google ScholarGoogle Scholar
  47. [47] Wang Ruoxi, Fu Bin, Fu Gang, and Wang Mingliang. 2017. Deep & cross network for ad click predictions. In Proceedings of the ADKDD’17. 17.Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. [48] Wang Ruoxi, Shivanna Rakesh, Cheng Derek, Jain Sagar, Lin Dong, Hong Lichan, and Chi Ed. 2021. Dcn v2: Improved deep & cross network and practical lessons for web-scale learning to rank systems. In Proceedings of the WWW. 17851797.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Wang Xiang, He Xiangnan, Wang Meng, Feng Fuli, and Chua Tat-Seng. 2019. Neural graph collaborative filtering. In Proceedings of the SIGIR. 165174.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Wen Hong, Zhang Jing, Wang Yuan, Lv Fuyu, Bao Wentian, Lin Quan, and Yang Keping. 2020. Entire space multi-task modeling via post-click behavior decomposition for conversion rate prediction. In Proceedings of the SIGIR. ACM, 23772386.Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. [51] Xi Dongbo, Chen Zhen, Yan Peng, Zhang Yinger, Zhu Yongchun, Zhuang Fuzhen, and Chen Yu. 2021. Modeling the sequential dependence among audience multi-step conversions with multi-task learning in targeted display advertising. In Proceedings of the KDD.Google ScholarGoogle Scholar
  52. [52] Xiao Jun, Ye Hao, He Xiangnan, Zhang Hanwang, Wu Fei, and Chua Tat-Seng. 2017. Attentional factorization machines: Learning the weight of feature interactions via attention networks. In Proceedings of the IJCAI. 31193125.Google ScholarGoogle ScholarCross RefCross Ref
  53. [53] Xie Ruobing, Liu Yanlei, Zhang Shaoliang, Wang Rui, Xia Feng, and Lin Leyu. 2021. Personalized approximate pareto-efficient recommendation. In Proceedings of the WWW. ACM / IW3C2, 38393849.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. [54] Xu En, Yu Zhiwen, Guo Bin, and Cui Helei. 2021. Core interest network for click-through rate prediction. TKDD 15, 2 (2021), 116.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Yang Enneng, Pan Junwei, Wang Ximei, Yu Haibin, Shen Li, Chen Xihua, Xiao Lei, Jiang Jie, and Guo Guibing. 2023. AdaTask: A task-aware adaptive learning rate approach to multi-task learning. In Proceedings of the AAAI, Vol. 37. 1074510753.Google ScholarGoogle ScholarDigital LibraryDigital Library
  56. [56] Yu Dongjin, Wang Xingliang, Xiong Yu, Shen Xudong, Wu Runze, Wang Dongjing, Zou Zhene, and Xu Guandong. 2023. MHANER: A multi-source heterogeneous graph attention network for explainable recommendation in online games. TIST (2023).Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. [57] Yuan Guanghu, Yuan Fajie, Li Yudong, Kong Beibei, Li Shujie, Chen Lei, Yang Min, YU Chenyun, Hu Bo, Li Zang, Xu Yu, and Qie Xiaohu. 2022. Tenrec: A large-scale multipurpose benchmark dataset for recommender systems. In Proceedings of the NeurIPS Datasets and Benchmarks Track.Google ScholarGoogle Scholar
  58. [58] Zhang Qianqian, Liao Xinru, Liu Quan, Xu Jian, and Zheng Bo. 2022. Leaving no one behind: A multi-scenario multi-task meta learning approach for advertiser modeling. In Proceedings of the WSDM. 13681376.Google ScholarGoogle ScholarDigital LibraryDigital Library
  59. [59] Zhou Guorui, Zhu Xiaoqiang, Song Chenru, Fan Ying, Zhu Han, Ma Xiao, Yan Yanghui, Jin Junqi, Li Han, and Gai Kun. 2018. Deep interest network for click-through rate prediction. In Proceedings of the SIGKDD. 10591068.Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. [60] Jie Zhou, Xianshuai Cao, Wenhao Li, Lin Bo, Kun Zhang, Chuan Luo, and Qian Yu. 2023. Hinet: novel multi-scenario & multi-task learning with hierarchical information extraction. In 2023 IEEE 39th International Conference on Data Engineering (ICDE). IEEE, 2969–2975.Google ScholarGoogle Scholar
  61. [61] Zhu Jieming, Liu Jinyang, Yang Shuai, Zhang Qi, and He Xiuqiang. 2021. Open benchmarking for click-through rate prediction. In Proceedings of the CIKM. ACM, 27592769.Google ScholarGoogle ScholarDigital LibraryDigital Library
  62. [62] Zou Fangyu, Shen Li, Jie Zequn, Zhang Weizhong, and Liu Wei. 2019. A sufficient condition for convergences of Adam and RMSProp. In Proceedings of the CVPR. 1112711135.Google ScholarGoogle ScholarCross RefCross Ref
  63. [63] Zou Xinyu, Hu Zhi, Zhao Yiming, Ding Xuchu, Liu Zhongyi, Li Chenliang, and Sun Aixin. 2022. Automatic expert selection for multi-scenario and multi-task search. In Proceedings of the SIGIR. 15351544.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Multi-Scenario and Multi-Task Aware Feature Interaction for Recommendation System

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Knowledge Discovery from Data
          ACM Transactions on Knowledge Discovery from Data  Volume 18, Issue 6
          July 2024
          535 pages
          ISSN:1556-4681
          EISSN:1556-472X
          DOI:10.1145/3613684
          Issue’s Table of Contents

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 12 April 2024
          • Online AM: 6 March 2024
          • Accepted: 27 February 2024
          • Revised: 26 December 2023
          • Received: 28 July 2023
          Published in tkdd Volume 18, Issue 6

          Check for updates

          Qualifiers

          • research-article
        • Article Metrics

          • Downloads (Last 12 months)233
          • Downloads (Last 6 weeks)160

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        View Full Text