Gender Identification of Mobile Phone Users based on Internet Usage Pattern

Document Type : Original Article

Authors

Faculty of Computer Engineering, University of Isfahan, Azadi Sq., Hezarjarib St., Isfahan, Iran

Abstract

Gender is an important aspect of a person's identity. In many applications, gender identification is useful for personalizing services and recommendations. On the other hand, many people today spend a lot of time on their mobile phones. Studies have shown that the way users interact with mobile phones is influenced by their gender. But the existing methods for identify the gender of mobile phone users are either not accurate enough or require sensors and specific user activities. In this paper, for the first time, the internet usage patterns are used to identify the gender of mobile phone users. To this end, the interaction data, and specially the internet usage patterns of a random sample of people are automatically recorded by an application installed on their mobile phones. Then, the gender identification is modeled using different machine learning classification methods. The evaluations showed that the internet features play an important role in recognizing the users gender. The linear support vector machine was the superior classifier with the accuracy of 85% and F-measure of 85%.

Keywords


  1. Daneshvar, S. and Inkpen, D., "Gender identification in twitter using n-grams and lsa.", in Proceedings of the Ninth International Conference of the CLEF Association (CLEF 2018), (2018).
  2. Giannakopoulos, O., Kalatzis, N., Roussaki, I. and Papavassiliou, S., "Gender recognition based on social networks for multimedia production." in 2018 IEEE 13th Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2018 - Proceedings. Institute of Electrical and Electronics Engineers Inc, (2018).
  3. Jain, A. and Kanhangad, V., "Gender recognition in smartphones using touchscreen gestures." Pattern Recognition Letters, Vol. 125, (2019), 604-611. https://doi.org/10.1016/j.patrec.2019.06.008
  4. Jain, A. and Kanhangad, V,, "Gender classification in smartphones using gait information." Expert Systems & Applications, Vol. 93, (2018), 257-266. https://doi.org/10.1016/j.eswa.2017.10.017
  5. Meena, T. and Sarawadekar, K., "Gender recognition using in-built inertial sensors of smartphone.", in IEEE Region 10 Annual International Conference, Proceedings/TENCON. Institute of Electrical and Electronics Engineers Inc., (2020), 462-467.
  6. Nguyen-Quoc, H. and Hoang, VT., "Gender recognition based on ear images: A comparative experimental study.", in 2020 3rd International Seminar on Research of Information Technology and Intelligent Systems, ISRITI 2020. Institute of Electrical and Electronics Engineers Inc., (2020), 451-456.
  7. Jain, A. and Kanhangad, V., "Investigating gender recognition in smartphones using accelerometer and gyroscope sensor readings", in 2016 international conference on computational techniques in information and communication technologies (ICCTICT). IEEE, (2016), 597-602.
  8. Sarraute, C., Blanc, P. and Burroni, J., "A study of age and gender seen through mobile phone usage patterns in mexico.", in 2014 IEEE/ACM international conference on advances in social networks analysis and mining (ASONAM 2014). IEEE, (2014), 836-843.
  9. Choi, Y., Kim, Y., Kim, S., Park, K. and Park, J., "An on-device gender prediction method for mobile users using representative wordsets". Expert Systems & Applications, Vol. 64, (2016), 423-433. https://doi.org/10.1016/j.eswa.2016.08.002
  10. Miguel-Hurtado, O., Stevenage, S. V., Bevan, C. and Guest. R. "Predicting sex as a soft-biometrics from device interaction swipe gestures." Pattern Recognition Letters, Vol. 79, (2016), 44-51
  11. Ramirez-Correa, P.E., Rondan-Cataluña, F.J. and Arenas-Gaitán, J. "Predicting behavioral intention of mobile Internet usage." Telemat Informatics, Vol. 32, (2015), 834-841. https://doi.org/10.1016/j.tele.2015.04.006
  12. Su, W., Han, X., Jin, C., Yan, Y. and Potenza, M.N., "Are males more likely to be addicted to the internet than females? A meta-analysis involving 34 global jurisdictions." Computers in Human Behavior, Vol. 99, (2019), 86-100. https://doi.org/10.1016/j.chb.2019.04.021
  13. Okazaki, S. and Hirose, M., "Does gender affect media choice in travel information search? On the use of mobile Internet.", Tourism Management, Vol 30, (2009), 794-804. https://doi.org/10.1016/j.tourman.2008.12.012
  14. Vashisth, P. and Meehan, K., "Gender classification using Twitter text data.", in 2020 31st Irish Signals and Systems Conference (ISSC). IEEE, (2020), 1-6.
  15. Piot-Perez-Abadin, P., Martín-Rodilla, P. and Parapar, J., "Experimental analysis of the relevance of features and effects on gender classification models for social media author profiling." in ENASE., (2021), 103-113.
  16. Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q. and Liu, T.Y., "Lightgbm: A highly efficient gradient boosting decision tree" Advances in Neural Information Processing Systems, Vol. 30, (2017).
  17. Kowsari, K., Heidarysafa, M., Odukoya, T., Potter, P., Barnes, L.E. and Brown, D.E., "Gender detection on social networks using ensemble deep learning.", in Proceedings of the Future Technologies Conference. Springer, (2020), 346-358.
  18. Safara, F., Mohammed, A.S., Potrus, M.Y., Ali, S., Tho, Q.T., Souri, A., Janenia, F. and Hosseinzadeh, M., "An author gender detection method using whale optimization algorithm and artificial neural network.", IEEE Access, Vol. 8, (2020), 48428-48437
  19. Sadeghian, A. and Kaedi, M.. "Happiness recognition from smartphone usage data considering users’ estimated personality traits.", Pervasive and Mobile Computing, Vol. 73, (2021), 101389
  20. James, G., Witten, D., Hastie, T. and Tibshirani, R., An introduction to statistical learning, Springer, (2013).
  21. Breiman, L., Classification and regression trees, Wadsworth Stat Ser 358, 1984.
  22. Han, J., Pei, J. and Kamber, M., Data mining: concepts and techniques, Elsevier, 2011.
  23. Jo, T., Machine Learning Foundations: Supervised, Unsupervised, and Advanced Learning, Springer Nature, (2021).
  24. Yadav, S. and Shukla, S., "Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification.", in 2016 IEEE 6th International conference on advanced computing (IACC). IEEE, (2016), 78-83.
  25. Hamidi, H. and Rafebakhsh, M. S. “Analyzing factors influencing mobile social media marketing acceptance among customers.”, International Journal of Engineering, Transactions C: Aspects, Vol. 35, No. 6, (2022), 1209-1216. DOI: 10.5829/IJE.2022.35.06C.13
  26. Yavari, A., Hassanpour, H., Rahimpour Cami, B. and Mahdavi, M. “Election Prediction Based on Sentiment Analysis using Twitter Data.” International Journal of Engineering, Transactions B: Applications, Vol. 35, No. 2, (2022), 372-379. DOI: 10.5829/IJE.2022.35.02B.13
  27. Jaderyan, M. and Khotanlou, H. “Automatic hashtag recommendation in social networking and microblogging platforms using a knowledge-intensive content-based approach.”, International Journal of Engineering, Transactions B: Applications, Vol. 32, No. 8, (2019), 1101-1116. doi: 10.5829/ije.2019.32.08b.06
  28. Yu, L. and Shek, D. T. L.. “Internet addiction in Hong Kong adolescents: A three-year longitudinal study.” Journal of Pediatric and Adolescent Gynecology, Vol. 26, No. 3, (2013), 10-17. https://doi.org/10.1016/j.jpag.2013.03.010.
  29. International Telecommunication Union “ICT facts and figures 2016”. (2016), from https://www.itu.int/en/ITU-D/Statistics/Documents/facts/ICTFactsFigures2016.pdf.
  30. Wang, Z., Hu, Y., Zheng, H., Yuan, K., Du, X. and Dong, G. “Females are more vulnerable to Internet gaming disorder than males: Evidence from cortical thickness abnormalities.” Psychiatry Research: Neuroimaging, Vol. 283, (2019), 145-153. https://doi.org/10 .1016/j.pscychresns.2018.11.001.
  31. Hafkin, N. J. and Huyer, S. “Women and gender in ICT statistics and indicators for development.” Information Technologies and International Development, Vol. 4, No. 2, (2007), 25-41.
  32. Becker, J. B., McClellan, M. and Reed, B. G. “Sociocultural context for sex differences in addiction.” Addiction Biology, Vol. 21 No. 5, (2016), 1052-1059. https://doi.org/10.1111/adb.12383.
  33. Jang, M. H. and Ji, E. S. “Gender differences in associations between parental problem drinking and early adolescents' Internet addiction.” Journal for Specialists in Pediatric Nursing, Vol. 17, No. 4, (2012), 288-300. https://doi.org/10.1111/j.1744-6155.2012.00344.x