Predicting user click behavior and making relevant recommendations based on
the user's historical click behavior are critical to simplifying operations and
improving user experience. Modeling UI elements is essential to user click
behavior prediction, while the complexity and variety of the UI make it
difficult to adequately capture the information of different scales. In
addition, the lack of relevant datasets also presents difficulties for such
studies. In response to these challenges, we construct a fine-grained
smartphone usage behavior dataset containing 3,664,325 clicks of 100 users and
propose a UI element spatial hierarchy aware smartphone user click behavior
prediction method (SHA-SCP). SHA-SCP builds element groups by clustering the
elements according to their spatial positions and uses attention mechanisms to
perceive the UI at the element level and the element group level to fully
capture the information of different scales. Experiments are conducted on the
fine-grained smartphone usage behavior dataset, and the results show that our
method outperforms the best baseline by an average of 10.52%, 11.34%, and
10.42% in Top-1 Accuracy, Top-3 Accuracy, and Top-5 Accuracy, respectively