1. Introduction

Music, if conceived as a common heritage of humanity, is a heterogeneous mixture of creative processes taking shape in different historical, cultural, and societal contexts. In everyday life, cultural differences are experienced when interactions among cultures appear, leading the individual to elaborate the notion of self (what is familiar) in contrast to the other (what is different) (). As discussed by Grenier (), throughout history the concept of diversity has evolved and its evolution has been fundamental in shaping relationships between different musical traditions. Whilst fields such as Musicology, Music Cognition, or Psychology and Sociology of Music have a long tradition of questioning the significance of cultural diversity (), the younger field of Music Information Retrieval is still in its early steps in addressing similar questions, and in translating such knowledge through the design of MIR technologies accountable for social values, like diversity ().

Diversifying MIR is a goal to be accomplished by understanding the multidimensionality inherent in both music and human nature. Aspects such as the diversity of the teams engaged in the design and development of MIR systems, the diversity of musical works and their creators, how to diversify tools to help MIR practitioners address cultural differences, and who and how is benefiting from the diversification strategies, and who is not, are part of the challenges described by Born (). Those challenges are similarly identified in the broader field of Artificial Intelligence (AI), the parent of concepts such as ‘Music Intelligence’ and ‘AI Music’ (), a field in which we are already witnessing a diversity crisis, for instance with regards to the workforce involved in the design of AI systems (), or the academic community participating in AI conferences ().

In this overview, we explore the literature related to diversity in Music Recommender Systems. Rather than focusing on the comparison of the works trying to identify ‘good’ or ‘bad’ practices, we choose to connect them arguing about if and how diversity has been embedded in Music RS, and what could be the consequences of specific designs and implementations. In order to do this, in Section 2 we take as reference the field of Information Technology, to which RS belong. We examine how diversity can be included in the design process of such technology in a principled and comprehensive manner (), creating architectures which may help people in making diverse choices (), the idea defining the diversity by design approach. Until now, RS research has mostly focused on the improvement of performance by developing more and more sophisticated techniques, but the impact of these technologies is still underexplored (). As highlighted by Salamon (), music recommendation research can be considered among the few exceptions in MIR in which there is an effective connection between research and end-users. This leaves room for questioning how such research and resulting technology are benefiting, or not, those who are actively engaged in its consumption. In Section 3, we review the diversity-related Music RS literature, the core part of this overview. In today’s digital spaces, listening experiences cannot be imagined without considering the widespread use of streaming services, in which Music RS play a crucial role in helping people find what they want to listen to, but also in driving them by proposing music when they do not know what to choose (). The mere fact that streaming services in 2019 generated almost half of the global revenues in the music industry should be enough to understand the potential impact of Music RS, at least from a commercial perspective. In Section 4 we identify future challenges for the design of diversity-aware Music RS. In particular, we present findings from different disciplines which can provide to MIR practitioners new perspectives for integrating diversity not exclusively from a computational viewpoint. Lastly, we draw conclusions in Section 5.

At the time of writing, surveys on music recommendation diversity have not yet been presented, but in the following we present related surveys that may help the reader in deepening aspects not fully covered here. Castells et al. () review evaluation procedures, algorithmic solutions, and empirical results connected with the notions of diversity and novelty in RS research. Diversity-related metrics are also discussed by Kaminskas and Bridge (), together with other beyond-accuracy objectives proposed in the literature. Kunaver and Požrl () present an overview of RS diversification techniques, focusing both on algorithmic solutions and evaluation practices. From a wider perspective, Drosou et al. () discuss the role of diversity in Big Data applications, focusing on the selection task. These surveys comprehensively treat most of the algorithmic approaches proposed in the diversity-related literature, applicable as “off-the-shelf” methods also in MIR, and therefore we encourage their reading for practitioners interested in exploring technological solutions.

Instead, the goal of this overview is threefold: 1) to present the diversity by design paradigm in IT, and its implications for Music RS research; 2) to review the MIR literature discussing proposed approaches considering diversity in Music RS; 3) to identify open challenges for the design of diversity-aware music recommendations.

2. Diversity by Design in IT

Increasing attention to value sensitive design () in IT, and similarly in AI, emerged with the widespread introduction of such technologies in our daily life, and search engines and recommender systems are tangible proof (). Questions about a spectrum of topics, such as ethics, autonomy, fairness, or bias, revived the debate around practices of embedding human values and attributes in machines (). However, its relevance in the contemporary social system is undoubtedly increased as a consequence of the prominence and ubiquity of those technologies, nowadays central parts of our lives (; ). Throughout this overview, we consider diversity as the core principle that we want to incorporate in the value-sensitive design of Music RS. Indeed, preserving and supporting the multitude of musical languages and artistic expressions created and experienced by people all around the world is one of the goals that IT should pursue in the music domain ().

The conceptualisation and measurement of diversity have been the object of study of a broad range of disciplines (; ; ), and to identify a global framework for measuring music recommendation diversity is out of the scope of this overview. Several diversity indexes have been formulated to describe different kinds of populations, among which most of them fall within the category of the so-called dual-concept diversity (). Such indexes make use of two dimensions: variety, the number of categories in a population, and balance, representing the evenness of elements’ distribution across categories. For example, in a set of tracks classified with regards to their music genre, the variety is the number of genres within the set, while the balance is how tracks are distributed among the genres. Among the others, the Shannon, Simpson, and Herfindahl indexes fall within this category, widely used also in the MIR literature.

One of the main drawbacks of applying the dual-concept logic in the music domain is its frequentist definition of diversity, where conclusions are built only by observing the distribution of the data. Indeed, by exclusively using the variety and balance between elements and categories of a set, additional information about the nature of categories is discarded. Again in the case of tracks classified by genres, having a set of tracks half Blues and half Rock, and another one split equally into Blues and Electronic tracks, using a dual-concept diversity index the two sets may appear equally diverse in terms of track genre, while musically speaking in the former set we imagine that tracks could be less diverse, the two genres being closer than in the latter case. To overcome this issue, further dimensions describing differences between categories can be considered, as in the case of disparity for the Rao-Stirling index ().

From a wider perspective, the diversity by design paradigm is not just a matter of choosing the right metric, instead it is ‘[…] the idea that it is possible to create an architecture or service that helps people to make diverse choices’ (). Diversity interplays in different aspects of the design process of information systems, and in the next section we deepen the role of those in this process. To facilitate the reading, in Table 1 we summarise the concepts discussed in the overview.

Table 1

Summary of terms and definitions presented in the overview.


Terms & DefinitionsReference(s)

Cultural diversity: the uniqueness and plurality of the identities of the groups and societies making up humankind.UNESCO ()
Huron ()

Dual-concept diversity: measurement of diversity based on the variety and balance of the elements of a population divided into categories.
Variety: number of categories in a population.
Balance: distribution of elements over the categories of a population.
Disparity: differences between categories of a population.
McDonald and Dimmick ()
Stirling ()

Diversity by design: the creation of an architecture or service that helps people to make diverse choices.
Source diversity: the range of information providers.
Content diversity: the range of information provided.
Exposure diversity: the range of information accessed by people.
Individual autonomy perspective: provide people with a tool for exploiting their different interests.
Deliberative perspective: promote public awareness by showing divergent opinions.
Adversarial perspective: enhance the visibility of underrepresented opinions.
Napoli ()
Helberger ()
Helberger el al. ()
Loecherbach et al. ()

Diversity-aware RS: recommender systems designed to diversify the users’ experience.
Item diversity: the range of items recommended by a RS.
User diversity: the range of users interacting with a RS.
(User) behavioural diversity: the range of items accessed by the users.
(User) perceived diversity: the item diversity as perceived by the users.
Castells et al. ()
Kaminskas and Bridge ()
Kunaver and Požrl ()

2.1 Deconstruction, purpose and impact

Approaching diversity from an information perspective, a first step is to understand how to deconstruct this concept. Napoli () identifies three components cooperating in the design of IT: 1) source diversity, aspects related to the information providers; 2) content diversity, describing the composition of the information accessible to users; 3) exposure diversity, identifying what content users access in contrast to what is available. In the case of Music RS, content diversity can refer to the catalogue from which recommendations are provided, source diversity to the artists or record labels providing such catalogue, and exposure diversity relates to the recommendations that listeners eventually consume.

Secondly, we can identify the purpose of introducing the diversity by design approach. In broader terms, the goal is building systems that can guarantee people to be aware of the range of accessible information (). However, the motivation behind this choice may be not unique. Helberger et al. () identify three perspectives: individual autonomy, deliberative and adversarial. Under an individual autonomy perspective, the idea is to give individuals a tool to exploit their different interests. In this case, we imagine Music RS helping people in diversifying the listening experience, broadening the possible choices with regards to their music preferences. Pursuing a deliberative perspective, the aim is to promote the public debate, showing divergent opinions and helping people in constructing a critical view. Here, Music RS can be designed to make listeners explore music far from their preferences, to make them aware of the unknown parts of the musical panorama. With an adversarial perspective, the focus is to broaden the debate highlighting non-dominant visions. Similar to the previous case, Music RS can serve as a way to promote underrepresented groups, whether subcultures or non-mainstream musical styles, under a non-hegemonic view.

Finally, we want to understand the consequences of the presence of diversity, or the lack thereof. Positive benefits of implementing diversity policies can be several, starting from fostering innovation and creativity in the workplace (), to promote equality in access to knowledge and freedom of expression (). Such benefits hold true in the MIR field. Where diversity is lacking, damaging effects having a negative impact on people and society have already been found in IT. Among those, the phenomenon of being continuously over-exposed to content that fits our interests, named the filter bubble by Pariser (), is probably the most researched and discussed both within and outside the academic community. Similarly, echo chambers have been identified where technologies exacerbate the tendency to relate mainly with people with like-minded opinions (). From a societal view, balkanisation refers to the fragmentation of digital spaces into different communities based on their interests ().

Under this lens, it is possible to identify the role that Music RS play in determining the exposure to music, and how most of the research until now has focused on empowering exposure diversity under an individual autonomy perspective (). This may be linked with the emergence of filter bubbles and echo chambers created by Music RS, wherein adversarial or deliberative perspectives could help in alleviating such negative impact valorising underground artists, as recently explored by Kowald et al. ().

Whilst the areas in which such phenomena have been studied range from political views (), access to news (), social data (), and cultural products (), in the field of MIR they are still underexplored. Naturally, ethical considerations on the misuses of Music RS have already emerged in the MIR community (), among which the phenomenon of popularity bias and the underrepresentation of niche artists — the so-called long-tail — is possibly one of the most studied (). Nonetheless, several questions open the way to novel musically motivated analysis. What is the role of music recommendation diversity in shaping the listeners’ experience? What are the implications of the emergence of diversity-related phenomena (such as filter bubbles, echo chambers, or balkanisation) on people’s musical preferences? In the next section, reviewing the literature of diversity in Music RS we aim to understand at what stage the MIR research has contributed to this analysis.

3. Music Recommendation Diversity

Interactions between users and items are traditionally the main core of RS research, and Music RS does not differ in this aspect (). Inspired by studies on the semiology of music (; ), we can map the two elements of RS research, users and items, to two distinguishable domains, interdependent and both influential on the nature of music: the Poietic and the Esthetic domain. The Poietic domain (from Greek: poiētikós, ‘creative’) includes the works’ creative processes, influenced by aspects such as the composer formation, musical theories, or the historical context in which the work is created. The Esthetic domain (from Greek: aísthēsis, ‘perception’) comprehends the aspects related to the listeners, hence their musical background, the historical situation, their perception of the musical work, and their musical knowledge. Reviewing the literature of Music RS research, we then present studies related to diversity mapping items and users to such domains (Figure 1).

Figure 1 

Mind map of elements constituting Music RS diversity. Behavioural diversity, for instance represented by listening events, is measured when users access the information provided by the items (exposure, Section 2.1). These connection points rely on one side on the item diversity (Section 3.1), built on content and source item features, and on the other side on user diversity (Section 3.2), with regards to their characteristics. Additionally, perceived diversity (Section 3.2.1) creates a bridge between the Esthetic and Poietic domains.

3.1 Poietic domain – the item side

Music RS can be designed to recommend different categories of items, such as artists (e.g. ) or tracks (e.g. ). Several works in the RS diversity-related literature make use of Listening Event (LE) datasets, representing the interactions between items and users, for validating models and techniques by means of empirical analysis. In particular, data from the online music service Last.fm is a widely used resource for many in the RS community (e.g. ; ; ; ). Nonetheless, focusing on the MIR literature, we can have a more detailed understanding of how item diversity has been approached in the music domain.

A first line of research identifies diversity as the count of different items with which users interact, averaged and aggregated with different logics which often can be traced back to the dual-concept diversity. Schedl and Hauger () use the number of listened-to tracks and their musical genres as a proxy to characterise users’ musical taste in terms of diversity. A similar logic is proposed by Ferwerda and Schedl (), where diversity of listening behaviours is analyzed at a country-level. Again considering the country, Liu et al. () measure the diversity by analyzing the distribution of artists’ listening counts. The advantages of using such approaches rely on their not complex formulation and relatively simple implementation, and in addition they can be computed using only the listening events, eventually with artist or genre metadata. The main drawback is that they do not use any additional features to differentiate between items, risking to oversimplify the nature of concepts such as music genres (see Section 4.1).

A second line of work, building on top of the distribution of the user-item interactions, makes use of distance spaces containing additional information to diversify the items. For example, Park et al. () and Way et al. () use the Rao-Stirling diversity index, where a further dimension representing the closeness between items, genres in the former case and artists in the latter, is computed thanks to a co-consumption matrix. Porcaro and Gómez () build an embedding space modeling user-generated tags to compute the diversity of a playlist. Similarly, Anderson et al. () create a song-embedding space using user-generated playlists, from which a diversity score is derived. Although these data-driven approaches are able to compute items’ fine-grained features for estimating diversity going beyond the dual-concept logic, it is also true that they are generally expensive in terms of data and computational resources.

Furthermore, approaches based on latent features extracted using matrix factorisation (MF) techniques have been proposed. For instance, Ferwerda et al. () and Robinson et al. () compute diversity using the Euclidean distance between item vectors in the MF space. A great advantage of these methods is that they require only the user-item interaction matrix, however the little interpretability of the latent space makes it difficult to understand what are the item characteristics that determine the diversity. Alternative approaches using entropy-related metrics can be found, for instance using Shannon entropy (; ), and the Herfindahl-Hirschman index (; ).

What most of the aforementioned works share is their common perspective of measuring some sort of item diversity connected with the users’ behaviours, focusing mostly on exposure diversity (Table 2). Content and source diversity instead are considered mainly in works centered on the analysis of music lists (e.g. playlists, recommendation lists, sessions), where however the user is often left aside (Table 3). Grouping users by their diversity is intended as grouping them by the diversity of the items they consumed, and in this behavioural perspective several important aspects related to the listener, the end-user of Music RS, are neglected as discussed in the next section.

Table 2

List of works analyzing users’ behavioural diversity in the music domain, presented in chronological order.


ReferenceDiversity metric definition(s)
Dataset(s)

Farrahi et al. ()• Number of unique genres associated with the artists listened to by a user.
MMTD ().
Schedl and Hauger ()• Users’ average track listening frequency; number of distinct track genres.
Last.fm LEs.
Ferwerda et al. ()• Aggregation of each user’s listening history by artist and genre.
LFM-1b ().
Ferwerda and Schedl ()• Overall volume of genre occurrences; relative listening volume exceeding one per mille; Shannon index computed over artist genre.
LFM-1b ().
Park et al. ()• Rao-Stirling index computed over artist genre.
Last.fm users’ top artists.
Datta et al. ()• Log number of unique artists, songs, and genres listened to; number of unique top artists in a user’s geographic region divided by the number of unique artists listened to over the same time period; Herfindahl index computed over a user’s weekly plays.
Spotify LEs.
Wang et al. ()• Ratio of unique artists in a user’s playlists over all the artists listened to by the user; same ratio computed over artist genre.
Last.fm 1K ().
Li et al. ()• Hill-type true diversity (Rao-Stirling index) computed over album genre.
Xiami LEs.
Way et al. ()• Rao-Stirling index computed over artist genre.
Spotify LEs.
Poulain and Tarissan ()• Herfindahl-Hirschman index computed over tripartite graphs (users, tracks, tags).
MSD (); Amazon Dataset ().
Anderson et al. ()• Average cosine similarity between a track embedding and the average of the user’s track embeddings.
Spotify LEs.
Kowald et al. ()• Cosine similarity computed over the users’ track genre distributions.
LFM-BeyMS (), subset of LFM-1b ().

Table 3

List of works analysing item diversity in the music domain, presented in chronological order. We refer to Ziegler et al. () for the formula of the Intra-List Diversity (ILD).


ReferenceDiversity metric definition(s)
Dataset(s)

Slaney and White ()• Distribution of points in an 11-dimensional genre space computed over tracks’ acoustic features.
WebJay playlists.
Ferwerda et al. ()• ILD using Euclidean distance computed over the latent factor of item-user matrix factorisation.
Last.fm LEs; LFM-1b ().
Lu and Tintarev ()• ILD computed over weighted combinations of several diversity degrees for different attributes (release time, artist, genre, tempo, key).
Spotify users’ preferred songs;
Echo Nest Taste Profile Subset ().
Porcaro and Gómez ()• ILD using cosine distance computed over track tag embeddings.
Art of the Mix playlists ();
Yes.com radio playlists ();
MMTD (); Deezer users’ playlists.
Knees and Hübler ()• Simpson index computed over tracks’ record labels.
MPD ().
Robinson et al. ()• ILD using Euclidean distance computed over the latent factor of item-user matrix factorisation.
Last.fm LEs.
Jin et al. ()• ILD using Jaccard Index computed over track genre.
Spotify users’ recommendations.

3.2 Esthetic domain – the user side

Understanding music listeners is a hard problem, due to the multifaceted nature on one side of human behaviour, and on the other side of the act of listening to music. This problem was neglected by the MIR community in its early stages (; ), but more awareness of it is emerging recently (). As a starting point, in line with Knees et al. (), it is important to separate individual from collective aspects, elements interconnected but subject to different praxes. We refer to collective aspects when referring to aspects of the music listening shared among people belonging to a specific group, be it built on ethnic, geographical, generational, or other criteria.

3.2.1 Individual aspects

Scholars in the field of psychology of music have addressed for decades the study of the role of listening to music in people’s lives (). The several functions identified with this act, which juxtaposed bring out its ubiquity, are symptomatic of the adversities emerging while designing Music RS adaptable to different roles: entertainment, identity formation, escapism, mood management, self-determination, and social differentiation, just to mention a few ().

Among the aspects characterising individuals’ diversity needs while interacting with RS, personality traits have emerged as a focal feature for differentiating behaviours, largely investigated by the MIR community (, ; ; ). In these works, the five-factor model proposed by McCrae and John () is a commonly accepted taxonomy which groups personality traits in five main dimensions: Extraversion, Agreeableness, Conscientiousness, Neuroticism, and Openness to Experience. Analyzing the correlation between these factors and music preferences, researchers have highlighted points of intersection between personal traits and the demand to diversify the listening experience. New directions have also been explored concerning the relationship between musical taste and personal values (). An important outcome of these studies is the differentiation between metric-based diversity, as measurement based on designed features extractable by algorithmic processes, and perceived diversity, hence how people evaluate a degree of diversity based on their personality, background, and beliefs (see Section 4.2).

3.2.2 Collective aspects

Acknowledging music as a social phenomenon entails the understanding of the overlap between individual practices and collective habits. Analyzing the interactions between people and musical objects, it is possible to observe a dual structure, where social groups are formed based on shared interests and taste, and parallelly genres and subcultures are dependent on their publics (). This intuition is at the core of several recommendation algorithms belonging to Collaborative Filtering (CF) methods, one of the widespread frameworks in the RS panorama (). The idea that similar users like similar items simplifies sociological aspects of group formations, but still is no stranger to social phenomena. Not surprisingly, and in line with Bourdieu’s view that ‘nothing more clearly affirms one’s “class”, nothing more infallibly classifies, than tastes in music’ (), one among the first information filtering systems based on social information was a personalised Music RS named Ringo ().

A huge limitation when studying collective aspects in Music RS research is the lack of data available to perform diversity analysis (see Section 4.3). Indeed, when characterising groups of listeners, often only country-related information can be exploited. Several examples of cross-country analysis can be found in the MIR literature. For instance, Ferwerda et al. () use Hofstede’s cultural dimensions () to investigate users’ diversity needs across countries. Liu et al. (), along with such dimensions, consider economic and linguistic diversity when modelling distances between users’ country of origin. Alternatively, a characterisation of users’ diversity based on socio-economic factors is presented by Park et al. (). Nonetheless, the use of the country’s information as a proxy for classifying individuals can misrepresent the idea of culture with national culture, stigmatising aspects which however are not representative of multicultural environments (). An alternative approach of using country information has been proposed by Schedl et al. (), where country archetypes are created based on listening preferences.

4. Challenges and Research Gaps

We have presented what so far have been the research directions in which diversity has been investigated in the Music RS field. Most of the work has focused on establishing methods to estimate the diversity of users’ behaviours when interacting with recommendations, using as a proxy the diversity of the items consumed. Few works have also considered how recommendations interplay with the diversity of users’ characteristics, whether at an individual level such as personality traits, or at a collective level such as country of origin. Nonetheless, interdisciplinary perspectives while designing Music RS are often overlooked, a trend already highlighted by Laplante (). This motivates us to provide three examples wherein undertaking an interdisciplinary attitude could help in designing diversity-aware Music RS systems.

4.1 Item diversity and music classification

An aspect to consider when dealing with the measurement of item diversity in music RS research is the long-standing debate about the classification of music and culture (; ; ). Despite the dynamical, intrinsic, ambiguous, and context-dependent nature of concepts such as genre and style (; ), they have been historically used by the MIR community in different frameworks, mainly while presented in the form of tags (). However, when represented as tags, genres and styles are often deprived of meaningful historical and societal characteristics. Making such abstract concepts understandable by a machine is still an open question in MIR, and with current methods may still prove elusive, as observed by Sturm () for the task of Music Genre Recognition. For addressing this challenge, alternative approaches to the use of a fixed taxonomy could be considered, as done by Vlegels and Lievens () and Way et al. (), where the classification of items is based on the analysis of listeners’ behaviour, enhancing the duality of cultural networks ().

4.2 User diversity and the musical Self

Contextualising the relationships between individual listening experience and Music RS, two facets of these technologies can be identified following the work done by Foucault (): on one side as technologies of power, influencing the behaviours of individuals, on the other as technologies of the self, providing a tool for transforming oneself. What are the expectations when receiving recommendations according to the image we have about our musical Self? Diversity here plays a key role because the urge to diversify can emerge simply by being exposed to such systems, affecting how we behave. As observed by a https://www.Last.fm user reflecting on her listening habits ():

Last.fm has changed me. Made me too self-conscious of my listening habits. Before, I’d play the same artist for days and days, but now I constantly struggle to diversify. I recently made a playlist called “diversify!”[…].

Under this lens, while designing Music RS to consider the dichotomy proposed by Roth (), where algorithmic influence on individuals’ behaviours is classified into read our mind and change our mind processes, can lead to a deeper understanding of the interactions between listeners and Music RS.

4.3 User diversity and social background

Defining the relationships between social groups and musical tastes, Bourdieu’s perspective in Distinction, and the so-called omnivore thesis by Peterson can be considered as two well-established theories evidencing underlying mechanisms of social interactions with cultural objects (; ). Bourdieu’s work focuses on the analysis of taste formation and definition in relation to social status, showing how economic, cultural, and social capitals play a central role in these processes. A decade later, Peterson presented the omnivore-univore model to describe audience segmentation in the USA during the early 1990’s. In his model, omnivore refers to consumption habits of high-status participants characterised by a tendency to appreciate a wide variety of cultural products, while univore represents the part of the population used to consume few specific categories of cultural products.

The debate around such theories is still active (see Coulangeon and Lemel (); Atkinson ()), but both converge on the idea that the diversity of listening habits cannot be detached from the diversity of the social and cultural background in which people build their own experiences. In this respect, the work by Park et al. () is a notable example of using socio-economic information to characterise the listeners’ background, which goes beyond the cross-country analysis often pursued by MIR scholars.

5. Conclusions and Path Forward

Although we have some intuitions of how diversity can be treated when interacting with music recommendations, the lack of a bigger picture in which to frame such diversity analysis is a gap that we are witnessing (). To imagine how diversity can be included as a design principle for the next generation of Music RS is an open debate for the MIR community, to which we aim to contribute with this overview. In the future, we foresee the definition of a set of practical indications to follow when approaching diversity in Music RS research.

As a starting point, we believe that it is critical for practitioners to explain how diversity is inferred while designing diversity-aware Music RS. It could be done by identifying which components are investigated (source, content, exposure), which perspectives are undertaken when designing algorithmic procedures (individual autonomy, deliberative, or adversarial), but also what impact, being it positive or negative, could we expect by the introduction of a specific design. This latter aspect is still underexplored in MIR research, and having a deeper understanding of the dynamics of the interaction between listeners and RS is without doubt a core issue. The increasing interest by the RS community in simulation-based frameworks and longitudinal studies paves the way to new findings which can be applied in the future design of Music RS (). Furthermore, it is also important to investigate how different dimensions of diversity correlate when people interact with Music RS. Could we guarantee that the diversity of the items is not influential on the diversity of the users, and vice versa? The emergence of streaming services which target a music genre (e.g. IDAGIO for classical music), or developed in a specific region of the world (e.g. Anghami in the Middle East), poses new questions about how Music RS can be designed in scenarios wherein a globalist vision could fail in representing the peculiarities of artists and listeners.

In conclusion, we share the call for an interdisciplinary effort made by Born (), a necessary step to escape from the technological solutionism which has partly driven the Music RS research roadmap until now, a road which however may be full of traps (; ).