[Feedback needed] ListenBrainz statistics proposal

This thread is a companion to Statistics in ListenBrainz and the GSoC 2017 proposal. However, this proposal focuses on what statistics to collect over the technical implementation.

You can leave comments in this Google Docs page too!

Summary

Implement new statistics to calculate insightful data on the listens submitted to ListenBrainz, to make the platform more competitive to other solutions and more interesting to the user

Background

Currently, ListenBrainz collects two types of statistics:

  • Sitewide (code)
  • Individual user (code)

Sitewide statistics

ListenBrainz calculates a single statistic for the entire site:

  • Total number of artists submitted by all users

User statistics

ListenBrainz calculates the following statistics for users:

  • Top recordings over a time interval
  • Top artists over a time interval
  • Top releases over a time interval
  • Number of artists listened to over a time interval

Why now?

ListenBrainz does not yet collect many statistics. In the future, this will become an important focus for the platform to compete with other proprietary services and provide insightful information to the user.

Details

Two types of statistics were gathered for this proposal. Some are already implemented on other third-party sites, such as Last.fm, and others are new ideas that are not implemented by other competing, proprietary services.

Note that “entities” refers to artists, releases, and recordings.

Third-party statistics

Ideas for new statistics from third-party sites include…

  • Time spent listening: Total time spent listening to music over a time period
  • User percentiles for entities: Compare your listens for different entities compared to other users on the site (e.g. 80th percentile for number of artists listened to in a time period)
  • Top tags from entities: See patterns in tags from listens (i.e. pulling tags from MusicBrainz entities)
  • Listening clock: See when a user listened to the most music at what time of day over a time period
  • New discoveries: Highlight new entities that a user has not listened to before
  • Musical matches: See other users who listened to similar entities as you
  • Mainstream meter: Percentage of how many other users shared your listens
    • Example: If every user on the site listened to one artist, and a user only listened to that artist over a time period, their “mainstream meter” score would be 100%

New statistics

New statistic ideas not yet implemented by a competing service include…

  • Revivals: Highlight when a user listens to an entity that they haven’t listened to since a period of time
  • Streaks: Entities that consistently appear in listens in a measure of time
    • Example: User listens to the same artist every week for six weeks
  • “Love at first sight”: New listens for entities that are over a certain threshold
    • Example: User listens to a recording for the first time, and over a week, they play that recording over 100 times
  • “Where did you listen?”: Highlight what client a user submitted listens from
    • Listens do not currently collect client identifier (I don’t think)

Proposed action

This proposal maps ideas for new statistics and provides a chance to consider a long-term approach to how new statistics are added to ListenBrainz. Feedback is welcome, and a concrete action for where these suggestions should go is needed (new tickets?).

7 Likes

These are really good ideas!

1 Like

Thanks for making this list! We used this as a starting point for talking about LB graphs in our mini-summit here in Delhi. We’ll post summit notes with mock-ups of some of these graphs in the next day or so!

2 Likes

Awesome! :smile: I’m happy these were helpful. I’ll keep an eye out for the summit notes.

1 Like

Just curious, did this end up anywhere yet?

In case you are looking for the notes:

2 Likes

I wanted to go back and edit this in, but I don’t think I’m able to with my account privileges. :slightly_frowning_face:

I had another idea for new statistics:

Cultural diversity

Since this data already exists in MusicBrainz, I would love to know statistics of my listens with…

  • Different languages
  • Different scripts
  • Artists from different countries

Generation gap

I’d also like to know more about my music listening habits in terms of what years of music I listen to the most? Am I stuck in the jazz age of the 1950s? Do all of my 1980s scrobbles mean I am missing the disco and big hair too much? Or hey, what happened in 2003 to make 93% of my music plays from the last month in 2003?

These are all interesting questions I’d like to explore to better understand the cultural time period of music I fit into as well. :slightly_smiling_face:

3 Likes

Artists from different countries

We shipped this to beta today. iliekcomputers's Stats - ListenBrainz

Edit: It’s in production now.

4 Likes

I get an internal server error on that part of the page. Does anyone else have the same problem? https://listenbrainz.org/user/mfmeulenbelt/reports?range=week

This is great! What about “click on country” => “see artist list”? Is that doable?

We saw that error on Sentry, we’re looking into it!

2 Likes

We’re gonna add that to the API endpoint soon for sure, but I’m not sure if we’ll have time in the short term to show it on the site in a good way! But once the API endpoint allows it, it should be technically possible.

2 Likes

Thanks! It’s so good seeing all the progress that has been made on ListenBrainz recently.

1 Like