TY - GEN
T1 - Structured audio content analysis and metadata in a digital library
AU - Bainbridge, David
AU - Downie, John Stephen
AU - Ehmann, Andreas F.
PY - 2012
Y1 - 2012
N2 - This work illustrates how audio content analysis of music and manually assigned structural temporal metadata can be used to form a digital library designed for musicological exploration. In addition to text-based searching and browsing, the document view is enriched with an interactive structured audio time-line that shows ground-truth data representing the logical segments to the song, and a version that was automatically generated for comparison. A self-similarity "heat" map is also displayed, and is interactive. Clicking within the map at a co-ordinate (x,y) results in the audio being played simultaneous at time offset x and y, panned left and right, respectively, to make it easier for the listener to separate out the differences. The musicologist can also initiate an audio content based query starting at any point in the song. This produces a ranked result set which can be further studied through their respective document views. Alternatively they can perform a musical structure search (for example, for songs that contain the structure b, b, c, b, c).
AB - This work illustrates how audio content analysis of music and manually assigned structural temporal metadata can be used to form a digital library designed for musicological exploration. In addition to text-based searching and browsing, the document view is enriched with an interactive structured audio time-line that shows ground-truth data representing the logical segments to the song, and a version that was automatically generated for comparison. A self-similarity "heat" map is also displayed, and is interactive. Clicking within the map at a co-ordinate (x,y) results in the audio being played simultaneous at time offset x and y, panned left and right, respectively, to make it easier for the listener to separate out the differences. The musicologist can also initiate an audio content based query starting at any point in the song. This produces a ranked result set which can be further studied through their respective document views. Alternatively they can perform a musical structure search (for example, for songs that contain the structure b, b, c, b, c).
KW - audio content analysis
KW - digital libraries
KW - structured metadata
UR - http://www.scopus.com/inward/record.url?scp=84863554310&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84863554310&partnerID=8YFLogxK
U2 - 10.1145/2232817.2232927
DO - 10.1145/2232817.2232927
M3 - Conference contribution
AN - SCOPUS:84863554310
SN - 9781450311540
T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
SP - 431
EP - 432
BT - JCDL '12 - Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries
T2 - 12th ACM/IEEE-CS Joint Conference on Digital Libraries, JCDL '12
Y2 - 10 June 2012 through 14 June 2012
ER -