用户名: 密码: 验证码:
Pitch-Related Identification of Instruments in Classical Music Recordings
详细信息    查看全文
  • 刊名:Lecture Notes in Computer Science
  • 出版年:2015
  • 出版时间:2015
  • 年:2015
  • 卷:8983
  • 期:1
  • 页码:194-209
  • 全文大小:2,103 KB
  • 参考文献:1.Boulanger-Lewandowski, N., Bengio, Y., Vincent, P.: Discriminative non-negative matrix factorization for multiple pitch estimation. In: 13th International Society for Music Information Retrieval Conference (ISMIR), pp. 205鈥?10 (2012)
    2.Breiman, L.: Random forests. Mach. Learn. 45, 5鈥?2 (2001)View Article MATH
    3.Essid, S., Richard, G., David, B.: Instrument recognition in polyphonic music based on automatic taxonomies. IEEE Trans. Audio Speech Lang. Process. 14(1), 68鈥?0 (2006)View Article
    4.Goto, M., Hashiguchi, H., Nishimura, T., Oka, R.: RWC music database: popular, classical, and jazz music databases. In: 3rd International Conference on Music Information Retrieval, pp. 287鈥?88 (2002)
    5.Goto, M., Hashiguchi, H., Nishimura, T., Oka, R.: RWC music database: music genre database and musical instrument sound database. In: 4th International Conference on Music Information Retrieval, pp. 229鈥?30 (2003)
    6.Heittola, T., Klapuri, A., Virtanen, A.: Musical instrument recognition in polyphonic audio using source-filter model for sound separation. In: 10th International Society for Music Information Retrieval Conference (2009)
    7.Herrera-Boyer, P., Klapuri, A., Davy, M.: Automatic classification of pitched musical instrument sounds. In: Klapuri, A., Davy, M. (eds.) Signal Processing Methods for Music Transcription. Springer Science+Business Media LLC, US (2006)
    8.ISO: MPEG-7 overview. http://鈥媤ww.鈥媍hiariglione.鈥媜rg/鈥媘peg/鈥?/span>
    9.Izmirli, O., Sharma, G.: Bridging printed music and audio through alignment using a mid-level score representation. In: 13th International Society for Music Information Retrieval Conference (ISMIR), pp. 61鈥?6 (2012)
    10.Kameoka, H., Nishimoto, T., Sagayama, S.: Multi-pitch detection algorithm using constrained gaussian mixture model and information criterion for simultaneous speech. In: Speech Prosody 2004, pp. 533鈥?36 (2004)
    11.Kirchhoff, H., Dixon, S., Klapuri, A.: Multi-template shift-variant non-negative matrix deconvolution for semi-automatic music transcription. In: 13th International Society for Music Information Retrieval Conference (ISMIR), pp. 415鈥?20 (2012)
    12.Kitahara, T., Goto, M., Komatani, K., Ogata, T., Okuno, H.G.: Instrument identification in polyphonic music: feature weighting to minimize influence of sound overlaps. EURASIP J. Appl. Signal Process. 2007, 1鈥?5 (2007)View Article
    13. Kubera, E., Wieczorkowska, A., Ra艣, Z., Skrzypiec, M.: Recognition of instrument timbres in real polytimbral audio recordings. In: Balc谩zar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds.) ECML PKDD 2010, Part II. LNCS, vol. 6322, pp. 97鈥?10. Springer, Heidelberg (2010) View Article
    14.Kubera, E., Wieczorkowska, A.A.: Mining audio data for multiple instrument recognition in classical music. In: Appice, A., Ceci, M., Loglisci, C., Manco, G., Masciari, E., Ras, Z.W. (eds.) NFMCP 2013. LNCS, vol. 8399, pp. 246鈥?60. Springer, Heidelberg (2014)View Article
    15. Kubera, E., Wieczorkowska, A.A., Skrzypiec, M.: Influence of feature sets on precision, recall, and accuracy of identification of musical instruments in audio recordings. In: Andreasen, T., Christiansen, H., Cubero, J.-C., Ra艣, Z.W. (eds.) ISMIS 2014. LNCS, vol. 8502, pp. 204鈥?13. Springer, Heidelberg (2014) View Article
    16. Kursa, M., Rudnicki, W., Wieczorkowska, A., Kubera, E., Kubik-Komar, A.: Musical instruments in random forest. In: Rauch, J., Ra艣, Z.W., Berka, P., Elomaa, T. (eds.) ISMIS 2009. LNCS, vol. 5722, pp. 281鈥?90. Springer, Heidelberg (2009) View Article
    17.Martins, L.G., Burred, J.J., Tzanetakis, G., Lagrange, M.: Polyphonic instrument recognition using spectral clustering. In: 8th International Society for Music Information Retrieval Conference (ISMIR) (2007)
    18.Max-Planck-Institut Informatik: chroma toolbox: pitch, chroma, CENS, CRP. http://鈥媤ww.鈥媘pi-inf.鈥媘pg.鈥媎e/鈥媟esources/鈥婱IR/鈥媍hromatoolbox/鈥?/span>
    19.MIDOMI: Search for music using your voice by singing or humming. http://鈥媤ww.鈥媘idomi.鈥媍om/鈥?/span>
    20. Miotto, R., Montecchio, N., Orio, N.: Statistical music modeling aimed at identification and alignment. In: Ra艣, Z.W., Wieczorkowska, A.A. (eds.) Adv. Music Inform. Retrieval. SCI, vol. 274, pp. 187鈥?12. Springer, Heidelberg (2010) View Article
    21. M眉ller, M.: Information Retrieval for Music and Motion. Springer, Heidelberg (2007) View Article
    22.Niewiadomy, D., Pelikant, A.: Implementation of MFCC vector generation in classification context. J. Appl. Comput. Sci. 16(2), 55鈥?5 (2008)
    23.Opolko, F., Wapnick, J.: MUMS 鈥?McGill University master samples: CD鈥檚 (1987)
    24.Oxford University press: Oxford dictionaries. http://鈥媤ww.鈥媜xforddictionari鈥媏s.鈥媍om/鈥?/span>
    25.Sakaue, D., Otsuka, T., Itoyama, K., Okuno, H.G.: Bayesian nonnegative harmonic-temporal factorization and its application to multipitch analysis. In: 13th International Society for Music Information Retrieval Conference (ISMIR), pp. 91鈥?6 (2012)
    26.Shazam entertainment ltd. http://鈥媤ww.鈥媠hazam.鈥媍om/鈥?/span>
    27.Subrahmanian, V.S.: Principles of Multimedia Database Systems. Morgan Kaufmann, San Francisco (1998)
    28.The University of IOWA electronic music studios: musical instrument samples. http://鈥媡heremin.鈥媘usic.鈥媢iowa.鈥媏du/鈥婱IS.鈥媓tml
    29.TrackID. https://鈥媝lay.鈥媑oogle.鈥媍om/鈥媠tore/鈥媋pps/鈥媎etails?鈥媔d=鈥媍om.鈥媠onyericsson.鈥媡rackid
    30. Vincent, E., Rodet, X.: Music transcription with ISA and HMM. In: Puntonet, C.G., Prieto, A.G. (eds.) ICA 2004. LNCS, vol. 3195, pp. 1197鈥?204. Springer, Heidelberg (2004) View Article
    31.Zhang, X., Marasek, K., Ras, Z.W.: Maximum likelihood study for sound pattern separation and recognition. In: IEEE CS International Conference on Multimedia and Ubiquitous Engineering (MUE 2007), Seoul, Korea, pp. 807鈥?12 (2007)
    32.Zweig, G., Nguyen, P.: A segmental CRF approach to large vocabulary continuous speech recognition. In: ASRU 2009: Automatic Speech Recognition and Understanding (2009)
  • 作者单位:El偶bieta Kubera (10)
    Alicja A. Wieczorkowska (11)

    10. University of Life Sciences in Lublin, Akademicka 13, 20-950, Lublin, Poland
    11. Polish-Japanese Academy of Information Technology, Koszykowa 86, 02-008, Warsaw, Poland
  • 丛书名:New Frontiers in Mining Complex Patterns
  • ISBN:978-3-319-17876-9
  • 刊物类别:Computer Science
  • 刊物主题:Artificial Intelligence and Robotics
    Computer Communication Networks
    Software Engineering
    Data Encryption
    Database Management
    Computation by Abstract Devices
    Algorithm Analysis and Problem Complexity
  • 出版者:Springer Berlin / Heidelberg
  • ISSN:1611-3349
文摘
Identification of particular voices in polyphonic and polytimbral music is a task often performed by musicians in their everyday life. However, the automation of this task is very challenging, because of high complexity of audio data. Usually additional information is supplied, and the results are far from satisfactory. In this paper, we focus on classical music recordings, without requiring the user to submit additional information. Our goal is to identify musical instruments playing in short audio frames of polyphonic recordings of classical music. Additionally, we extract pitches (or pitch ranges) which combined with instrument information can be used in score-following and audio alignment, see e.g. [9, 20], or in works towards automatic score extraction, which are a motivation behind this work. Also, since instrument timbre changes with pitch, separate classifiers are trained for various pitch ranges for each instrument. Four instruments are investigated, representing stringed and wind instruments. The influence of adding harmonic (pitch-based) features to the feature set on the results is also investigated. Random forests are applied as a classification tool, and the results are presented and discussed.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700