Name: Ville Hautamäki, PhD
Occupation: Associate Professor at the University of Eastern Finland.
Currently (2022-2023) Finnish PI of the mobility project "Planning with Deep Graph Neural Networks" funded by the Academy of Finland.
Past (2019) PI of a MATINE-funded deepfake detection project.
(2018-2019) co-PI of the consortium project "Deep reinforcement learning for physical agents (DEEPEN)" funded by the Academy of Finland.
(2015) PI of a one-year foreign accent recognition project funded by MATINE, the Finnish Defence Forces.
(2011-2014) PI of a three-year dialect and accent recognition post-doc project funded by the Academy of Finland.
Research interests: Artificial Intelligence, Reinforcement Learning, Machine Learning, Bayesian Inference, Bioinformatics, Speech technology in general, and speaker recognition and language recognition in particular.
My Google Scholar profile

Pre-prints:

Journal publications:

  1. Jari Turkia, Ursula Schwab, Ville Hautamäki, Inferring personal intake recommendations of phosphorous and potassium for end-stage renal failure patients by simulating with Bayesian hierarchical multivariate model, PLOS ONE, 2024 (accepted)
  2. Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li, Self-Supervised Training of Speaker Encoder With Multi-Modal Diverse Positive Pairs, IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 31, p. 1706-1719, 2023.
  3. Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki, "GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters", IEEE Transactions on Games, vol. 15, no. 4, pp. 566-579, Dec. 2023.
  4. Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, and Junichi Yamagishi, "Optimizing Tandem Speaker Verification and Anti-Spoofing Systems", IEEE Transactions on Audio, Speech and Language Processing Vol. 30, pp. 477-488, January, 2022.
  5. Jari Turkia, Lauri Mehtätalo, Ursula Schwab, and Ville Hautamäki, Mixed-Effect Bayesian Network Reveals Personal Effects of Nutrition, Scientific Reports, Vol. 11, No. 12016, 2021.
  6. Ivan Kukanov, Trung Ngo Trong, Ville Hautamäki, Sabato Marco Siniscalchi, Valerio Mario Salerno, Kong Aik Lee, "Maximal Figure-of-Merit Framework to Detect Multi-label Phonetic Features for Spoken Language Recognition", IEEE Transactions on Audio, Speech and Language Processing, Vol. 28, pp. 682-695, January, 2020. Supplementary materials
  7. Trung Ngo Trong, Roger Kramer, Juha Mehtonen, Gerardo Gonzalez, Ville Hautamäki, Merja Heinäniemi, "Semi-Supervised generative Autoencoder for single cell data". Journal of Computational Biology, 2019 (accepted). Github
  8. R. Gonzalez Hautamäki, V. Hautamäki, T. Kinnunen, "On Limits of Automatic Speaker Verification: Explaining Degraded Recognizer Score Through Acoustic Changes Resulting from Voice Disguise", Journal of the Acoustical Society of America, 2019 (accepted)
  9. P. Pölönen, J. Mehtonen, J. Lin, T. Liuksiala, S. Häyrynen, S. Teppo, A. Mäkinen, A. Kumar, D. Malani, V. Pohjolainen, K. Porkka, CA. Heckman, P. May, V. Hautamäki, K. Granberg, O. Lohi, M. Nykter, M. Heinäniemi, Hemap: An interactive online resource for characterizing molecular phenotypes across hematologic malignancies, Cancer Research, doi: 10.1158/0008-5472.CAN-18-2970, 2019.
  10. J. Mehtonen, P. Pölönen, S. Häyrynen, J. Lin, T. Liuksiala, K. Granberg, O. Lohi, V. Hautamäki, M. Nykter, M. Heinäniemi, Data-driven characterization of molecular phenotypes across heterogeneous sample collections. Nucleic Acids Research, doi: 10.1093/nar/gkz281, 2019.
  11. Rosa Gonzalez Hautamäki, Md Sahidullah, Ville Hautamäki and Tomi Kinnunen, "Acoustical and perceptual study of voice disguise by age modification in speaker verification", Speech Communication, Vol. 95, pp. 1-15, December 2017.
  12. Hamid Behravan, Ville Hautamäki, Sabato Siniscalchi, Tomi Kinnunen, and Chin-Hui Lee, "i-Vector Modeling of Speech Attributes for Automatic Foreign Accent Recognition", IEEE Transactions on Audio, Speech and Language Processing, Vol. 24, No 1, pp. 29-41, January 2016.
  13. Rosa Gonzalez Hautamäki, Tomi Kinnunen, Ville Hautamäki and Anne-Maria Laukkanen, "Automatic versus human speaker verification: the case of voice mimicry", Speech Communication, Vol. 72, pp. 13-31, September, 2015.
  14. Hamid Behravan, Ville Hautamäki, and Tomi Kinnunen, "Factors Affecting i-Vector Based Foreign Accent Recognition: a Case Study in Spoken Finnish", Speech Communication, Vol. 66, pp. 118-129, February, 2015.
  15. Padmanabhan Rajan, Anton Afanasyev, Ville Hautamäki, and Tomi Kinnunen, "From single to multiple enrollment i-vectors: practical PLDA scoring variants for speaker verification", Digital Signal Processing, Vol. 31, pp. 93-101, 2014.
  16. Ville Hautamäki, Tomi Kinnunen, Filip Sedlak, Kong Aik Lee, Bin Ma, Haizhou Li, "Sparse Classifier Fusion for Speaker Verification", IEEE Transactions on Audio, Speech and Language Processing, Vol. 21, No. 8, pp. 1622-1631, August, 2013.
  17. Q. Zhao, V. Hautamäki, I. Kärkkäinen, and P. Fränti, "Random Swap EM algorithm for Gaussian Mixture Models", Pattern Recognition Letters, Vol. 19, No. 12, pp. 914-917, December, 2012. Supplementary material [C++ implementation]
  18. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, and P. Fränti, "Comparative Evaluation of Maximum a Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification", Pattern Recognition Letters, Vol. 30, No. 4, pp. 341-347, March, 2009.
  19. V. Hautamäki, T. Kinnunen, and P. Fränti, "Text-Independent Speaker Recognition Using Graph Matching", Pattern Recognition Letters, Vol. 29, No. 9, pp. 1427-1432, July, 2008.
  20. V. Hautamäki, T. Kinnunen, I. Kärkkäinen, J. Saastamoinen, M. Tuononen and P. Fränti, "Maximum a Posteriori Adaptation of the Centroid Model for Speaker Verification", IEEE Signal Processing Letters, Vol. 15, pp. 162-165. 2008.
  21. P. Fränti, O. Virmajoki and V. Hautamäki, "Fast agglomerative clustering using k nearest neighbor graph", IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 28, No. 11, pp. 1875-1881, November, 2006.
  22. J. Saastamoinen, E. Karpov, V. Hautamäki and P. Fränti, "Accuracy of MFCC based speaker recognition in series 60 device", Journal of Applied Signal Processing, Vol. 17, pp. 2816-2827, September, 2005.

Conference publications:

  1. Marko Tuononen, Dani Korpi and Ville Hautamäki, Interpreting Deep Neural Network-Based Receiver Under Varying Signal-To-Noise Ratios, ICASSP 2025.
  2. Ivan Kukanov, Janne Laakkonen, Tomi Kinnunen and Ville Hautamäki, "Meta-Learning Approaches for Improving Detection of Unseen Speech Deepfakes", SLT 2024.
  3. Meichen Gong, Konstantin Ivanov, Merja Heinäniemi, and Ville Hautamäki, Enhancing Single-Cell VAE Latent Space via Semi-Supervision, ICML 2024 workshop on Accessible and Efficient Foundation Models for Biological Discovery (accepted).
  4. Vishwanath Pratap Singh, Federico Malato, Ville Hautamäki, Md Sahidullah and Tomi Kinnunen, "ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2vec2.0 Based ASR", Interspeech 2024 (accepted).
  5. Federico Malato, Ville Hautamäki, "Online Adaptation for Enhancing Imitation Learning Policies", CoG 2024 (accepted).
  6. Federico Malato, Florian Leopold, Andrew Melnik, Ville Hautamäki, "Zero-shot Imitation Policy via Search in Demonstration Dataset", ICASSP 2024.
  7. Yi Ma, Kong Aik Lee, Ville Hautamäki, Meng Ge, Haizhou Li, "Gradient Weighting for Speaker Verification in Extremely Low Signal-to-Noise Ratio", ICASSP 2024.
  8. Stephanie Milani, Anssi Kanervisto, Karolis Ramanauskas, Sander Schulhoff, Brandon Houghton, Sharada Mohanty, Byron Galbraith, Ke Chen, Yan Song, Tianze Zhou, Bingquan Yu, He Liu, Kai Guan, Yujing Hu, Tangjie Lv, Federico Malato, Florian Leopold, Amogh Raut, Ville Hautamäki, Andrew Melnik, Shu Ishida, João Henriques, Robert Klassert, Walter Laurito, Lucas Cazzonelli, Cedric Kulbach, Nicholas Popovic, Marvin Schweizer, Ellen Novoseller, Vinicius Goecks, Nicholas Waytowich, David Watkins, Josh Miller, Rohin Shah, Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition, Proceedings of the NeurIPS 2022 Competitions Track, PMLR 220:171-188, 2022
  9. Ruijie Tao, Kong Aik Lee, Rohan Kumar Das, Ville Hautamäki, Haizhou Li, Self-supervised Speaker Recognition with Loss-gated Learning, ICASSP 2022 (accepted).
  10. Federico Malato, Joona Jehkonen, Ville Hautamäki, Improving Behavioural Cloning with Human-Driven Dynamic Dataset Augmentation, AAAI 2022 Workshop on Interactive Machine Learning (accepted).
  11. Anssi Kanervisto, Tomi Kinnunen, Ville Hautamäki, General Characterization of Agents by States they Visit, NeurIPS 2021 Workshop on Deep Reinforcement Learning (accepted).
  12. Yi Ma, Kong Aik Lee, Ville Hautamäki, Haizhou Li, PL-EERSR: Perceptual Loss Based End-to-End Robust Speaker Representation Extraction, ASRU 2021 (accepted).
  13. Khaled Hechmi, Trung Ngo Trong, Ville Hautamäki, Tomi Kinnunen, Voxceleb Enrichment for Age and Gender Recognition, ASRU 2021 (accepted).
  14. Anssi Kanervisto, Christian Scheller, Yanick Schraner and Ville Hautamäki, Distilling Reinforcement Learning Tricks for Video Games, CoG 2021 (accepted).
  15. Keishi Ishihara, Anssi Kanervisto, Jun Miura and Ville Hautamäki, "Multi-task Learning with Attention for End-to-end Autonomous Driving", CVPR 2021 Workshop on Autonomous Driving, 2021 (accepted). Github
  16. Ivan Kukanov, Janne Karttunen, Hannu Sillanpää, and Ville Hautamäki, "Cost Sensitive Optimization of Deepfake Detector", APSIPA 2020 (accepted).
  17. Anssi Kanervisto, Joonas Pussinen and Ville Hautamäki, "Benchmarking End-to-End Behavioural Cloning on Video Games", CoG 2020. Github
  18. Anssi Kanervisto, Christian Scheller and Ville Hautamäki, "Action Space Shaping in Deep Reinforcement Learning", CoG 2020. Github
  19. Anssi Kanervisto, Janne Karttunen, Ville Hautamäki, "Playing Minecraft with Behavioural Cloning", PMLR post proceedings - Competition Track@NeurIPS2019, 2020. Github
  20. Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, Junichi Yamagishi, An initial investigation on optimizing tandem speaker verification and countermeasure systems using reinforcement learning, Speaker Odyssey 2020. Github
  21. Janne Karttunen, Anssi Kanervisto, Ville Kyrki, Ville Hautamäki, "From Video Game to Real Robot: The Transfer Between Action Spaces", ICASSP 2020 (accepted).
  22. Trung Ngo Trong, Kristiina Jokinen, Ville Hautamäki, "Enabling Spoken Dialogue Systems for Low-Resourced Languages -- End-to-End Dialect Recognition for North Sami", In: D'Haro L., Banchs R., Li H. (eds) 9th International Workshop on Spoken Dialogue System Technology. Lecture Notes in Electrical Engineering, vol 579. Springer, Singapore, 2019 (LNCS version of the IWSDS 2018 paper).
  23. Bilal Soomro, Anssi Kanervisto, Trung Ngo Trong and Ville Hautamäki, "Towards Debugging Deep Neural Networks by Generating Speech Utterances", Interspeech 2019 (accepted). Github
  24. Kong Aik Lee, Ville Hautamäki, Tomi Kinnunen, Hitoshi Yamamoto, Koji Okabe, Ville Vestman, Jing Huang, Guohong Ding, Hanwu Sun, Anthony Larcher, Rohan Kumar Das, Haizhou Li, Mickael Rouvier, Pierre-Michel Bousquet, Wei Rao, Qing Wang, Chunlei Zhang, Fahimeh Bahmaninezhad, Hector Delgado and Massimiliano Todisco, "I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences", Interspeech 2019 (accepted).
  25. Trung Ngo Trong, Roger Kramer, Juha Mehtonen, Gerardo Gonzalez, Ville Hautamäki, Merja Heinäniemi, "SISUA: SemI-SUpervised generative Autoencoder for single cell data". ICML Workshop on Computational Biology, 2019 (accepted). Github
  26. Anssi Kanervisto, Ville Hautamäki, "ToriLLE: Learning Environment for Hand-to-Hand Combat", CoG 2019 (accepted). Github
  27. Ville Vestman, Bilal Soomro, Anssi Kanervisto, Ville Hautamäki, Tomi Kinnunen, "Who Do I Sound Like? Showcasing Speaker Recognition Technology by Youtube Voice Search", ICASSP 2019 (accepted).
  28. Rosa Gonzalez Hautamäki, Anssi Kanervisto, Ville Hautamäki and Tomi Kinnunen, "Voice Disguise that Evades Speaker Recognition Does Not Have to Sound Believable", Odyssey 2018 (accepted).
  29. Trung Ngo Trong, Ville Hautamäki and Kristiina Jokinen, "Staircase Network: structural language identification via hierarchical attentive units", Odyssey 2018 (accepted).
  30. Trung Ngo Trong, Kristiina Jokinen and Ville Hautamäki, "Enabling Spoken Dialogue Systems for low-resourced languages - end-to-end dialect recognition for North Sami", IWSDS 2018 Best paper award.
  31. Ivan Kukanov, Ville Hautamäki and Kong Aik Lee, "Maximal Figure-of-Merit Embedding for Multi-label Audio Classification", ICASSP 2018 (accepted).
  32. K. A. Lee, V. Hautamäki, T. Kinnunen, A. Larcher, C. Zhang, A. Nautsch, T. Stafylakis, G. Liu, M. Rouvier, W. Rao, F. Alegre, J. Ma, M. W. Mak, A. K. Sarkar, H. Delgado, R. Saeidi, H. Aronowitz, A. Sizov, H. Sun, T. H. Nguyen, G. Wang, B. Ma, V. Vestman, M. Sahidullah, M. Halonen, A. Kanervisto, G. Le Lan, F. Bahmaninezhad, S. Isadskiy, C. Rathgeb, C. Busch, G. Tzimiropoulos, Q. Qian, Z. Wang, Q. Zhao, T. Wang, H. Li, J. Xue, S. Zhu, R. Jin, T. Zhao, P.-M. Bousquet, M. Ajili, W. B. Kheder, D. Matrouf, Z. H. Lim, C. Xu, H. Xu, X. Xiao, E. S. Chng, B. Fauve, K. Sriskandaraja, V. Sethu, W. W. Lin, D. A. L. Thomsen, Z.-H. Tan, M. Todisco, N. Evans, H. Li, J. H. L. Hansen, J.-F. Bonastre, E. Ambikairajah, The I4U Mega Fusion and Collaboration for NIST Speaker Recognition Evaluation 2016, Proc. Interspeech 2017.
  33. Tomi Kinnunen, Md Sahidullah, Mauro Falcone, Luca Costantini, Rosa Gonzalez Hautamäki, Dennis Thomsen, Achintya Sarkar, Zheng-Hua Tan, Hector Delgado, Massimiliano Todisco, Nicholas Evans, Ville Hautamäki and Kong Aik Lee, "RedDots Replayed: A New Replay Spoofing Attack Corpus for Text-dependent Speaker Verification Research", ICASSP 2017.
  34. Anssi Kanervisto, Ville Vestman, Md Sahidullah, Ville Hautamäki, Tomi Kinnunen, "Effects of Gender Information in Text-independent and Text-dependent Speaker Verification", ICASSP 2017.
  35. Ivan Kukanov, Ville Hautamäki, Sabato Siniscalchi, and Kehuang Li, "Deep learning with Maximal Figure-of-Merit Cost to Advance Multi-label Speech Attribute Detection", SLT, San Diego, USA, December 2016.
  36. Kong Aik Lee, Haizhou Li, Li Deng, Ville Hautamäki, Rao Wei, Xiong Xiao, Anthony Larcher, Hanwu Sun, Trung Hieu Nguyen, Guangsen Wang, Aleksandr Sizov, Jianshu Chen, Ivan Kukanov, Amir Poorjam, Trung Ngo Trong, Cheng-Lin Xu, Haihua Xu, Bin Ma, Eng Siong Chng and Sylvain Meignier, "The 2015 NIST Language Recognition Evaluation: the Shared View of I2R, Fantastic4 and SingaMS", Interspeech, pp. 3211--3215, San Francisco, USA, September 2016.
  37. Tomi Kinnunen, Md Sahidullah, Ivan Kukanov, Hector Delgado, Massimiliano Todisco, Achintya Sarkar, Nicolai Thomsen, Ville Hautamäki, Nicholas Evans and Zheng-Hua Tan, "Utterance Verification for Text-Dependent Speaker Recognition: a Comparative Assessment Using the RedDots Corpus", Interspeech, pp. 430--434, San Francisco, USA, September 2016.
  38. Md Sahidullah, Rosa Gonzalez Hautamäki, Dennis Alexander Lehmann Thomsen, Tomi Kinnunen, Zheng-Hua Tan, Ville Hautamäki, Robert Parts and Martti Pitkanen, "Robust Speaker Recognition with Combined Use of Acoustic and Throat Microphone Speech", Interspeech, pp. 1720--1724, San Francisco, USA, September 2016.
  39. Kristiina Jokinen, Trung Ngo Trong and Ville Hautamäki, "Variation in Spoken North Sami Language", Interspeech, pp. 3299--3303, San Francisco, USA, September 2016.
  40. Trung Ngo Trong, Ville Hautamäki and Kong Aik Lee, "Deep Language: a comprehensive deep learning approach to end-to-end language recognition", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  41. Rosa Gonzalez Hautamäki, Md Sahidullah, Tomi Kinnunen and Ville Hautamäki, "Age-Related Voice Disguise and its Impact on Speaker Verification Accuracy", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  42. Amir Hossein Poorjam, Rahim Saeidi, Tomi Kinnunen and Ville Hautamäki, "Incorporating uncertainty as a Quality Measure in I-Vector Based Language Recognition", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  43. Hamid Behravan, Tomi Kinnunen and Ville Hautamäki, "Out-of-set i-Vector Selection for Open-set Language Identification", Speaker Odyssey, Bilbao, Spain, 2016 (accepted).
  44. Ville Hautamäki, Sabato Siniscalchi, Hamid Behravan, Valerio Mario Salerno and Ivan Kukanov, "Boosting Universal Speech Attributes Classification with Deep Neural Network for Foreign Accent Characterization", Interspeech 2015, pp. 408-412, Dresden, Germany, September 2015.
  45. Hamid Behravan, Ville Hautamäki, Sabato Siniscalchi, Elie Khoury, Tommi Kurki, Tomi Kinnunen, and Chin-Hui Lee, "Dialect Levelling in Finnish: A Universal Speech Attribute Approach", Interspeech 2014, pp. 2165-2169, Singapore, September, 2014.
  46. Ville Hautamäki, Antti Pöllänen, Tomi Kinnunen, Kong Aik Lee, Haizhou Li and Pasi Fränti, "A Comparison of Categorical Attribute Data Clustering Methods", S+SSPR 2014, pp. 53-62, Joensuu, Finland, August, 2014.
  47. Rosa Gonzalez Hautamäki, Tomi Kinnunen, Ville Hautamäki and Anne-Maria Laukkanen, "Comparison of human listeners and speaker verification systems using voice mimicry data", Speaker Odyssey 2014, pp. 137-144, Joensuu, Finland, June, 2014.
  48. Hamid Behravan, Ville Hautamäki, Sabato Siniscalchi, Tomi Kinnunen, and Chin-Hui Lee, "Introducing Attribute Features to Foreign Accent Recognition", ICASSP 2014, pp. 5369-5373, Florence, Italy, May, 2014.
  49. You-Chi Cheng, Ville Hautamäki, Zhen Huang, Kehuang Li, and Chin-Hui Lee. "An I-Vector Based Descriptor for Alphabetical Gesture Recognition", ICASSP 2014, pp. 6643-6647, Florence, Italy, May, 2014.
  50. Ville Hautamäki, You-Chi Cheng, Padmanabhan Rajan, and Chin-Hui Lee, "Minimax i-vector extractor for short duration speaker verification", Interspeech 2013, Lyon, France, August 2013.
  51. Ville Hautamäki, Kong Aik Lee, David van Leeuwen, Rahim Saeidi, Anthony Larcher, Tomi Kinnunen, Taufiq Hasan, Seyed Omid Sadjadi, Gang Liu, Hynek Boril, John H.L. Hansen and Benoit Fauve, "Automatic regularization of cross-entropy cost for speaker recognition fusion", Interspeech 2013, Lyon, France, August 2013.
  52. Rosa Gonzalez Hautamäki, Tomi Kinnunen, Ville Hautamäki, Timo Leino and Anne-Maria Laukkanen, "I-vectors meet imitators: on vulnerability of speaker verification systems against voice mimicry", Interspeech 2013, Lyon, France, August 2013.
  53. Rosa Gonzalez Hautamäki, Ville Hautamäki, Padmanabhan Rajan and Tomi Kinnunen, "Merging human and automatic system decisions to improve speaker recognition performance", Interspeech 2013, Lyon, France, August 2013.
  54. Rahim Saeidi, Kong Aik Lee, Tomi Kinnunen, Taufiq Hasan, Benoit Fauve, Pierre-Michel Bousquet, Elie Khoury, Pablo L. Sordo Martinez, Karen Kua, Changhuai You, Hanwu Sun, Anthony Larcher, Paddy Rajan, Ville Hautamäki, Cemal Hanilci, Billy Braithwaite, Rosa Gonzalez Hautamäki, Seyed Omid Sadjadi, Liu Gang and Hynek Boril, "I4U submission to NIST SRE 2012: A large-scale collaborative effort for noise-robust speaker verification", Interspeech 2013, Lyon, France, August 2013.
  55. Zhen Huang, You-Chi Cheng, Kehuang Li, Ville Hautamäki and Chin-Hui Lee, "A Blind Segmentation Approach to Acoustic Event Detection Based on I-Vector", Interspeech 2013, pp. 2282-2286, Lyon, France, August 2013.
  56. Hamid Behravan, Ville Hautamäki and Tomi Kinnunen, "Foreign Accent Detection from Spoken Finnish Using i-Vectors", Interspeech 2013, Lyon, France, August 2013.
  57. Padmanabhan Rajan, Tomi Kinnunen and Ville Hautamäki, "Effect of multicondition training on i-vector PLDA configurations for speaker recognition", Interspeech 2013, (accepted).
  58. Ville Hautamäki, Kong Aik Lee, Anthony Larcher, Tomi Kinnunen, Bin Ma, and Haizhou Li, "Variational Bayes Logistic Regression as Regularized Fusion for NIST SRE 2010", In Speaker Odyssey 2012, June, Singapore, 2012.
  59. Van Hai Do, Xiong Xiao, Ville Hautamäki, Eng Siong Chng, "Speech Attribute Recognition using Context-Dependent Modeling", In APSIPA ASC 2011, October, Xi'an, China. [PDF]
  60. Qinpei Zhao, V. Hautamäki and Pasi Fränti, "RSEM: an accelerated algorithm on repeated EM", ICIG 2011.
  61. Ville Hautamäki, Kong Aik Lee, Tomi Kinnunen, Bin Ma, and Haizhou Li, "Regularized Logistic Regression Fusion for Speaker Verification", In Interspeech 2011, pp. 2745-2748, August, Florence, Italy. [PDF]
  62. Kong Aik Lee, Chang Huai You, Ville Hautamäki, Anthony Larcher, and Haizhou Li, "Spoken Language Recognition in the Latent Topic Simplex", In Interspeech 2011, pp. 2893--2896, August, Florence, Italy. [PDF]
  63. Filip Sedlak, Tomi Kinnunen, Ville Hautamäki, Kong Aik Lee, and Haizhou Li, "Classifier Subset Selection and Fusion for Speaker Verification", In ICASSP 2011 [PDF] [video and slides].
  64. Ville Hautamäki, Tomi Kinnunen, Mohaddeseh Nosratighods, Kong Aik Lee, Bin Ma, and Haizhou Li, "Approaching Human Listener Accuracy with Modern Speaker Verification", In Interspeech 2010, Makuhari, Japan, pp. 1473-1476, September 2010. [PDF]
  65. Raymond W. M. Ng, Cheung-Chi Leung, Ville Hautamäki, Tan Lee, Bin Ma, and Haizhou Li, "Towards long-range prosodic attribute modeling for language recognition", In Interspeech 2010, Makuhari, Japan, pp. 1792-1795, September 2010.
  66. P. Fränti, A. Tabarcea, J. Kuittinen, and V. Hautamäki, "Location-based Search Engine for Multimedia Phones", In ICME 2010.
  67. A. Tabarcea, V. Hautamäki, and P. Fränti, "Ad-hoc Georeferencing of Web-pages Using Street-name Prefix Trees", In 6th International Conference on Web Information Systems and Technologies (WEBIST 2010).
  68. Q. Zhao, V. Hautamäki, I. Kärkkäinen, and P. Fränti, "Random Swap EM algorithm for Finite Mixture Models in Image Segmentation", In Proc. IEEE Int. Conf. on Image Processing (ICIP 2009), Cairo, Egypt, pp. 2397-2400, November 2009.
  69. T. Kinnunen, J. Saastamoinen, V. Hautamäki, M. Vinni, and P. Fränti, "Comparing Maximum A Posteriori Vector Quantization and Gaussian Mixture Models in Speaker Verification", In Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP 2009), Taipei, Taiwan, pp. 4545-4548, April 2009.
  70. P. Fränti, J. Saastamoinen, I. Kärkkäinen, T. Kinnunen, V. Hautamäki, and I. Sidoroff, "Developing Speaker Recognition System: from Prototype to Practical Application", In Proc. e-Forensics 2009 (accepted).
  71. Q. Zhao, V. Hautamäki, P. Fränti, "Knee Point Detection in BIC for Detecting the Number of Clusters", Advanced Concepts for Intelligent Vision Systems (ACIVS 2008), Juan-les-Pins, France, pp. 664-673, October 2008. [PDF]
  72. V. Hautamäki, P. Nykänen and P. Fränti, "Time-series Clustering by Approximate Prototypes", 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, Florida, USA, December, 2008. [PDF]
  73. P. Fränti, O. Virmajoki and V. Hautamäki, "Probabilistic Clustering by Random Swap Algorithm", 19th International Conference on Pattern Recognition (ICPR 2008), Tampa, Florida, USA, December, 2008.
  74. V. Hautamäki, M. Tuononen, T. Niemi-Laitinen and P. Fränti, Improving Speaker Verification by Periodicity Based Voice Activity Detection, Proc. 12th International Conference on Speech and Computer (SPECOM 2007), Vol. 2, pp. 645-650, Moscow, October 2007. [PDF]
  75. T. Kinnunen, V. Hautamäki and P. Fränti, On the Use of Long-Term Average Spectrum in Automatic Speaker Recognition, Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006), Vol II, Singapore, pp. 559-567, December 2006.
  76. R. Timofte, V. Hautamäki and P. Fränti, Speaker, Vocabulary and Context Independent Word Spotting System for Continuous Speech, Proc. 5th International Symposium on Chinese Spoken Language Processing (ISCSLP 2006), Vol II, Singapore, pp. 396-407, December 2006.
  77. H. Gupta, V. Hautamäki, T. Kinnunen and P. Fränti, Field Evaluation of Text-Dependent Speaker Recognition in an Access Control Application, Proc. 10th International Conference on Speech and Computer (SPECOM 2005), pp. 551-554, Patras, Greece, October 2005. [PDF]
  78. V. Hautamäki, S. Cherednichenko, I. Kärkkäinen, T. Kinnunen and P. Fränti, Improving K-Means by Outlier Removal, Proc. 14th Scandinavian Conference on Image Analysis (SCIA 2005), pp. 978-987, Joensuu, Finland, June 2005. [PDF]
  79. T. Kinnunen, V. Hautamäki, P. Fränti, Fusion of Spectral Feature Sets for Accurate Speaker Identification, Proc. 9th International Conference Speech and Computer (SPECOM 2004), pp. 361-365, St. Petersburg, Russia, September, 2004. [PDF]
  80. J. Saastamoinen, E. Karpov, V. Hautamäki and P. Fränti, "Automatic Speaker Recognition for Series 60 Mobile Devices", In Proc. 9th International Conference Speech and Computer (SPECOM'2004), pp. 353-360, St. Petersburg, Russia, September 20-22, 2004.
  81. V. Hautamäki, I. Kärkkäinen and P. Fränti, "Outlier Detection Using k-Nearest Neighbour Graph", 17th International Conference on Pattern Recognition (ICPR 2004), pp. 430-433, Cambridge, United Kingdom, August, 2004. [PDF] code
  82. P. Fränti, O. Virmajoki and V. Hautamäki, "Fast PNN-based clustering using k-nearest neighbor graph", IEEE International Conference on Data Mining (ICDM 2003), Melbourne, Florida, USA, pp. 525-528, November 2003.
  83. T. Kinnunen, V. Hautamäki and P. Fränti, "On the fusion of dissimilarity-based classifiers for speaker identification", European Conference on Speech Communication and Technology, (Eurospeech 2003), Geneva, Switzerland, pp. 2641-2644, September 2003. [PDF]
  84. P. Fränti and V. Hautamäki, "Compression of aerial images for reduced-color devices", SPIE Conference on Image and Video Communications and Processing, Santa Clara, USA, SPIE Vol. 5022, Part II, pp. 651-662, January 2003. [PDF]

Theses:

  1. Ville Hautamäki, Improving Pattern Recognition Methods for Speaker Recognition, PhD thesis, University of Joensuu, Department of Computer Science, October 2008. [PDF]
  2. Ville Hautamäki, Efficient Color Quantization by Hierarchical Clustering Algorithms, Master's thesis, University of Joensuu, Department of Computer Science, February 2005. [PDF]
  3. Ville Hautamäki and Jussi Heino, Evaluation of image compression methods for aerial photos, Bachelor's Thesis, University of Joensuu, Department of Computer Science, December 2000. [PDF]

Other publications:

  1. T. Kinnunen, V. Hautamäki, "Automaattinen puhujantunnistus", in O. Aaltonen, R. Aulanko, A. Iivonen, A. Klippi, M. Vainio (Eds.), Puhuva Ihminen - puhetieteiden perusteet, Otava, 2009. ("Automatic speaker recognition"; a book chapter in Finnish about basics of speaker recognition for non-technical audience).
  2. P. Fränti, J. Saastamoinen, I. Kärkkäinen, T. Kinnunen, V. Hautamäki, I. Sidoroff, Implementing Speaker Recognition System: from Matlab to Practice, Report series / University of Joensuu, Department of Computer Science and Statistics, A-2007-4 (ISBN 978-952-219-061-1, ISSN 1796-7317), November 2007.

updated: Tue Nov 24 08:44:59 EET 2009