School of Technology and Computer Science
Tata Institute of Fundamental Research
Homi Bhabha Road
Abstract: ASR or Automatic Speech Recognition is the process of conversion of spoken utterance by humans into text by machines. Currently used ASR tools use probabilistic techniques such as HMM (Hidden Markov Models) for modeling sounds and then employ pattern matching to decode an unknown speech utterance. The talk will highlight the major difficulties in the ASR process and how different algorithms, techniques try to overcome these difficulties.