Hemant A. Patil
Person information
- affiliation: Dhirubhai Ambani Institute of Information and Communication Technology (DA-IICT), Gandhinagar, India
2020 – today
- 2024
- [j34]Priyanka Gupta, Hemant A. Patil:
Morse wavelet transform-based features for voice liveness detection. Comput. Speech Lang. 84: 101571 (2024) - [j33]Priyanka Gupta, Hemant A. Patil, Rodrigo Capobianco Guido:
Vulnerability issues in Automatic Speaker Verification (ASV) systems. EURASIP J. Audio Speech Music. Process. 2024(1): 10 (2024) - [j32]Kirtana Sunil Phatnani, Hemant A. Patil:
Modeling musical expectancy via reinforcement learning and directed graphs. Multim. Tools Appl. 83(10): 28523-28547 (2024) - [j31]Dipesh K. Singh, Gauri P. Prajapati, Hemant A. Patil:
Voice Privacy Using Time-Scale and Pitch Modification. SN Comput. Sci. 5(2): 243 (2024) - [j30]Hemant A. Patil, Aastha Kachhi, Ankur T. Patil:
CQT-Based Cepstral Features for Classification of Normal vs. Pathological Infant Cry. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4713-4726 (2024) - [c223]Arth J. Shah, Manish Suthar, Hemant A. Patil:
Multi-Block U-Net for Wind Noise Reduction in Hearing Aids. ICPR (27) 2024: 234-249 - [c222]Arth J. Shah, Hiya Chaudhari, Hemant A. Patil:
Infant Cry Classification Using Modified Group Delay Cepstral Coefficients. ICPR (14) 2024: 275-289 - [c221]Aditya Pusuluri, Hemant A. Patil:
Linear Frequency Residual Cepstral Features for Dysarthria Severity Classification. ICPR (20) 2024: 316-331 - [c220]Ravindrakumar M. Purohit, Arushi Srivastava, Hemant A. Patil:
FCHiFi-GAN: Aggrandizing Fast Convergence with Batchwise Normalization. ICPR (31) 2024: 356-372 - [c219]Japan Bhatt, Harsh Patel, Hemant A. Patil:
Noise Robust Whisper Features for Dysarthric Automatic Speech Recognition. Odyssey 2024: 217-224 - [c218]Dipesh K. Singh, Preet P. Amin, Hemant A. Patil, Hardik B. Sailor:
Voice Conversion Based Data Augmentation Using CycleGAN for Children's ASR. SPCOM 2024: 1-5 - [c217]S. Uthiraa, Akshat Vora, Prathamesh Bonde, Aditya Pusuluri, Hemant A. Patil:
Spectral and Pitch Components of CQT Spectrum for Emotion Recognition. SPCOM 2024: 1-5 - 2023
- [j29]Kuldeep Khoria, Ankur T. Patil, Hemant A. Patil:
On significance of constant-Q transform for pop noise detection. Comput. Speech Lang. 77: 101421 (2023) - [j28]Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components. Comput. Speech Lang. 77: 101423 (2023) - [j27]Sylvio Barbon, Rodrigo Capobianco Guido, Gabriel Jonas Aguiar, Everton José Santana, Mario Lemes Proença Jr., Hemant A. Patil:
Multiple voice disorders in the same individual: Investigating handcrafted features, multi-label classification algorithms, and base-learners. Speech Commun. 152: 102952 (2023) - [c216]Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Relevance of Quadrature Phase For Replay Detection in Voice Assistants (VAs). APSIPA ASC 2023: 125-130 - [c215]Baveet Singh Hora, Krishna Parmar, Shrey Machhar, Hemant A. Patil, Kiran Praveen, Balaji Radhakrishnan:
Exploring Residual Cepstral Features for Spoken Language Identification. APSIPA ASC 2023: 131-138 - [c214]S. Uthiraa, Hemant A. Patil:
Analysis of Emotions in Speech using AESDD. APSIPA ASC 2023: 1036-1041 - [c213]Priyanka Gupta, Aastha Kachhi, Hemant A. Patil:
Classification of Normal vs. Pathological Infant Cries Using Morse Wavelets. APSIPA ASC 2023: 1310-1316 - [c212]Siddharth Rathod, Priyanka Gupta, Aastha Kachhi, Hemant A. Patil:
Cochlear Filter-Based Cepstral Features for Dysarthric Severity-Level Classification. EUSIPCO 2023: 1095-1099 - [c211]Hastin Modi, Maitreya Patel, Hemant A. Patil:
Attentions for Short Duration Speech Classification. EUSIPCO 2023: 1340-1344 - [c210]Siddharth Rathod, Monil Charola, Akshat Vora, Yash Jogi, Hemant A. Patil:
Whisper Features for Dysarthric Severity-Level Classification. INTERSPEECH 2023: 1523-1527 - [c209]Monil Charola, Aastha Kachhi, Hemant A. Patil:
Whisper Encoder features for Infant Cry Classification. INTERSPEECH 2023: 1773-1777 - [c208]S. Uthiraa, Aditya Pusuluri, Hemant A. Patil:
Modified Group Delay Features for Emotion Recognition. PReMI 2023: 321-330 - [c207]Siddharth Rathod, Monil Charola, Hemant A. Patil:
Noise Robust Whisper Features for Dysarthric Severity-Level Classification. PReMI 2023: 708-715 - [c206]Krishna Parmar, Baveet Singh Hora, Shrey Machhar, Hemant A. Patil, Kiran Praveen, Balaji Radhakrishnan:
Spoken Language Identification Using Linear Frequency Residual Cepstral Coefficients. PReMI 2023: 724-733 - [c205]Baveet Singh Hora, S. Uthiraa, Hemant A. Patil:
Linear Frequency Residual Cepstral Coefficients for Speech Emotion Recognition. SPECOM (1) 2023: 116-129 - [c204]Kirtana Sunil Phatnani, Hemant A. Patil:
Quantifying the Emotional Landscape of Music with Three Dimensions. SPECOM (2) 2023: 283-294 - [c203]S. Uthiraa, Hemant A. Patil:
Analysis of Mandarin vs English Language for Emotional Voice Conversion. SPECOM (2) 2023: 295-306 - [c202]Priyanka Gupta, Rajul Acharya, Ankur T. Patil, Hemant A. Patil:
On the Asymptotic Behaviour of the Speech Signal. SPECOM (2) 2023: 335-343 - [c201]Aditya Pusuluri, Aastha Kachhi, Hemant A. Patil:
Constant-Q Based Harmonic and Pitch Features for Normal vs. Pathological Infant Cry Classification. SPECOM (2) 2023: 407-420 - [c200]Monil Charola, Siddharth Rathod, Hemant A. Patil:
Robustness of Whisper Features for Infant Cry Classification. SPECOM (2) 2023: 421-433 - [c199]S. Uthiraa, Aastha Kachhi, Hemant A. Patil:
Linear Frequency Residual Features for Infant Cry Classification. SPECOM (1) 2023: 550-561 - [c198]Siddharth Rathod, Monil Charola, Hemant A. Patil:
Transfer Learning Using Whisper for Dysarthric Automatic Speech Recognition. SPECOM (1) 2023: 579-589 - 2022
- [j26]Ankur T. Patil, Rajul Acharya, Hemant A. Patil, Rodrigo Capobianco Guido:
Improving the potential of Enhanced Teager Energy Cepstral Coefficients (ETECC) for replay attack detection. Comput. Speech Lang. 72: 101281 (2022) - [j25]Ankur T. Patil, Hemant A. Patil, Kuldeep Khoria:
Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection. Comput. Speech Lang. 72: 101301 (2022) - [j24]Gauri P. Prajapati, Dipesh K. Singh, Preet P. Amin, Hemant A. Patil:
Voice privacy using CycleGAN and time-scale modification. Comput. Speech Lang. 74: 101353 (2022) - [j23]Kirtana Sunil Phatnani, Hemant A. Patil:
Music footprint recognition via sentiment, identity, and setting identification. Multim. Tools Appl. 81(16): 22247-22262 (2022) - [c197]Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Morlet Wavelet-Based Voice Liveness Detection using Convolutional Neural Network. EUSIPCO 2022: 100-104 - [c196]Ankur T. Patil, Kuldeep Khoria, Hemant A. Patil:
Voice Liveness Detection using Constant-Q Transform-Based Features. EUSIPCO 2022: 110-114 - [c195]Priyanka Gupta, Hemant A. Patil:
Linear Frequency Residual Cepstral Features for Replay Spoof Detection on ASVSpoof 2019. EUSIPCO 2022: 349-353 - [c194]Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Energy Separation Based Instantaneous Frequency Estimation from Quadrature and In-Phase Components for Replay Spoof Detection. EUSIPCO 2022: 369-373 - [c193]Hemant A. Patil, Rajul Acharya, Ankur T. Patil, Priyanka Gupta:
Non-Cepstral Uncertainty Vector for Replay Spoofed Speech Detection. EUSIPCO 2022: 374-378 - [c192]Aastha Kachhi, Priyanka Gupta, Hemant A. Patil:
Features Motivated From Uncertainty Principle for Classification of Normal vs. Pathological Infant Cry. EUSIPCO 2022: 1253-1257 - [c191]Ankur T. Patil, Aastha Kachhi, Hemant A. Patil:
Subband Teager Energy Representations for Infant Cry Analysis and Classification. EUSIPCO 2022: 1313-1317 - [c190]Hemant A. Patil, Ankur T. Patil, Aastha Kachhi:
Constant Q Cepstral coefficients for classification of normal vs. Pathological infant cry. ICASSP 2022: 7392-7396 - [c189]Madhu R. Kamble, Hemant A. Patil:
The Impact of Room Acoustics on Replay Speech Signal. ISCSLP 2022: 105-109 - [c188]Priyanka Gupta, Hemant A. Patil:
Effect of Speaker-Microphone Proximity on Pop Noise: Continuous Wavelet Transform-Based Approach. ISCSLP 2022: 110-114 - [c187]Aastha Kachhi, Shreya S. Chaturvedi, Hemant A. Patil, Dipesh K. Singh:
Data Augmentation for Infant Cry Classification. ISCSLP 2022: 433-437 - [c186]Anand Therattil, Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Teager Energy Based-Detection of One-point and Two-point Replay Attacks: Towards Cross-Database Generalization. Odyssey 2022: 47-54 - [c185]Shreya S. Chaturvedi, Hardik B. Sailor, Hemant A. Patil:
Noisy Student Teacher Training with Self Supervised Learning for Children ASR. SPCOM 2022: 1-5 - [c184]Piyushkumar K. Chodingala, Shreya S. Chaturvedi, Ankur T. Patil, Hemant A. Patil:
Robustness of DAS Beamformer Over MVDR for Replay Attack Detection On Voice Assistants. SPCOM 2022: 1-5 - [c183]Priyanka Gupta, Piyushkumar K. Chodingala, Hemant A. Patil:
Morse Wavelet Features for Pop Noise Detection. SPCOM 2022: 1-5 - [c182]Gauri P. Prajapati, Dipesh K. Singh, Hemant A. Patil:
Significance of Distance Measures for Speaker Anonymization. SPCOM 2022: 1-5 - [c181]Priyanka Gupta, Hemant A. Patil:
Significance of Distance on Pop Noise for Voice Liveness Detection. SPECOM 2022: 226-237 - [c180]Aastha Kachhi, Anand Therattil, Priyanka Gupta, Hemant A. Patil:
Continuous Wavelet Transform for Severity-Level Classification of Dysarthria. SPECOM 2022: 312-324 - [c179]Aastha Kachhi, Anand Therattil, Ankur T. Patil, Hardik B. Sailor, Hemant A. Patil:
Significance of Energy Features for Severity Classification of Dysarthria. SPECOM 2022: 325-337 - [c178]Aditya Pusuluri, Aastha Kachhi, Hemant A. Patil:
Analysis of Time-Averaged Feature Extraction Techniques on Infant Cry Classification. SPECOM 2022: 590-603 - 2021
- [j22]Madhu R. Kamble, Hemant A. Patil:
Detection of replay spoof speech using teager energy feature cues. Comput. Speech Lang. 65: 101140 (2021) - [j21]Nirmalya Sen, Md. Sahidullah, Hemant A. Patil, Shyamal Kumar Das Mandal, Krothapalli Sreenivasa Rao, Tapan Kumar Basu:
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework. Int. J. Speech Technol. 24(4): 1067-1088 (2021) - [j20]Siddhant Gupta, Ankur T. Patil, Mirali Purohit, Mihir Parmar, Maitreya Patel, Hemant A. Patil, Rodrigo Capobianco Guido:
Residual Neural Network precisely quantifies dysarthria severity-level based on short-duration speech segments. Neural Networks 139: 105-117 (2021) - [j19]Meet H. Soni, Hemant A. Patil:
Non-intrusive quality assessment of noise-suppressed speech using unsupervised deep features. Speech Commun. 130: 27-44 (2021) - [c177]Madhu R. Kamble, Shekhar Nayak, M. Ali Basha Shaik, Shakti P. Rath, Vikram Vij, Hemant A. Patil:
Teager Energy Subband Filtered Features for Near and Far-Field Automatic Speech Recognition. APSIPA ASC 2021: 491-496 - [c176]Siddhant Gupta, Kuldeep Khoria, Ankur T. Patil, Hemant A. Patil:
Deep Convolutional Neural Network for Voice Liveness Detection. APSIPA ASC 2021: 775-779 - [c175]Nirmesh J. Shah, M. Ali Basha Shaik, P. Periyasamy, Hemant A. Patil, Vikram Vij:
Exploiting Phase-based Features for Whisper vs. Speech Classification. EUSIPCO 2021: 21-25 - [c174]Kuldeep Khoria, Ankur T. Patil, Hemant A. Patil:
Significance of Constant-Q Transform for Voice Liveness Detection. EUSIPCO 2021: 126-130 - [c173]Shrishti Singh, Kuldeep Khoria, Hemant A. Patil:
Modified Group Delay Cepstral Coefficients for Voice Liveness Detection. EUSIPCO 2021: 146-150 - [c172]Dipesh K. Singh, Preet P. Amin, Hardik B. Sailor, Hemant A. Patil:
Data Augmentation Using CycleGAN for End-to-End Children ASR. EUSIPCO 2021: 511-515 - [c171]Rajul Acharya, Harsh Kotta, Ankur T. Patil, Hemant A. Patil:
Cross-Teager Energy Cepstral Coefficients for Replay Spoof Detection on Voice Assistants. ICASSP 2021: 6364-6368 - [c170]Gauri P. Prajapati, Dipesh K. Singh, Preet P. Amin, Hemant A. Patil:
Voice Privacy Through x-Vector and CycleGAN-Based Anonymization. Interspeech 2021: 1684-1688 - [c169]Gauri P. Prajapati, Dipesh K. Singh, Hemant A. Patil:
Voice Privacy Through Time-Scale and Pitch Modification. PReMI 2021: 72-80 - [c168]Priyanka Gupta, Siddhant Gupta, Hemant A. Patil:
Voice Liveness Detection Using Bump Wavelet with CNN. PReMI 2021: 91-98 - [c167]Ankur T. Patil, Harsh Kotta, Rajul Acharya, Hemant A. Patil:
Spectral Root Features for Replay Spoof Detection in Voice Assistants. SPECOM 2021: 504-515 - [c166]Shrishti Singh, Kuldeep Khoria, Hemant A. Patil:
Modified Group Delay Function Using Different Spectral Smoothing Techniques for Voice Liveness Detection. SPECOM 2021: 649-659 - [i2]Nirmalya Sen, Md. Sahidullah, Hemant A. Patil, Shyamal Kumar Das Mandal, Krothapalli Sreenivasa Rao, Tapan Kumar Basu:
Utterance partitioning for speaker recognition: an experimental review and analysis with new findings under GMM-SVM framework. CoRR abs/2105.11728 (2021) - 2020
- [j18]Madhu R. Kamble, Hemlata Tak, Hemant A. Patil:
Amplitude and Frequency Modulation-based features for detection of replay Spoof Speech. Speech Commun. 125: 114-127 (2020) - [j17]Madhu R. Kamble, Hemant A. Patil:
Combination of Amplitude and Frequency Modulation Features for Presentation Attack Detection. J. Signal Process. Syst. 92(8): 777-791 (2020) - [c165]Kirtana Sunil Phatnani, Hemant A. Patil:
Symmetry In The Structure Of Musical Nodes. APSIPA 2020: 353-358 - [c164]Ankur T. Patil, Hemant A. Patil:
Significance of CMVN for Replay Spoof Detection. APSIPA 2020: 532-537 - [c163]Harsh Kotta, Ankur T. Patil, Rajul Acharya, Hemant A. Patil:
Subband Channel Selection using TEO for Replay Spoof Detection in Voice Assistants. APSIPA 2020: 538-542 - [c162]Priyanka Gupta, Gauri P. Prajapati, Shrishti Singh, Madhu R. Kamble, Hemant A. Patil:
Design of Voice Privacy System using Linear Prediction. APSIPA 2020: 543-549 - [c161]Neil Shah, Sreeraj R, Maulik C. Madhavi, Nirmesh J. Shah, Hemant A. Patil:
Query-By-Example Spoken Term Detection Using Generative Adversarial Network. APSIPA 2020: 644-648 - [c160]Kuldeep Khoria, Madhu R. Kamble, Hemant A. Patil:
Teager Energy Cepstral Coefficients for Classification of Normal vs. Whisper Speech. EUSIPCO 2020: 1-5 - [c159]Mirali Purohit, Mihir Parmar, Maitreya Patel, Harshit Malaviya, Hemant A. Patil:
Weak Speech Supervision: A case study of Dysarthria Severity Classification. EUSIPCO 2020: 101-105 - [c158]Gauri P. Prajapati, Madhu R. Kamble, Hemant A. Patil:
Energy Separation Based Features for Replay Spoof Detection for Voice Assistant. EUSIPCO 2020: 386-390 - [c157]Maitreya Patel, Mirali Purohit, Jui Shah, Hemant A. Patil:
CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion. EUSIPCO 2020: 411-415 - [c156]Harshit Malaviya, Jui Shah, Maitreya Patel, Jalansh Munshi, Hemant A. Patil:
Mspec-Net : Multi-Domain Speech Conversion Network. ICASSP 2020: 7764-7768 - [c155]Madhu R. Kamble, Hemant A. Patil:
Novel Variable Length Teager Energy Profiles for Replay Spoof Detection. Odyssey 2020: 143-150 - [c154]Madhu R. Kamble, Aditya Krishna Sai Pulikonda, Maddala Venkata Siva Krishna, Hemant A. Patil:
Analysis of Teager Energy Profiles for Spoof Speech Detection. Odyssey 2020: 304-311 - [c153]Mirali Purohit, Maitreya Patel, Harshit Malaviya, Ankur T. Patil, Mihir Parmar, Nirmesh J. Shah, Savan Doshi, Hemant A. Patil:
Intelligibility Improvement of Dysarthric Speech using MMSE DiscoGAN. SPCOM 2020: 1-5 - [c152]Divyesh G. Rajpura, Jui Shah, Maitreya Patel, Harshit Malaviya, Kirtana Phatnani, Hemant A. Patil:
Effectiveness of Transfer Learning on Singing Voice Conversion in the Presence of Background Music. SPCOM 2020: 1-5 - [i1]Maitreya Patel, Mirali Purohit, Jui Shah, Hemant A. Patil:
CinC-GAN for Effective F0 prediction for Whisper-to-Normal Speech Conversion. CoRR abs/2008.07788 (2020)
2010 – 2019
- 2019
- [j16]Nirmesh J. Shah, Hemant A. Patil:
A novel approach to remove outliers for parallel voice conversion. Comput. Speech Lang. 58: 127-152 (2019) - [j15]Maulik C. Madhavi, Hemant A. Patil:
Vocal Tract Length Normalization using a Gaussian mixture model framework for query-by-example spoken term detection. Comput. Speech Lang. 58: 175-202 (2019) - [c151]Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Hemant A. Patil:
Novel Adaptive Generative Adversarial Network for Voice Conversion. APSIPA 2019: 1273-1281 - [c150]Madhu R. Kamble, Aditya Krishna Sai Pulikonda, Maddala Venkata Siva Krishna, Ankur T. Patil, Rajul Acharya, Hemant A. Patil:
Speech Demodulation-based Techniques for Replay and Presentation Attack Detection. APSIPA 2019: 1545-1550 - [c149]Rajul Acharya, Hemant A. Patil, Harsh Kotta:
Novel Enhanced Teager Energy Based Cepstral Coefficients for Replay Spoof Detection. ASRU 2019: 342-349 - [c148]Mihir Parmar, Savan Doshi, Nirmesh J. Shah, Maitreya Patel, Hemant A. Patil:
Effectiveness of Cross-Domain Architectures for Whisper-to-Normal Speech Conversion. EUSIPCO 2019: 1-5 - [c147]Hemant A. Patil:
Combining Evidences from Variable Teager Energy Source and Mel Cepstral Features for Classification of Normal vs. Pathological Voices. EUSIPCO 2019: 1-5 - [c146]Hemant A. Patil, Srikant Viswanath:
Energy Separation Algorithm Based Spectrum Estimation for Very Short Duration of Speech. EUSIPCO 2019: 1-5 - [c145]Madhu R. Kamble, Hemant A. Patil:
Analysis of Reverberation via Teager Energy Features for Replay Spoof Speech Detection. ICASSP 2019: 2607-2611 - [c144]Nirmesh J. Shah, Hemant A. Patil:
Novel Metric Learning for Non-parallel Voice Conversion. ICASSP 2019: 3722-3726 - [c143]Nirmesh J. Shah, Hemant A. Patil:
Phone Aware Nearest Neighbor Technique Using Spectral Transition Measure for Non-Parallel Voice Conversion. INTERSPEECH 2019: 639-643 - [c142]Nirmesh J. Shah, Hardik B. Sailor, Hemant A. Patil:
Whether to Pretrain DNN or not?: An Empirical Analysis for Voice Conversion. INTERSPEECH 2019: 1586-1590 - [c141]Ankur T. Patil, Rajul Acharya, Pulikonda Krishna Aditya Sai, Hemant A. Patil:
Energy Separation-Based Instantaneous Frequency Estimation for Cochlear Cepstral Feature for Replay Spoof Detection. INTERSPEECH 2019: 2898-2902 - [c140]Madhu R. Kamble, Maddala Venkata Siva Krishna, Aditya Krishna Sai Pulikonda, Hemant A. Patil:
Novel Teager Energy Based Subband Features for Audio Acoustic Scene Detection and Classification. PReMI (2) 2019: 436-444 - [c139]Maitreya Patel, Mihir Parmar, Savan Doshi, Nirmesh Shah, Hemant A. Patil:
Novel Inception-GAN for Whispered-to-Normal Speech Conversion. SSW 2019: 87-92 - 2018
- [j14]Maulik C. Madhavi, Hemant A. Patil:
Design of mixture of GMMs for Query-by-Example Spoken Term Detection. Comput. Speech Lang. 52: 41-55 (2018) - [j13]Hemant A. Patil, Maulik C. Madhavi:
Combining evidences from magnitude and phase information using VTEO for person recognition using humming. Comput. Speech Lang. 52: 225-256 (2018) - [j12]Anshu Chittora, Hemant A. Patil:
Significance of Higher-Order Spectral Analysis in Infant Cry Classification. Circuits Syst. Signal Process. 37(1): 232-254 (2018) - [c138]Prasad A. Tapkir, Madhu R. Kamble, Hemant A. Patil, Maulik C. Madhavi:
Replay Spoof Detection using Power Function Based Features. APSIPA 2018: 1019-1023 - [c137]Hemant A. Patil, Madhu R. Kamble:
A Survey on Replay Attack Detection for Automatic Speaker Verification (ASV) System. APSIPA 2018: 1047-1053 - [c136]Neil Shah, Hemant A. Patil, Meet H. Soni:
Time-Frequency Mask-based Speech Enhancement using Convolutional Generative Adversarial Network. APSIPA 2018: 1246-1251 - [c135]Nirmesh J. Shah, R. Sreeraj, Neil Shah, Hemant A. Patil:
Novel Inter Mixture Weighted GMM Posteriorgram for DNN and GAN-based Voice Conversion. APSIPA 2018: 1776-1781 - [c134]Prasad A. Tapkir, Ankur T. Patil, Neil Shah, Hemant A. Patil:
Novel Spectral Root Cepstral Features for Replay Spoof Detection. APSIPA 2018: 1945-1950 - [c133]Prasad A. Tapkir, Hemant A. Patil:
Significance of Teager Energy Operator Phase for Replay Spoof Detection. APSIPA 2018: 1951-1956 - [c132]Meet H. Soni, Neil Shah, Hemant A. Patil:
Time-Frequency Masking-Based Speech Enhancement Using Generative Adversarial Network. ICASSP 2018: 5039-5043 - [c131]Madhu R. Kamble, Hemlata Tak, Hemant A. Patil:
Effectiveness of Speech Demodulation-Based Features for Replay Detection. INTERSPEECH 2018: 641-645 - [c130]Madhu R. Kamble, Hemant A. Patil:
Novel Variable Length Energy Separation Algorithm Using Instantaneous Amplitude Features for Replay Detection. INTERSPEECH 2018: 646-650 - [c129]Hardik B. Sailor, Madhu R. Kamble, Hemant A. Patil:
Auditory Filterbank Learning for Temporal Modulation Features in Replay Spoof Speech Detection. INTERSPEECH 2018: 666-670 - [c128]Hardik B. Sailor, Hemant A. Patil:
Auditory Filterbank Learning Using ConvRBM for Infant Cry Classification. INTERSPEECH 2018: 706-710 - [c127]Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Dynamic Features in INCA and Temporal Context-INCA. INTERSPEECH 2018: 711-715 - [c126]Prasad Tapkir, Hemant A. Patil:
Novel Empirical Mode Decomposition Cepstral Features for Replay Spoof Detection. INTERSPEECH 2018: 721-725 - [c125]Hemlata Tak, Hemant A. Patil:
Novel Linear Frequency Residual Cepstral Features for Replay Attack Detection. INTERSPEECH 2018: 726-730 - [c124]Nirmesh J. Shah, Maulik C. Madhavi, Hemant A. Patil:
Unsupervised Vocal Tract Length Warped Posterior Features for Non-Parallel Voice Conversion. INTERSPEECH 2018: 1968-1972 - [c123]Neil Shah, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of Generative Adversarial Network for Non-Audible Murmur-to-Whisper Speech Conversion. INTERSPEECH 2018: 3157-3161 - [c122]Hardik B. Sailor, Maddala Venkata Siva Krishna, Diksha Chhabra, Ankur T. Patil, Madhu R. Kamble, Hemant A. Patil:
DA-IICT/IIITV System for Low Resource Speech Recognition Challenge 2018. INTERSPEECH 2018: 3187-3191 - [c121]Srinivas Kantheti, Rohan Kumar Das, Hemant A. Patil:
Combining Phase-based Features for Replay Spoof Detection System. ISCSLP 2018: 151-155 - [c120]Madhu R. Kamble, Hemant A. Patil:
Novel Amplitude Weighted Frequency Modulation Features for Replay Spoof Detection. ISCSLP 2018: 185-189 - [c119]Madhu R. Kamble, Hemlata Tak, Maddala Venkata Siva Krishna, Hemant A. Patil:
Novel Demodulation-Based Features using Classifier-level Fusion of GMM and CNN for Replay Detection. ISCSLP 2018: 334-338 - [c118]Hardik B. Sailor, Ankur T. Patil, Hemant A. Patil:
Advances in Low Resource ASR: A Deep Learning Perspective. SLTU 2018: 15-19 - [c117]Srinivas Kantheti, Hemant A. Patil:
Relative Phase Shift Features for Replay Spoof Detection System. SLTU 2018: 98-102 - [c116]Hardik B. Sailor, Hemant A. Patil:
Neural Networks-based Automatic Speech Recognition for Agricultural Commodity in Gujarati Language. SLTU 2018: 162-166 - [c115]Ami Gandhi, Hemant A. Patil:
Feature Extraction from Temporal Phase for Speaker Recognition. SPCOM 2018: 382-386 - 2017
- [j11]Maulik C. Madhavi, Hemant A. Patil:
Partial matching and search space reduction for QbE-STD. Comput. Speech Lang. 45: 58-82 (2017) - [j10]Tanvina B. Patel, Hemant A. Patil:
Cochlear Filter and Instantaneous Frequency Based Features for Spoofed Speech Detection. IEEE J. Sel. Top. Signal Process. 11(4): 618-631 (2017) - [j9]Tanvina B. Patel, Hemant A. Patil:
Significance of Source-Filter Interaction for Classification of Natural vs. Spoofed Speech. IEEE J. Sel. Top. Signal Process. 11(4): 644-659 (2017) - [c114]Nirmesh J. Shah, Hemant A. Patil:
On the convergence of INCA algorithm. APSIPA 2017: 559-562 - [c113]Maulik C. Madhavi, Hemant A. Patil:
Combining evidences from detection sources for query-by-example spoken term detection. APSIPA 2017: 563-568 - [c112]Nirmesh J. Shah, Pramod B. Bachhav, Hemant A. Patil:
A novel filtering-based F0 estimation algorithm with an application to voice conversion. APSIPA 2017: 1528-1531 - [c111]Madhu R. Kamble, Hemant A. Patil:
Novel energy separation based instantaneous frequency features for spoof speech detection. EUSIPCO 2017: 106-110 - [c110]Maulik C. Madhavi, Hemant A. Patil:
VTLN-warped Gaussian posteriorgram for QbE-STD. EUSIPCO 2017: 563-567 - [c109]Meet H. Soni, Hemant A. Patil:
Effectiveness of ideal ratio mask for non-intrusive quality assessment of noise suppressed speech. EUSIPCO 2017: 573-577 - [c108]Dharmesh M. Agrawal, Hardik B. Sailor, Meet H. Soni, Hemant A. Patil:
Novel TEO-based Gammatone features for environmental sound classification. EUSIPCO 2017: 1809-1813 - [c107]Madhu R. Kamble, Hemant A. Patil:
Novel Energy Separation Based Frequency Modulation Features for Spoofed Speech Classification. ICAPR 2017: 1-6 - [c106]Maulik C. Madhavi, Hemant A. Patil:
Two Stage Zero-resource Approaches for QbE-STD. ICAPR 2017: 1-5 - [c105]Hardik B. Sailor, Hemant A. Patil, Avni Rajpal:
Unsupervised Filterbank Learning for Speech-based Access System for Agricultural Commodity. ICAPR 2017: 1-6 - [c104]Meet H. Soni, Manisha Sharma, Hardik B. Sailor, Hemant A. Patil:
Sub-band Autoencoder features for Automatic Speech Recognition. ICAPR 2017: 1-5 - [c103]Avni Rajpal, Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil:
Quality assessment of voice converted speech using articulatory features. ICASSP 2017: 5515-5519 - [c102]Nirmesh J. Shah, Hemant A. Patil:
Novel Amplitude Scaling method for bilinear frequency Warping-based Voice Conversion. ICASSP 2017: 5520-5524 - [c101]Hemant A. Patil, Madhu R. Kamble, Tanvina B. Patel, Meet H. Soni:
Novel Variable Length Teager Energy Separation Based Instantaneous Frequency Features for Replay Detection. INTERSPEECH 2017: 12-16 - [c100]Hardik B. Sailor, Madhu R. Kamble, Hemant A. Patil:
Unsupervised Representation Learning Using Convolutional Restricted Boltzmann Machine for Spoof Speech Detection. INTERSPEECH 2017: 2601-2605 - [c99]Hardik B. Sailor, Dharmesh M. Agrawal, Hemant A. Patil:
Unsupervised Filterbank Learning Using Convolutional Restricted Boltzmann Machine for Environmental Sound Classification. INTERSPEECH 2017: 3107-3111 - [c98]Meet H. Soni, Rishabh Tak, Hemant A. Patil:
Novel Shifted Real Spectrum for Exact Signal Reconstruction. INTERSPEECH 2017: 3112-3116 - [c97]Nirmesh J. Shah, Hemant A. Patil:
Analysis of Features and Metrics for Alignment in Text-Dependent Voice Conversion. PReMI 2017: 299-307 - [c96]Madhu R. Kamble, Hemant A. Patil:
Effectiveness of Mel Scale-Based ESA-IFCC Features for Classification of Natural vs. Spoofed Speech. PReMI 2017: 308-316 - [c95]Rishabh N. Tak, Dharmesh M. Agrawal, Hemant A. Patil:
Novel Phase Encoded Mel Filterbank Energies for Environmental Sound Classification. PReMI 2017: 317-325 - [c94]Maulik C. Madhavi, Hemant A. Patil, Nikhil Bhendawade:
Spoken Keyword Retrieval Using Source and System Features. PReMI 2017: 333-341 - [c93]Ankit Nagpal, Hemant A. Patil:
Novel Gammatone Filterbank Based Spectro-Temporal Features for Robust Phoneme Recognition. PReMI 2017: 342-350 - [c92]Purvi Agrawal, Hemant A. Patil:
Fusion of a Novel Volterra-Wiener Filter Based Nonlinear Residual Phase and MFCC for Speaker Verification. SPECOM 2017: 389-397 - [c91]Ami Gandhi, Hemant A. Patil:
Novel Linear Prediction Temporal Phase Based Features for Speaker Recognition. SPECOM 2017: 564-571 - [c90]Apeksha J. Naik, Rishabh Tak, Hemant A. Patil:
Novel Phase Encoded Mel Cepstral Features for Speaker Verification. SPECOM 2017: 572-581 - 2016
- [j8]Anshu Chittora, Hemant A. Patil:
Spectral analysis of infant cries and adult speech. Int. J. Speech Technol. 19(4): 841-856 (2016) - [j7]Anshu Chittora, Hemant A. Patil:
Newborn infant's cry analysis. Int. J. Speech Technol. 19(4): 919-928 (2016) - [j6]Hardik B. Sailor, Hemant A. Patil:
Novel Unsupervised Auditory Filterbank Learning Using Convolutional RBM for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2341-2353 (2016) - [c89]Hardik B. Sailor, Hemant A. Patil:
Unsupervised learning of temporal receptive fields using convolutional RBM for ASR task. EUSIPCO 2016: 873-877 - [c88]Meet H. Soni, Hemant A. Patil:
Novel deep autoencoder features for non-intrusive speech quality assessment. EUSIPCO 2016: 2315-2319 - [c87]Tanvina B. Patel, Hemant A. Patil:
Effectiveness of fundamental frequency (F0) and strength of excitation (SOE) for spoofed speech detection. ICASSP 2016: 5105-5109 - [c86]Tanvina B. Patel, Hemant A. Patil:
Analysis of natural and synthetic speech using Fujisaki model. ICASSP 2016: 5250-5254 - [c85]Hardik B. Sailor, Hemant A. Patil:
Filterbank learning using Convolutional Restricted Boltzmann Machine for speech recognition. ICASSP 2016: 5895-5899 - [c84]Himanshu N. Bhavsar, Tanvina B. Patel, Hemant A. Patil:
Novel Nonlinear Prediction Based Features for Spoofed Speech Detection. INTERSPEECH 2016: 155-159 - [c83]Meet H. Soni, Tanvina B. Patel, Hemant A. Patil:
Novel Subband Autoencoder Features for Detection of Spoofed Speech. INTERSPEECH 2016: 1820-1824 - [c82]Avni Rajpal, Tanvina B. Patel, Hardik B. Sailor, Maulik C. Madhavi, Hemant A. Patil, Hiroya Fujisaki:
Native Language Identification Using Spectral and Source-Based Features. INTERSPEECH 2016: 2383-2387 - [c81]Hardik B. Sailor, Hemant A. Patil:
Unsupervised Deep Auditory Model Using Stack of Convolutional RBMs for Speech Recognition. INTERSPEECH 2016: 3379-3383 - [c80]Meet H. Soni, Hemant A. Patil:
Novel Subband Autoencoder Features for Non-Intrusive Quality Assessment of Noise Suppressed Speech. INTERSPEECH 2016: 3708-3712 - [c79]Deep Gandhi, Tanvina B. Patel, Hemant A. Patil:
A novel lowpass filtering-based approach for estimating strength of excitation from speech signal. SPCOM 2016: 1-5 - [c78]Maulik C. Madhavi, Hemant A. Patil:
Modification in sequential dynamic time warping for fast computation of query-by-example spoken term detection task. SPCOM 2016: 1-5 - [c77]Mohammadi Zaki, Hardik B. Sailor, Hemant A. Patil:
Analysis of hierarchical bottleneck framework for improved phoneme recognition. SPCOM 2016: 1-5 - [c76]Avni Rajpal, Hemant A. Patil:
Jerk Minimization for Acoustic-To-Articulatory Inversion. SSW 2016: 82-87 - [c75]Meet H. Soni, Hemant A. Patil:
Non-intrusive Quality Assessment of Synthesized Speech using Spectral Features and Support Vector Regression. SSW 2016: 127-133 - [c74]Sushant V. Rao, Nirmesh J. Shah, Hemant A. Patil:
Novel Pre-processing using Outlier Removal in Voice Conversion. SSW 2016: 134-139 - 2015
- [c73]Maulik C. Madhavi, Hemant A. Patil, Bhavik B. Vachhani:
Spectral transition measure for detection of obstruents. EUSIPCO 2015: 330-334 - [c72]Anshu Chittora, Hemant A. Patil:
Classification of normal and pathological infant cries using bispectrum features. EUSIPCO 2015: 639-643 - [c71]Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of multiscale fractal dimension for improvement of frame classification rate. EUSIPCO 2015: 1018-1022 - [c70]Pramod B. Bachhav, Hemant A. Patil, Tanvina B. Patel:
A novel filtering based approach for epoch extraction. ICASSP 2015: 4784-4788 - [c69]Tanvina B. Patel, Hemant A. Patil:
Combining evidences from mel cepstral, cochlear filter cepstral and instantaneous frequency features for detection of natural vs. spoofed speech. INTERSPEECH 2015: 2062-2066 - [c68]Hardik B. Sailor, Maulik C. Madhavi, Hemant A. Patil:
Significance of Phase-based Features for Person Recognition Using Humming. PerMIn 2015: 99-103 - [c67]Shubham Sharma, Hemant A. Patil:
Combining Evidences from Bark Scale and Mel Scale Warped Features for VTLN. PerMIn 2015: 133-136 - [c66]Anshu Chittora, Hemant A. Patil, Kewal D. Malde:
Classification of Stop Consonants using Modulation Spectrogram-Based Features. PerMIn 2015: 145-150 - [c65]Purvi Agrawal, Hemant A. Patil:
Fusion of TEO Phase with MFCC Features for Speaker Verification. PerMIn 2015: 161-166 - [c64]Anshu Chittora, Hemant A. Patil:
Significance of Unvoiced Segments and Fundamental Frequency in Infant Cry Analysis. TSD 2015: 273-281 - [c63]Maulik C. Madhavi, Shubham Sharma, Hemant A. Patil:
Vocal Tract Length Normalization Features for Audio Search. TSD 2015: 387-395 - [c62]Aditya Raikar, Ami Gandhi, Hemant A. Patil:
Combining Evidences from Mel Cepstral and Cochlear Cepstral Features for Speaker Recognition Using Whispered Speech. TSD 2015: 405-413 - [c61]Anshu Chittora, Hemant A. Patil:
Modified Group Delay Based Features for Asthma and HIE Infant Cries Classification. TSD 2015: 595-602 - 2014
- [c60]Kishore Prahallad, Anandaswarup Vadapalli, Santosh Kesiraju, Hema A. Murthy, Swaran Lata, T. Nagarajan, S. R. Mahadeva Prasanna, Hemant A. Patil, Anil Kumar Sao, Simon King, Alan W. Black, Keiichi Tokuda:
The Blizzard Challenge 2014. Blizzard Challenge 2014 - [c59]Anshu Chittora, Hemant A. Patil:
Classification of phonemes using modulation spectrogram based features for Gujarati language. IALP 2014: 46-49 - [c58]Bhavik B. Vachhani, Kewal D. Malde, Maulik C. Madhavi, Hemant A. Patil:
A spectral transition measure based MELCEPSTRAL features for obstruent detection. IALP 2014: 50-53 - [c57]Shubham Sharma, Maulik C. Madhavi, Hemant A. Patil:
Vocal tract length normalization for vowel recognition in low resource languages. IALP 2014: 54-57 - [c56]Purushotam G. Radadia, Hemant A. Patil:
A Cepstral Mean Subtraction based features for Singer Identification. IALP 2014: 58-61 - [c55]Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of multiscale fractal dimension-based phonetic segmentation in speech synthesis for low resource language. IALP 2014: 103-106 - [c54]Nirmesh J. Shah, Mohammadi Zaki, Hemant A. Patil:
Influence of various asymmetrical contextual factors for TTS in a low resource language. IALP 2014: 107-110 - [c53]Maulik C. Madhavi, Shubham Sharma, Hemant A. Patil:
Development of language resources for speech application in Gujarati and Marathi. IALP 2014: 115-118 - [c52]Hemant A. Patil, S. Adarsa:
Nonlinear analysis of natural vs. HTS-based synthetic speech. IALP 2014: 119-122 - [c51]Anshu Chittora, Hemant A. Patil:
Use of glottal inverse filtering for asthma and HIE infant cries classification. IALP 2014: 158-161 - [c50]Nirmesh J. Shah, Bhavik B. Vachhani, Hardik B. Sailor, Hemant A. Patil:
Effectiveness of PLP-based phonetic segmentation for speech synthesis. ICASSP 2014: 270-274 - [c49]Hemant A. Patil, Tanvina B. Patel:
Chaotic mixed excitation source for speech synthesis. INTERSPEECH 2014: 785-789 - [c48]Tanvina B. Patel, Hemant A. Patil:
Novel approach for estimating length of the vocal folds using Fujisaki model. ISCSLP 2014: 308-312 - [c47]Mohammadi Zaki, Nirmesh J. Shah, Hemant A. Patil:
Effectiveness of fractal dimension for ASR in low resource language. ISCSLP 2014: 464-468 - [c46]Hardik B. Sailor, Hemant A. Patil:
Fusion of magnitude and phase-based features for objective evaluation of TTS voice. ISCSLP 2014: 521-525 - [c45]Nirmesh J. Shah, Hemant A. Patil, Maulik C. Madhavi, Hardik B. Sailor, Tanvina B. Patel:
Deterministic annealing EM algorithm for developing TTS system in Gujarati. ISCSLP 2014: 526-530 - [c44]Anshu Chittora, Hemant A. Patil:
Classification of pathological infant cries using modulation spectrogram features. ISCSLP 2014: 541-545 - [c43]Ankur G. Undhad, Hemant A. Patil, Maulik C. Madhavi:
Exploiting speech source information for vowel landmark detection for low resource language. ISCSLP 2014: 546-550 - [c42]Maulik C. Madhavi, Hemant A. Patil:
Exploiting Variable length Teager Energy Operator in melcepstral features for person recognition from humming. ISCSLP 2014: 624-628 - [c41]Anshu Chittora, Kewal D. Malde, Hemant A. Patil:
Obstruent classification using modulation spectrogram based features. O-COCOSDA 2014: 1-6 - [c40]Shubham Sharma, Maulik C. Madhavi, Hemant A. Patil:
Development of vocal tract length normalized phonetic engine for Gujarati and Marathi languages. O-COCOSDA 2014: 1-6 - 2013
- [c39]Swati Talesara, Hemant A. Patil, Tanvina B. Patel, Hardik B. Sailor, Nirmesh J. Shah:
A Novel Gaussian Filter-Based Automatic Labeling of Speech Data for TTS System in Gujarati Language. IALP 2013: 139-142 - [c38]Bhavik B. Vachhani, Hemant A. Patil:
Use of PLP Cepstral Features for Phonetic Segmentation. IALP 2013: 143-146 - [c37]Hemant A. Patil, Tanvina B. Patel:
Nonlinear prediction of speech signal using Volterra-Wiener series. INTERSPEECH 2013: 1687-1691 - [c36]Nirmalya Sen, Hemant A. Patil, Shyamal Kr. Das Mandal, K. Sreenivasa Rao:
Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification. MIKE 2013: 780-789 - [c35]Nirav H. Chhayani, Hemant A. Patil:
Development of corpora for person recognition using humming, singing and speech. O-COCOSDA/CASLRE 2013: 1-6 - [c34]Anshu Chittora, Hemant A. Patil:
Data collection and corpus design for analysis of normal and pathological infant cry. O-COCOSDA/CASLRE 2013: 1-6 - [c33]Kewal D. Malde, Bhavik B. Vachhani, Maulik C. Madhavi, Nirav H. Chhayani, Hemant A. Patil:
Development of speech corpora in Gujarati and Marathi for phonetic transcription. O-COCOSDA/CASLRE 2013: 1-6 - [c32]Hemant A. Patil, Tanvina B. Patel, Nirmesh J. Shah, Hardik B. Sailor, Raghava Krishnan, G. R. Kasthuri, T. Nagarajan, S. Lilly Christina, Naresh Kumar, Veera Raghavendra, S. Prahallad Kishore, S. R. Mahadeva Prasanna, Nagaraj Adiga, Sanasam Ranbir Singh, Anand Konjengbam, Pranaw Kumar, Bira Chandra Singh, S. L. Binil Kumar, T. G. Bhadran, T. Sajini, Arup Saha, Tulika Basu, K. Sreenivasa Rao, N. P. Narendra, Anil Kumar Sao, Rakesh Kumar, Pranhari Talukdar, Purnendu Acharyaa, Somnath Chandra, Swaran Lata, Hema A. Murthy:
A syllable-based framework for unit selection synthesis in 13 Indian languages. O-COCOSDA/CASLRE 2013: 1-8 - [c31]Hemant A. Patil, Tanvina B. Patel, Swati Talesara, Nirmesh J. Shah, Hardik B. Sailor, Bhavik B. Vachhani, Janki Akhani, Bhargav Kanakiya, Yashesh Gaur, Vibha Prajapati:
Algorithms for speech segmentation at syllable-level for text-to-speech synthesis system in Gujarati. O-COCOSDA/CASLRE 2013: 1-7 - [c30]Kewal D. Malde, Anshu Chittora, Hemant A. Patil:
Classification of Fricatives Using Novel Modulation Spectrogram Based Features. PReMI 2013: 134-139 - [c29]Yashesh Gaur, Maulik C. Madhavi, Hemant A. Patil:
Speaker Recognition Using Sparse Representation via Superimposed Features. PReMI 2013: 140-147 - 2012
- [j5]Hemant A. Patil, Maulik C. Madhavi, Keshab K. Parhi:
Static and dynamic information derived from source and system features for person recognition from humming. Int. J. Speech Technol. 15(3): 393-406 (2012) - [c28]Hemant A. Patil, Purushotam G. Radadia, T. K. Basu:
Combining Evidences from Mel Cepstral Features and Cepstral Mean Subtracted Features for Singer Identification. IALP 2012: 145-148 - [c27]Hemant A. Patil, Maulik C. Madhavi, Nirav H. Chhayani:
Person Recognition Using Humming, Singing and Speech. IALP 2012: 149-152 - [c26]Hemant A. Patil, Maulik C. Madhavi, Kewal D. Malde, Bhavik B. Vachhani:
Phonetic Transcription of Fricatives and Plosives for Gujarati and Marathi Languages. IALP 2012: 177-180 - [c25]Pallavi N. Baljekar, Hemant A. Patil:
A comparison of waveform fractal dimension techniques for voice pathology classification. ICASSP 2012: 4461-4464 - [c24]Hemant A. Patil, Maulik C. Madhavi:
Significance of magnitude and phase information via VTEO for humming based biometrics. ICB 2012: 372-377 - [c23]Hemant A. Patil, Maulik C. Madhavi, Rahul Jain, Alok K. Jain:
Combining Evidence from Temporal and Spectral Features for Person Recognition Using Humming. PerMIn 2012: 321-328 - [c22]Hemant A. Patil, Parth A. Goswami, Tapan Kumar Basu:
Novel Interleaving Schemes for Speaker Recognition over Lossy Networks. PerMIn 2012: 329-337 - 2011
- [j4]Hemant A. Patil, Viswanath Srikanth:
Effectiveness of Teager energy operator for epoch detection from speech signals. Int. J. Speech Technol. 14(4): 321 (2011) - [c21]Prakhar Kant Jain, Robin Jain, Hemant A. Patil, T. K. Basu:
Design of a Query-by-Humming System for Hindi Songs Using DDTW Based Approach. IALP 2011: 240-243 - [c20]Hemant A. Patil, Pallavi N. Baljekar, T. K. Basu:
Novel Temporal and Spectral Features Derived from TEO for Classification of Normal and Dysphonic Voices. ICFCE 2011: 559-567 - [c19]Hemant A. Patil, Maulik C. Madhavi, Keshab K. Parhi:
Combining Evidence from Spectral and Source-Like Features for Person Recognition from Humming. INTERSPEECH 2011: 369-372 - [c18]Hemant A. Patil, Pallavi N. Baljekar:
Novel VTEO Based Mel Cepstral Features for Classification of Normal and Pathological Voices. INTERSPEECH 2011: 509-512 - 2010
- [c17]Hemant A. Patil, Keshab K. Parhi:
Novel Variable length Teager Energy Based features for person recognition from their hum. ICASSP 2010: 4526-4529
2000 – 2009
- 2009
- [c16]Hemant A. Patil:
Infant Identification from Their Cry. ICAPR 2009: 107-110 - [c15]Hemant A. Patil, Prakhar Kant Jain, Robin Jain:
A Novel Approach to Identification of Speakers from Their Hum. ICAPR 2009: 167-170 - [c14]Hemant A. Patil, T. K. Basu:
A Novel Modified Polynomial Network Design for Dialect Recognition. ICAPR 2009: 175-178 - [c13]Hemant A. Patil, Sunayana Sitaram, Esha Sharma:
DA-IICT Cross-lingual and Multilingual Corpora for Speaker Recognition. ICAPR 2009: 187-190 - [c12]Mayank Mishra, Hemant A. Patil:
Design and Implementation of HMM-VQ based Isolated Digit Recognition System. IICAI 2009: 1754-1763 - [c11]Hemant A. Patil, Keshab K. Parhi:
Variable Length Teager Energy Based Mel Cepstral Features for Identification of Twins. PReMI 2009: 525-530 - 2008
- [j3]Hemant A. Patil, T. K. Basu:
Identifying Perceptually Similar Languages Using Teager Energy Based Cepstrum. Eng. Lett. 16(1): 151-159 (2008) - [j2]Hemant A. Patil, Tapan Kumar Basu:
LP spectra vs. Mel spectra for identification of professional mimics in Indian languages. Int. J. Speech Technol. 11(1): 1-16 (2008) - [j1]Hemant A. Patil, T. K. Basu:
Development of speech corpora for speaker recognition research and evaluation in Indian languages. Int. J. Speech Technol. 11(1): 17-32 (2008) - [c10]Vikrant Tomar, Hemant A. Patil:
On the development of variable length Teager energy operator (VTEO). INTERSPEECH 2008: 1056-1059 - [c9]Hemant A. Patil, Robin Jain, Prakhar Kant Jain:
Identification of Speakers from Their Hum. TSD 2008: 461-468 - [p1]Hemant A. Patil, T. K. Basu:
A Novel Approach to Language Identification Using Modified Polynomial Networks. Speech, Audio, Image and Biomedical Signal Processing using Neural Networks 2008: 117-143 - 2007
- [c8]Hemant A. Patil, T. K. Basu:
Identifying Phonetically Similar Languages Using Teager Energy Based Cepstrum. Artificial Intelligence and Pattern Recognition 2007: 1-8 - [c7]Hemant A. Patil, T. K. Basu:
Advances in Speaker Recognition: A Feature Based Approach. Artificial Intelligence and Pattern Recognition 2007: 528-537 - [c6]Hemant A. Patil, T. K. Basu:
Cepstral Domain Teager Energy for Identifying Perceptually Similar Languages. PReMI 2007: 455-462 - 2006
- [c5]Hemant A. Patil, T. K. Basu:
Design of Cubic Spline Wavelet for Open Set Speaker Classification in Marathi. ISCSLP (Selected Papers) 2006: 126-137 - [c4]Hemant A. Patil, T. K. Basu:
A New Data Fusion Technique and Performance Measure for Identification of Twins in Marathi. ISCSLP 2006 - [c3]Hemant A. Patil, S. Ghosh, A. Si, T. K. Basu:
Design of Cross-lingual and Multilingual Corpora for Speaker Recognition Research and Evaluation in Indian Languages. ISCSLP 2006 - 2005
- [c2]Hemant A. Patil, Pranab Kumar Dutta, T. K. Basu:
The Wavelet Packet Based Cepstral Features for Open Set Speaker Classification in Marathi. GfKl 2005: 134-141 - 2004
- [c1]Hemant A. Patil, T. K. Basu:
The Teager Energy Based Features for Identification of Identical Twins in Multi-lingual Environment. ICONIP 2004: 333-337
last updated on 2025-01-20 22:55 CET by the dblp team
all metadata released as open data under CC0 1.0 license