default search action
Gaël Richard
Person information
- affiliation: Télécom Paris, Paris, France
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j68]Sharon Gannot, Walter Kellermann, Zbynek Koldovský, Shoko Araki, Gaël Richard:
Special Issue on Model-Based and Data-Driven Audio Signal Processing [From the Guest Editors]. IEEE Signal Process. Mag. 41(6): 8-11 (2024) - [j67]Gaël Richard, Vincent Lostanlen, Yi-Hsuan Yang, Meinard Müller:
Model-Based Deep Learning for Music Information Research: Leveraging diverse knowledge sources to enhance explainability, controllability, and resource efficiency [Special Issue On Model-Based and Data-Driven Audio Signal Processing]. IEEE Signal Process. Mag. 41(6): 51-59 (2024) - [j66]Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Gaël Richard, Florence d'Alché-Buc:
Tackling Interpretability in Audio Classification Networks With Non-negative Matrix Factorization. IEEE ACM Trans. Audio Speech Lang. Process. 32: 1392-1405 (2024) - [c183]David Perera, Slim Essid, Gaël Richard:
Invariance-Based Layer Regularization for Sound Event Detection. EUSIPCO 2024: 51-55 - [c182]Benoît Giniès, Xiaoyu Bie, Olivier Fercoq, Gaël Richard:
Using Random Codebooks for Audio Neural AutoEncoders. EUSIPCO 2024: 311-315 - [c181]Gaël Richard, Pierre Chouteau, Bernardo Torres:
A Fully Differentiable Model for Unsupervised Singing Voice Separation. ICASSP 2024: 946-950 - [c180]Manvi Agarwal, Changhong Wang, Gaël Richard:
Structure-Informed Positional Encoding for Music Generation. ICASSP 2024: 951-955 - [c179]Teysir Baoueb, Haocheng Liu, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard:
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis. ICASSP 2024: 986-990 - [c178]Bernardo Torres, Geoffroy Peeters, Gaël Richard:
Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport. ICASSP 2024: 1176-1180 - [c177]Haocheng Liu, Teysir Baoueb, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard:
GLA-GRAD: A Griffin-Lim Extended Waveform Generation Diffusion Model. ICASSP 2024: 11611-11615 - [c176]Victor Letzelter, David Perera, Cédric Rommel, Mathieu Fontaine, Slim Essid, Gaël Richard, Patrick Pérez:
Winner-takes-all learners are geometry-aware conditional density estimators. ICML 2024 - [c175]Teysir Baoueb, Xiaoyu Bie, Hicham Janati, Gaël Richard:
Wavetransfer: A Flexible End-to-End Multi-Instrument Timbre Transfer with Diffusion. MLSP 2024: 1-6 - [c174]Xuanyu Zhuang, Geoffroy Peeters, Gaël Richard:
Episodic Fine-Tuning Prototypical Networks for Optimization-Based Few-Shot Learning: Application to Audio Classification. MLSP 2024: 1-6 - [i43]Bernardo Torres, Stefan Lattner, Gaël Richard:
Singer Identity Representation Learning using Self-Supervised Techniques. CoRR abs/2401.05064 (2024) - [i42]Teysir Baoueb, Haocheng Liu, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard:
SpecDiff-GAN: A Spectrally-Shaped Noise Diffusion GAN for Speech and Music Synthesis. CoRR abs/2402.01753 (2024) - [i41]Manvi Agarwal, Changhong Wang, Gaël Richard:
Structure-informed Positional Encoding for Music Generation. CoRR abs/2402.13301 (2024) - [i40]Haocheng Liu, Teysir Baoueb, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard:
GLA-Grad: A Griffin-Lim Extended Waveform Generation Diffusion Model. CoRR abs/2402.15516 (2024) - [i39]Victor Letzelter, David Perera, Cédric Rommel, Mathieu Fontaine, Slim Essid, Gaël Richard, Patrick Pérez:
Winner-takes-all learners are geometry-aware conditional density estimators. CoRR abs/2406.04706 (2024) - [i38]Louis Bahrman, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard:
Speech dereverberation constrained on room impulse response characteristics. CoRR abs/2407.08657 (2024) - [i37]David Perera, Victor Letzelter, Théo Mariotte, Adrien Cortés, Mickaël Chen, Slim Essid, Gaël Richard:
Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing. CoRR abs/2407.15580 (2024) - [i36]Xiaoyu Bie, Xubo Liu, Gaël Richard:
Learning Source Disentanglement in Neural Audio Codec. CoRR abs/2409.11228 (2024) - [i35]Teysir Baoueb, Xiaoyu Bie, Hicham Janati, Gaël Richard:
WaveTransfer: A Flexible End-to-end Multi-instrument Timbre Transfer with Diffusion. CoRR abs/2409.15321 (2024) - [i34]Xuanyu Zhuang, Geoffroy Peeters, Gaël Richard:
Episodic fine-tuning prototypical networks for optimization-based few-shot learning: Application to audio classification. CoRR abs/2410.05302 (2024) - [i33]David Perera, François Derrida, Théo Mariotte, Gaël Richard, Slim Essid:
Multiple Choice Learning for Efficient Speech Separation with Many Speakers. CoRR abs/2411.18497 (2024) - 2023
- [j65]Gaël Richard, Paris Smaragdis, Sharon Gannot, Patrick A. Naylor, Shoji Makino, Walter Kellermann, Akihiko Sugiyama:
Audio Signal Processing in the 21st Century: The important outcomes of the past 25 years. IEEE Signal Process. Mag. 40(5): 12-26 (2023) - [j64]Kilian Schulze-Forster, Gaël Richard, Liam Kelley, Clement S. J. Doire, Roland Badeau:
Unsupervised Music Source Separation Using Differentiable Parametric Source Models. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1276-1289 (2023) - [j63]Laure Prétet, Gaël Richard, Clément Souchier, Geoffroy Peeters:
Video-to-Music Recommendation Using Temporal Alignment of Segments. IEEE Trans. Multim. 25: 2898-2911 (2023) - [c173]Félix Mathieu, Thomas Courtat, Gaël Richard, Geoffroy Peeters:
Learning Interpretable Filters In Wav-UNet For Speech Enhancement. ICASSP 2023: 1-5 - [c172]Changhong Wang, Gaël Richard, Brian McFee:
Transfer Learning and Bias Correction With Pre-Trained Audio Embeddings. ISMIR 2023: 64-70 - [c171]Bernardo Torres, Stefan Lattner, Gaël Richard:
Singer Identity Representation Learning Using Self-Supervised Techniques. ISMIR 2023: 448-456 - [c170]Victor Letzelter, Mathieu Fontaine, Mickaël Chen, Patrick Pérez, Slim Essid, Gaël Richard:
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis. NeurIPS 2023 - [e3]Augusto Sarti, Fabio Antonacci, Mark Sandler, Paolo Bestagini, Simon Dixon, Beici Liang, Gaël Richard, Johan Pauwels:
Proceedings of the 24th International Society for Music Information Retrieval Conference, ISMIR 2023, Milan, Italy, November 5-9, 2023. 2023, ISBN 978-1-7327299-3-3 [contents] - [i32]Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Gaël Richard, Florence d'Alché-Buc:
Tackling Interpretability in Audio Classification Networks with Non-negative Matrix Factorization. CoRR abs/2305.07132 (2023) - [i31]Laure Prétet, Gaël Richard, Clément Souchier, Geoffroy Peeters:
Video-to-Music Recommendation using Temporal Alignment of Segments. CoRR abs/2306.07187 (2023) - [i30]Changhong Wang, Gaël Richard, Brian McFee:
Transfer Learning and Bias Correction with Pre-trained Audio Embeddings. CoRR abs/2307.10834 (2023) - [i29]Victor Letzelter, Mathieu Fontaine, Mickaël Chen, Patrick Pérez, Slim Essid, Gaël Richard:
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis. CoRR abs/2311.01052 (2023) - [i28]Bernardo Torres, Geoffroy Peeters, Gaël Richard:
Unsupervised Harmonic Parameter Estimation Using Differentiable DSP and Spectral Optimal Transport. CoRR abs/2312.14507 (2023) - 2022
- [c169]Milad Sefidgaran, Amin Gohari, Gaël Richard, Umut Simsekli:
Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms. COLT 2022: 4416-4463 - [c168]David Perera, Slim Essid, Gaël Richard:
Latent and Adversarial Data Augmentations for Sound Event Detection and Classification. DCASE 2022 - [c167]Félix Mathieu, Thomas Courtat, Gaël Richard, Geoffroy Peeters:
Phase Shifted Bedrosian Filterbank: An Interpretable Audio Front-End for Time-Domain Audio Source Separation. ICASSP 2022: 531-535 - [c166]Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard:
Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts. ISMIR 2022: 186-192 - [c165]Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc, Gaël Richard:
Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF. NeurIPS 2022 - [e2]Mathieu Lagrange, Annamaria Mesaros, Thomas Pellegrini, Gaël Richard, Romain Serizel, Dan Stowell:
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, DCASE 2022, Nancy, France, November 3-4, 2022. Tampere University 2022, ISBN 978-952-03-2677-7 [contents] - [i27]Kilian Schulze-Forster, Clement S. J. Doire, Gaël Richard, Roland Badeau:
Unsupervised Audio Source Separation Using Differentiable Parametric Source Models. CoRR abs/2201.09592 (2022) - [i26]Jayneel Parekh, Sanjeel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc, Gaël Richard:
Listen to Interpret: Post-hoc Interpretability for Audio Networks with NMF. CoRR abs/2202.11479 (2022) - [i25]Milad Sefidgaran, Amin Gohari, Gaël Richard, Umut Simsekli:
Rate-Distortion Theoretic Generalization Bounds for Stochastic Learning Algorithms. CoRR abs/2203.02474 (2022) - [i24]Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard:
Exploiting Device and Audio Data to Tag Music with User-Aware Listening Contexts. CoRR abs/2211.07250 (2022) - 2021
- [j62]Kilian Schulze-Forster, Clement S. J. Doire, Gaël Richard, Roland Badeau:
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation. IEEE ACM Trans. Audio Speech Lang. Process. 29: 2382-2395 (2021) - [c164]Giorgia Cantisani, Slim Essid, Gaël Richard:
Neuro-Steered Music Source Separation With EEG-Based Auditory Attention Decoding And Contrastive-NMF. ICASSP 2021: 36-40 - [c163]Ondrej Cífka, Alexey Ozerov, Umut Simsekli, Gaël Richard:
Self-Supervised VQ-VAE for One-Shot Music Style Transfer. ICASSP 2021: 96-100 - [c162]Antoine Liutkus, Ondrej Cífka, Shih-Lun Wu, Umut Simsekli, Yi-Hsuan Yang, Gaël Richard:
Relative Positional Encoding for Transformers with Linear Complexity. ICML 2021: 7067-7079 - [c161]Laure Prétet, Gaël Richard, Geoffroy Peeters:
Cross-Modal Music-Video Recommendation: A Study of Design Choices. IJCNN 2021: 1-9 - [c160]Javier Nistal, Stefan Lattner, Gaël Richard:
DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis With GANs. ISMIR 2021: 484-492 - [c159]Laure Prétet, Gaël Richard, Geoffroy Peeters:
Is there a "language of music-video clips" ? A qualitative and quantitative study. ISMIR 2021: 539-546 - [c158]Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gaël Richard:
The Words Remain the Same: Cover Detection with Lyrics Transcription. ISMIR 2021: 714-721 - [c157]Melih Barsbey, Milad Sefidgaran, Murat A. Erdogdu, Gaël Richard, Umut Simsekli:
Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks. NeurIPS 2021: 29364-29378 - [c156]Giorgia Cantisani, Alexey Ozerov, Slim Essid, Gaël Richard:
User-Guided One-Shot Deep Model Adaptation for Music Source Separation. WASPAA 2021: 111-115 - [c155]Javier Nistal, Cyran Aouameur, Stefan Lattner, Gaël Richard:
VQCPC-GAN: Variable-Length Adversarial Audio Synthesis Using Vector-Quantized Contrastive Predictive Coding. WASPAA 2021: 116-120 - [p5]Geoffroy Peeters, Gaël Richard:
Deep Learning for Audio and Music. Multi-faceted Deep Learning 2021: 231-266 - [i23]Ondrej Cífka, Alexey Ozerov, Umut Simsekli, Gaël Richard:
Self-Supervised VQ-VAE For One-Shot Music Style Transfer. CoRR abs/2102.05749 (2021) - [i22]Laure Prétet, Gaël Richard, Geoffroy Peeters:
Cross-Modal Music-Video Recommendation: A Study of Design Choices. CoRR abs/2104.14799 (2021) - [i21]Javier Nistal, Cyran Aouameur, Stefan Lattner, Gaël Richard:
VQCPC-GAN: Variable-length Adversarial Audio Synthesis using Vector-Quantized Contrastive Predictive Coding. CoRR abs/2105.01531 (2021) - [i20]Antoine Liutkus, Ondrej Cífka, Shih-Lun Wu, Umut Simsekli, Yi-Hsuan Yang, Gaël Richard:
Relative Positional Encoding for Transformers with Linear Complexity. CoRR abs/2105.08399 (2021) - [i19]Melih Barsbey, Milad Sefidgaran, Murat A. Erdogdu, Gaël Richard, Umut Simsekli:
Heavy Tails in SGD and Compressibility of Overparametrized Neural Networks. CoRR abs/2106.03795 (2021) - [i18]Benoit Fuentes, Gaël Richard:
Probabilistic semi-nonnegative matrix factorization: a Skellam-based framework. CoRR abs/2107.03317 (2021) - [i17]Laure Prétet, Gaël Richard, Geoffroy Peeters:
Is there a "language of music-video clips" ? A qualitative and quantitative study. CoRR abs/2108.00970 (2021) - [i16]Javier Nistal, Stefan Lattner, Gaël Richard:
DarkGAN: Exploiting Knowledge Distillation for Comprehensible Audio Synthesis with GANs. CoRR abs/2108.01216 (2021) - 2020
- [j61]Didier Bresch, Nicolas Cellier, Frédéric Couderc, Marguerite Gisclon, Pascal Noble, Gaël Richard, Christian Ruyer-Quil, Jean-Paul Vila:
Augmented skew-symmetric system for shallow-water system with surface tension allowing large gradient of density. J. Comput. Phys. 419: 109670 (2020) - [j60]Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Weakly Supervised Representation Learning for Audio-Visual Scene Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 28: 416-428 (2020) - [j59]Ondrej Cífka, Umut Simsekli, Gaël Richard:
Groove2Groove: One-Shot Music Style Transfer With Supervision From Synthetic Data. IEEE ACM Trans. Audio Speech Lang. Process. 28: 2638-2650 (2020) - [c154]Javier Nistal, Stefan Lattner, Gaël Richard:
Comparing Representations for Audio Synthesis Using Generative Adversarial Networks. EUSIPCO 2020: 161-165 - [c153]Karim M. Ibrahim, Jimena Royo-Letelier, Elena V. Epure, Geoffroy Peeters, Gaël Richard:
Audio-Based Auto-Tagging With Contextual Tags for Music. ICASSP 2020: 16-20 - [c152]Laure Prétet, Gaël Richard, Geoffroy Peeters:
Learning to Rank Music Tracks Using Triplet Loss. ICASSP 2020: 511-515 - [c151]Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gaël Richard, Florence d'Alché-Buc:
Audio-Based Detection of Explicit Content in Music. ICASSP 2020: 526-530 - [c150]Enguerrand Gentet, Bertrand David, Sébastien Denjean, Gaël Richard, Vincent Roussarie:
Speech Intelligibility Enhancement by Equalization for in-Car Applications. ICASSP 2020: 6934-6938 - [c149]Kilian Schulze-Forster, Clement S. J. Doire, Gaël Richard, Roland Badeau:
Joint Phoneme Alignment and Text-Informed Speech Separation on Highly Corrupted Speech. ICASSP 2020: 7274-7278 - [c148]Enguerrand Gentet, Bertrand David, Sébastien Denjean, Gaël Richard, Vincent Roussarie:
Neutral to Lombard Speech Conversion with Deep Learning. ICASSP 2020: 7739-7743 - [c147]Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard:
Should we consider the users in contextual music auto-tagging models?. ISMIR 2020: 295-301 - [c146]Andrea Vaglio, Romain Hennequin, Manuel Moussallam, Gaël Richard, Florence d'Alché-Buc:
Multilingual lyrics-to-audio alignment. ISMIR 2020: 512-519 - [c145]Javier Nistal, Stefan Lattner, Gaël Richard:
DRUMGAN: Synthesis of Drum Sounds with Timbral Feature Conditioning Using Generative Adversarial Networks. ISMIR 2020: 590-597 - [c144]Thomas Janssoone, Kévin Bailly, Gaël Richard, Chloé Clavel:
The POTUS Corpus, a Database of Weekly Addresses for the Study of Stance in Politics and Virtual Agents. LREC 2020: 1546-1553 - [c143]Karim M. Ibrahim, Elena V. Epure, Geoffroy Peeters, Gaël Richard:
Confidence-based Weighted Loss for Multi-label Classification with Missing Labels. ICMR 2020: 291-295 - [c142]Simon Henriet, Benoit Fuentes, Umut Simsekli, Gaël Richard:
Matrix Factorization for High Frequency Non Intrusive Load Monitoring: Definitions and Algorithms. NILM@SenSys 2020: 20-24 - [d1]Ondrej Cífka, Umut Simsekli, Gaël Richard:
Groove2Groove MIDI Dataset: synthetic accompaniments in 3k styles. Zenodo, 2020 - [i15]Laure Prétet, Gaël Richard, Geoffroy Peeters:
Learning to rank music tracks using triplet loss. CoRR abs/2005.12977 (2020) - [i14]Javier Nistal, Stefan Lattner, Gaël Richard:
Comparing Representations for Audio Synthesis Using Generative Adversarial Networks. CoRR abs/2006.09266 (2020) - [i13]Javier Nistal, Stefan Lattner, Gaël Richard:
DrumGAN: Synthesis of Drum Sounds With Timbral Feature Conditioning Using Generative Adversarial Networks. CoRR abs/2008.12073 (2020)
2010 – 2019
- 2019
- [j58]Simon Henriet, Umut Simsekli, Sergio Dos Santos, Benoit Fuentes, Gaël Richard:
Independent-Variation Matrix Factorization With Application to Energy Disaggregation. IEEE Signal Process. Lett. 26(11): 1643-1647 (2019) - [j57]Zhiyao Duan, Slim Essid, Cynthia C. S. Liem, Gaël Richard, Gaurav Sharma:
Audiovisual Analysis of Music Performances: Overview of an Emerging Field. IEEE Signal Process. Mag. 36(1): 63-73 (2019) - [c141]Thanh Huy Nguyen, Umut Simsekli, Gaël Richard:
Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization. ICML 2019: 4810-4819 - [c140]Ondrej Cífka, Umut Simsekli, Gaël Richard:
Supervised Symbolic Music Style Translation Using Synthetic Data. ISMIR 2019: 588-595 - [c139]Thanh Huy Nguyen, Umut Simsekli, Mert Gürbüzbalaban, Gaël Richard:
First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise. NeurIPS 2019: 273-283 - [c138]Giorgia Cantisani, Gabriel Trégoat, Slim Essid, Gaël Richard:
MAD-EEG: an EEG dataset for decoding auditory attention to a target instrument in polyphonic music. SMM 2019 - [c137]Giorgia Cantisani, Slim Essid, Gaël Richard:
EEG-Based Decoding of Auditory Attention to a Target Instrument in Polyphonic Music. WASPAA 2019: 80-84 - [c136]Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Identify, Locate and Separate: Audio-Visual Object Extraction in Large Video Collections Using Weak Supervision. WASPAA 2019: 268-272 - [c135]Kilian Schulze-Forster, Clement S. J. Doire, Gaël Richard, Roland Badeau:
Weakly Informed Audio Source Separation. WASPAA 2019: 273-277 - [i12]Thanh Huy Nguyen, Umut Simsekli, Gaël Richard:
Non-Asymptotic Analysis of Fractional Langevin Monte Carlo for Non-Convex Optimization. CoRR abs/1901.07487 (2019) - [i11]Thanh Huy Nguyen, Umut Simsekli, Mert Gürbüzbalaban, Gaël Richard:
First Exit Time Analysis of Stochastic Gradient Descent Under Heavy-Tailed Gradient Noise. CoRR abs/1906.09069 (2019) - [i10]Ondrej Cífka, Umut Simsekli, Gaël Richard:
Supervised Symbolic Music Style Translation Using Synthetic Data. CoRR abs/1907.02265 (2019) - [i9]Didier Bresch, Nicolas Cellier, Frédéric Couderc, Marguerite Gisclon, Pascal Noble, Gaël Richard, Christian Ruyer-Quil, Jean-Paul Vila:
Augmented Skew-Symetric System for Shallow-Water System with Surface Tension Allowing Large Gradient of Density. CoRR abs/1911.12217 (2019) - [i8]Umut Simsekli, Mert Gürbüzbalaban, Thanh Huy Nguyen, Gaël Richard, Levent Sagun:
On the Heavy-Tailed Theory of Stochastic Gradient Descent for Deep Neural Networks. CoRR abs/1912.00018 (2019) - 2018
- [j56]Thanh Huy Nguyen, Umut Simsekli, Gaël Richard, Ali Taylan Cemgil:
Efficient Bayesian Model Selection in PARAFAC via Stochastic Thermodynamic Integration. IEEE Signal Process. Lett. 25(5): 725-729 (2018) - [j55]Simon Leglaive, Roland Badeau, Gaël Richard:
Student's t Source and Mixing Models for Multichannel Audio Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 26(6): 1150-1164 (2018) - [j54]Clement Laroche, Matthieu Kowalski, Hélène Papadopoulos, Gaël Richard:
Hybrid Projective Nonnegative Matrix Factorization With Drum Dictionaries for Harmonic/Percussive Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1499-1511 (2018) - [c134]Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events. CVPR Workshops 2018: 2518-2519 - [c133]Umut Simsekli, Halil Erdogan, Simon Leglaive, Antoine Liutkus, Roland Badeau, Gaël Richard:
Alpha-Stable Low-Rank Plus Residual Decomposition for Speech Enhancement. ICASSP 2018: 651-655 - [c132]Umut Simsekli, Çagatay Yildiz, Thanh Huy Nguyen, A. Taylan Cemgil, Gaël Richard:
Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization. ICML 2018: 4681-4690 - [e1]Mark D. Plumbley, Christian Kroos, Juan Pablo Bello, Gaël Richard, Daniel P. W. Ellis, Annamaria Mesaros:
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, DCASE 2018, Surrey, UK, November 19-20, 2018. 2018, ISBN 978-952-15-4262-6 [contents] - [i7]Simon Henriet, Umut Simsekli, Benoit Fuentes, Gaël Richard:
A Generative Model for Non-Intrusive Load Monitoring in Commercial Buildings. CoRR abs/1803.00515 (2018) - [i6]Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Weakly Supervised Representation Learning for Unsynchronized Audio-Visual Events. CoRR abs/1804.07345 (2018) - [i5]Umut Simsekli, Çagatay Yildiz, Thanh Huy Nguyen, Gaël Richard, A. Taylan Cemgil:
Asynchronous Stochastic Quasi-Newton MCMC for Non-Convex Optimization. CoRR abs/1806.02617 (2018) - [i4]Sanjeel Parekh, Alexey Ozerov, Slim Essid, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Identify, locate and separate: Audio-visual object extraction in large video collections using weak supervision. CoRR abs/1811.04000 (2018) - 2017
- [j53]Thomas Janssoone, Chloé Clavel, Kevin Bailly, Gaël Richard:
Règles d'associations temporelles de signaux sociaux pour la synthèse d'agents conversationnels animés. Application aux attitudes sociales. Rev. d'Intelligence Artif. 31(5): 511-536 (2017) - [j52]Sébastien Fenet, Roland Badeau, Gaël Richard:
Reassigned time-frequency representations of discrete time signals and application to the Constant-Q Transform. Signal Process. 132: 170-176 (2017) - [j51]Karan Nathwani, Gaël Richard, Bertrand David, Pierre Prablanc, Vincent Roussarie:
Speech intelligibility improvement in car noise environment by voice transformation. Speech Commun. 91: 17-27 (2017) - [j50]Simon Durand, Juan Pablo Bello, Bertrand David, Gaël Richard:
Robust Downbeat Tracking Using an Ensemble of Convolutional Networks. IEEE ACM Trans. Audio Speech Lang. Process. 25(1): 72-85 (2017) - [j49]Gaël Richard, Tuomas Virtanen, Juan Pablo Bello, Nobutaka Ono, Hervé Glotin:
Introduction to the Special Section on Sound Scene and Event Analysis. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1169-1171 (2017) - [j48]Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard:
Feature Learning With Matrix Factorization Applied to Acoustic Scene Classification. IEEE ACM Trans. Audio Speech Lang. Process. 25(6): 1216-1229 (2017) - [c131]Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard:
Nonnegative Feature Learning Methods for Acoustic Scene Classification. DCASE 2017: 22-26 - [c130]Simon Leglaive, Roland Badeau, Gaël Richard:
Semi-blind student's t source separation for multichannel audio convolutive mixtures. EUSIPCO 2017: 2259-2263 - [c129]Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Motion informed audio source separation. ICASSP 2017: 6-10 - [c128]Simon Leglaive, Roland Badeau, Gaël Richard:
Multichannel audio source separation: Variational inference of time-frequency sources from time-domain observations. ICASSP 2017: 26-30 - [c127]Victor Bisot, Slim Essid, Gaël Richard:
Overlapping sound event detection with supervised Nonnegative Matrix Factorization. ICASSP 2017: 31-35 - [c126]Romain Serizel, Victor Bisot, Slim Essid, Gaël Richard:
Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification. ICASSP 2017: 36-40 - [c125]Clement Laroche, Hélène Papadopoulos, Matthieu Kowalski, Gaël Richard:
Drum extraction in single channel audio signals using multi-layer Non negative Matrix Factor Deconvolution. ICASSP 2017: 46-50 - [c124]Simon Leglaive, Umut Simsekli, Antoine Liutkus, Roland Badeau, Gaël Richard:
Alpha-stable multichannel audio source separation. ICASSP 2017: 576-580 - [c123]Umut Simsekli, Alain Durmus, Roland Badeau, Gaël Richard, Eric Moulines, A. Taylan Cemgil:
Parallelized Stochastic Gradient Markov Chain Monte Carlo algorithms for non-negative matrix factorization. ICASSP 2017: 2242-2246 - [c122]Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard:
Leveraging deep neural networks with nonnegative representations for improved environmental sound classification. MLSP 2017: 1-6 - [c121]Simon Henriet, Umut Simsekli, Gaël Richard, Benoit Fuentes:
Synthetic dataset generation for non-intrusive load monitoring in commercial buildings. BuildSys 2017: 39:1-39:2 - [c120]Sanjeel Parekh, Slim Essid, Alexey Ozerov, Ngoc Q. K. Duong, Patrick Pérez, Gaël Richard:
Guiding audio source separation by video object information. WASPAA 2017: 61-65 - [c119]Simon Leglaive, Roland Badeau, Gaël Richard:
Separating time-frequency sources from time-domain convolutive mixtures using non-negative matrix factorization. WASPAA 2017: 264-268 - 2016
- [j47]Xabier Jaureguiberry, Emmanuel Vincent, Gaël Richard:
Fusion Methods for Speech Enhancement and Audio Source Separation. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1266-1279 (2016) - [j46]Simon Leglaive, Roland Badeau, Gaël Richard:
Multichannel Audio Source Separation With Probabilistic Reverberation Priors. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2453-2465 (2016) - [c118]Simon Leglaive, Roland Badeau, Gaël Richard:
Autoregressive moving average modeling of late reverberation in the frequency domain. EUSIPCO 2016: 1478-1482 - [c117]Simon Durand, Juan Pablo Bello, Bertrand David, Gaël Richard:
Feature adapted convolutional neural networks for downbeat tracking. ICASSP 2016: 296-300 - [c116]Umut Simsekli, Roland Badeau, Gaël Richard, Ali Taylan Cemgil:
Stochastic thermodynamic integration: Efficient Bayesian model selection via stochastic gradient MCMC. ICASSP 2016: 2574-2578 - [c115]Karan Nathwani, Morgane Daniel, Gaël Richard, Bertrand David, Vincent Roussarie:
Formant shifting for speech intelligibility improvement in car noise environment. ICASSP 2016: 5375-5379 - [c114]Romain Serizel, Slim Essid, Gaël Richard:
Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification. ICASSP 2016: 5470-5474 - [c113]Victor Bisot, Romain Serizel, Slim Essid, Gaël Richard:
Acoustic scene classification with matrix factorization for unsupervised feature learning. ICASSP 2016: 6445-6449 - [c112]Romain Serizel, Victor Bisot, Slim Essid, Gaël Richard:
Machine listening techniques as a complement to video image analysis in forensics. ICIP 2016: 948-952 - [c111]Umut Simsekli, Roland Badeau, A. Taylan Cemgil, Gaël Richard:
Stochastic Quasi-Newton Langevin Monte Carlo. ICML 2016: 642-651 - [c110]Clement Laroche, Hélène Papadopoulos, Matthieu Kowalski, Gaël Richard:
Genre Specific Dictionaries for Harmonic/Percussive Source Separation. ISMIR 2016: 407-413 - [c109]Thomas Janssoone, Chloé Clavel, Kevin Bailly, Gaël Richard:
Using Temporal Association Rules for the Synthesis of Embodied Conversational Agents with a Specific Stance. IVA 2016: 175-189 - [c108]Romain Serizel, Slim Essid, Gaël Richard:
Mini-batch stochastic approaches for accelerated multiplicative updates in nonnegative matrix factorisation with beta-divergence. MLSP 2016: 1-6 - [c107]Alain Durmus, Umut Simsekli, Eric Moulines, Roland Badeau, Gaël Richard:
Stochastic Gradient Richardson-Romberg Markov Chain Monte Carlo. NIPS 2016: 2047-2055 - [i3]Simon Durand, Juan Pablo Bello, Bertrand David, Gaël Richard:
Robust Downbeat Tracking Using an Ensemble of Convolutional Networks. CoRR abs/1605.08396 (2016) - 2015
- [j45]Hequn Bai, Gaël Richard, Laurent Daudet:
Late Reverberation Synthesis: From Radiance Transfer to Feedback Delay Networks. IEEE ACM Trans. Audio Speech Lang. Process. 23(12): 2260-2271 (2015) - [j44]Aymeric Masurelle, Ahmed Rida Sekkat, Slim Essid, Gaël Richard:
TPT-Dance&Actions : un corpus multimodal d'activités humaines. Traitement du Signal 32(4): 443-475 (2015) - [c106]Victor Bisot, Slim Essid, Gaël Richard:
HOG and subband power distribution image features for acoustic scene classification. EUSIPCO 2015: 719-723 - [c105]Clement Laroche, Matthieu Kowalski, Hélène Papadopoulos, Gaël Richard:
A structured nonnegative matrix factorization for source separation. EUSIPCO 2015: 2033-2037 - [c104]Camila de Andrade Scatolini, Gaël Richard, Benoit Fuentes:
Multipitch estimation using a PLCA-based model: Impact of partial user annotation. ICASSP 2015: 186-190 - [c103]Simon Durand, Juan Pablo Bello, Bertrand David, Gaël Richard:
Downbeat tracking with multiple features and deep neural networks. ICASSP 2015: 409-413 - [c102]Hequn Bai, Gaël Richard, Laurent Daudet:
Geometric-based reverberator using acoustic rendering networks. WASPAA 2015: 1-5 - [c101]Simon Leglaive, Roland Badeau, Gaël Richard:
Multichannel audio source separation with probabilistic reverberation modeling. WASPAA 2015: 1-5 - 2014
- [j43]Manuel Moussallam, Alexandre Gramfort, Laurent Daudet, Gaël Richard:
Blind Denoising with Random Greedy Pursuits. IEEE Signal Process. Lett. 21(11): 1341-1345 (2014) - [j42]Justin Salamon, Emilia Gómez, Daniel P. W. Ellis, Gaël Richard:
Melody Extraction from Polyphonic Music Signals: Approaches, applications, and challenges. IEEE Signal Process. Mag. 31(2): 118-134 (2014) - [c100]Benoit Fuentes, Roland Badeau, Gaël Richard:
Controlling the convergence rate to help parameter estimation in a PLCA-based model. EUSIPCO 2014: 626-630 - [c99]Aymeric Masurelle, Slim Essid, Gaël Richard:
Gesture recognition using a NMF-based representation of motion-traces extracted from depth silhouettes. ICASSP 2014: 1275-1279 - [c98]Simon Durand, Bertrand David, Gaël Richard:
Enhancing downbeat detection when facing different music styles. ICASSP 2014: 3132-3136 - [c97]Nicolás López, Yves Grenier, Gaël Richard, Ivan Bourmeyster:
Single channel reverberation suppression based on sparse linear prediction. ICASSP 2014: 5182-5186 - [c96]Xabier Jaureguiberry, Emmanuel Vincent, Gaël Richard:
Multiple-order non-negative matrix factorization for speech enhancement. INTERSPEECH 2014: 2838-2842 - [c95]Emmanouil Benetos, Roland Badeau, Tillman Weyde, Gaël Richard:
Template Adaptation for Improving Automatic Music Transcription. ISMIR 2014: 175-180 - [c94]Gaël Richard:
Informed Audio Source Separation. Semantic Audio 2014 - [c93]Xabier Jaureguiberry, Emmanuel Vincent, Gaël Richard:
Variational Bayesian model averaging for audio source separation. SSP 2014: 33-36 - 2013
- [j41]Slim Essid, Xinyu Lin, Marc Gowing, Georgios Kordelas, Anil Aksay, Philip Kelly, Thomas Fillon, Qianni Zhang, Alfred Dielmann, Vlado Kitanovski, Robin Tournemenne, Aymeric Masurelle, Ebroul Izquierdo, Noel E. O'Connor, Petros Daras, Gaël Richard:
A multi-modal dance corpus for research into interaction between humans in virtual environments. J. Multimodal User Interfaces 7(1-2): 157-170 (2013) - [j40]Gaël Richard, Shiva Sundaram, Shrikanth S. Narayanan:
An Overview on Perceptually Motivated Audio Indexing and Classification. Proc. IEEE 101(9): 1939-1954 (2013) - [j39]Olivier Derrien, Roland Badeau, Gaël Richard:
Parametric Audio Coding With Exponentially Damped Sinusoids. IEEE Trans. Speech Audio Process. 21(7): 1489-1501 (2013) - [j38]Alexey Ozerov, Antoine Liutkus, Roland Badeau, Gaël Richard:
Coding-Based Informed Source Separation: Nonnegative Tensor Factorization Approach. IEEE Trans. Speech Audio Process. 21(8): 1699-1712 (2013) - [j37]Benoit Fuentes, Roland Badeau, Gaël Richard:
Harmonic Adaptive Latent Component Analysis of Audio and Application to Music Transcription. IEEE Trans. Speech Audio Process. 21(9): 1854-1866 (2013) - [j36]Cyril Joder, Slim Essid, Gaël Richard:
Learning Optimal Features for Polyphonic Audio-to-Score Alignment. IEEE Trans. Speech Audio Process. 21(10): 2118-2128 (2013) - [c92]Nicolás López, Mounira Maazaoui, Yves Grenier, Gaël Richard, Ivan Bourmeyster:
Does dereverberation help multichannel source separation? A case study. EUSIPCO 2013: 1-5 - [c91]Antoine Liutkus, Roland Badeau, Gaël Richard:
Low bitrate informed source separation of realistic mixtures. ICASSP 2013: 66-70 - [c90]Sébastien Fenet, Yves Grenier, Gaël Richard:
An Extended Audio Fingerprint Method with Capabilities for Similar Music Detection. ISMIR 2013: 569-574 - [c89]Xabier Jaureguiberry, Gaël Richard, Pierre Leveau, Romain Hennequin, Emmanuel Vincent:
Introducing a simple fusion framework for audio source separation. MLSP 2013: 1-6 - [c88]Hequn Bai, Gaël Richard, Laurent Daudet:
Modeling early reflections of room impulse responses using a radiance transfer method. WASPAA 2013: 1-4 - [c87]Konstantinos C. Apostolakis, Dimitrios S. Alexiadis, Petros Daras, David S. Monaghan, Noel E. O'Connor, Benjamin Prestele, Peter Eisert, Gaël Richard, Qianni Zhang, Ebroul Izquierdo, Maher Ben Moussa, Nadia Magnenat-Thalmann:
Blending real with virtual in 3DLife. WIAMIS 2013: 1-4 - [c86]Rémi Foucard, Slim Essid, Gaël Richard, Mathieu Lagrange:
Exploring new features for music classification. WIAMIS 2013: 1-4 - [c85]Antoine Liutkus, Jean-Louis Durrieu, Laurent Daudet, Gaël Richard:
An overview of informed audio source separation. WIAMIS 2013: 1-4 - [c84]Aymeric Masurelle, Slim Essid, Gaël Richard:
Multimodal classification of dance movements using body joint trajectories and step sounds. WIAMIS 2013: 1-4 - [p4]Chloé Clavel, Gaël Richard:
Recognition of Acoustic Emotion. Emotion-Oriented Systems 2013: 139-167 - [i2]Manuel Moussallam, Alexandre Gramfort, Laurent Daudet, Gaël Richard:
Blind Denoising with Random Greedy Pursuits. CoRR abs/1312.5444 (2013) - 2012
- [j35]Antoine Liutkus, Jonathan Pinel, Roland Badeau, Laurent Girin, Gaël Richard:
Informed source separation through spectrogram coding and data embedding. Signal Process. 92(8): 1937-1949 (2012) - [j34]Manuel Moussallam, Laurent Daudet, Gaël Richard:
Matching Pursuits with random sequential subdictionaries. Signal Process. 92(10): 2532-2544 (2012) - [j33]Mathieu Ramona, Gaël Richard, Bertrand David:
Multiclass Feature Selection With Kernel Gram-Matrix-Based Criteria. IEEE Trans. Neural Networks Learn. Syst. 23(10): 1611-1623 (2012) - [c83]Sébastien Fenet, Manuel Moussallam, Yves Grenier, Gaël Richard, Laurent Daudet:
A framework for fingerprint-based detection of repeating objects in multimedia streams. EUSIPCO 2012: 1464-1468 - [c82]Antoine Liutkus, Stanislaw Gorlow, Nicolas Sturmel, Shuhua Zhang, Laurent Girin, Roland Badeau, Laurent Daudet, Sylvain Marchand, Gaël Richard:
Informed audio source separation: A comparative study. EUSIPCO 2012: 2397-2401 - [c81]Antoine Liutkus, Alexey Ozerov, Roland Badeau, Gaël Richard:
Spatial coding-based Informed Source Separation. EUSIPCO 2012: 2407-2411 - [c80]Manuel Moussallam, Gaël Richard, Laurent Daudet:
Audio source separation informed by redundancy with greedy multiscale decompositions. EUSIPCO 2012: 2644-2648 - [c79]Benoit Fuentes, Roland Badeau, Gaël Richard:
Blind Harmonic Adaptive Decomposition applied to supervised source separation. EUSIPCO 2012: 2654-2658 - [c78]Antoine Liutkus, Zafar Rafii, Roland Badeau, Bryan Pardo, Gaël Richard:
Adaptive filtering for music/voice separation exploiting the repeating musical structure. ICASSP 2012: 53-56 - [c77]Rémi Foucard, Slim Essid, Mathieu Lagrange, Gaël Richard:
A regressive boosting approach to automatic audio tagging based on soft annotator fusion. ICASSP 2012: 73-76 - [c76]Maksim Khadkevich, Thomas Fillon, Gaël Richard, Maurizio Omologo:
A probabilistic approach to simultaneous extraction of beats and downbeats. ICASSP 2012: 445-448 - [c75]Manuel Moussallam, Laurent Daudet, Gaël Richard:
Random time-frequency subdictionary design for sparse representations with greedy algorithms. ICASSP 2012: 3577-3580 - [c74]Benoit Fuentes, Antoine Liutkus, Roland Badeau, Gaël Richard:
Probabilistic model for main melody extraction using Constant-Q transform. ICASSP 2012: 5357-5360 - [p3]Slim Essid, Gaël Richard:
Fusion of Multimodal Information in Music Content Analysis. Multimodal Music Processing 2012: 37-52 - 2011
- [j32]Przemyslaw Dymarski, Nicolas Moreau, Gaël Richard:
Greedy sparse decompositions: a comparative study. EURASIP J. Adv. Signal Process. 2011: 34 (2011) - [j31]Meinard Müller, Daniel P. W. Ellis, Anssi Klapuri, Gaël Richard, Shigeki Sagayama:
Introduction to the Special Issue on Music Signal Processing. IEEE J. Sel. Top. Signal Process. 5(6): 1085-1087 (2011) - [j30]Meinard Müller, Daniel P. W. Ellis, Anssi Klapuri, Gaël Richard:
Signal Processing for Music Analysis. IEEE J. Sel. Top. Signal Process. 5(6): 1088-1110 (2011) - [j29]Jean-Louis Durrieu, Bertrand David, Gaël Richard:
A Musically Motivated Mid-Level Representation for Pitch Estimation and Musical Audio Source Separation. IEEE J. Sel. Top. Signal Process. 5(6): 1180-1191 (2011) - [j28]Cyril Joder, Slim Essid, Gaël Richard:
A Conditional Random Field Framework for Robust and Scalable Audio-to-Score Matching. IEEE ACM Trans. Audio Speech Lang. Process. 19(8): 2385-2397 (2011) - [j27]Antoine Liutkus, Roland Badeau, Gaël Richard:
Gaussian Processes for Underdetermined Source Separation. IEEE Trans. Signal Process. 59(7): 3155-3167 (2011) - [c73]Cyril Joder, Slim Essid, Gaël Richard:
Hidden Discrete Tempo Model: A tempo-aware timing model for audio-to-score alignment. ICASSP 2011: 397-400 - [c72]Benoit Fuentes, Roland Badeau, Gaël Richard:
Adaptive harmonic time-frequency decomposition of audio using shift-invariant PLCA. ICASSP 2011: 401-404 - [c71]Manuel Moussallam, Laurent Daudet, Gaël Richard:
Audio Signal Representations for Factorization in the Sparse Domain. ICASSP 2011: 513-516 - [c70]Felix Weninger, Jean-Louis Durrieu, Florian Eyben, Gaël Richard, Björn W. Schuller:
Combining monaural source separation with Long Short-Term Memory for increased robustness in vocalist gender recognition. ICASSP 2011: 2196-2199 - [c69]Olivier Derrien, Roland Badeau, Gaël Richard:
Entropy-constrained quantization of exponentially damped sinusoids parameters. ICASSP 2011: 4064-4067 - [c68]Sébastien Fenet, Gaël Richard, Yves Grenier:
A Scalable Audio Fingerprint Method with Robustness to Pitch-Shifting. ISMIR 2011: 121-126 - [c67]Sébastien Gulluni, Slim Essid, Olivier Buisson, Gaël Richard:
An Interactive System for Electro-Acoustic Music Analysis. ISMIR 2011: 145-150 - [c66]Rémi Foucard, Slim Essid, Mathieu Lagrange, Gaël Richard:
Multi-scale temporal fusion by boosting for music classification. ISMIR 2011: 663-668 - [c65]Gaël Richard:
Tutorial on multimedia music signal processing. ACM Multimedia 2011: 633-634 - [c64]Slim Essid, Yves Grenier, Mounira Maazaoui, Gaël Richard, Robin Tournemenne:
An audio-driven virtual dance-teaching assistant. ACM Multimedia 2011: 675-678 - [c63]Sébastien Gulluni, Slim Essid, Olivier Buisson, Gaël Richard:
Interactive Classification of Sound Objects for Polyphonic Electro-Acoustic Music Annotation. Semantic Audio 2011 - [c62]Cyril Joder, Slim Essid, Gaël Richard:
Optimizing the mapping from a symbolic to an audio representation for music-to-score alignment. WASPAA 2011: 121-124 - [c61]Alexey Ozerov, Antoine Liutkus, Roland Badeau, Gaël Richard:
Informed source separation: Source coding meets source separation. WASPAA 2011: 257-260 - [p2]Rachid Benmokhtar, Benoit Huet, Gaël Richard, Slim Essid:
Feature Extraction for Multimedia Analysis. Multimedia Semantics 2011: 35-58 - [p1]Slim Essid, Marine Campedel, Gaël Richard, Tomas Piatrik, Rachid Benmokhtar, Benoit Huet:
Machine Learning Techniques for Multimedia Analysis. Multimedia Semantics 2011: 59-80 - [i1]Manuel Moussallam, Laurent Daudet, Gaël Richard:
Matching Pursuits with Random Sequential Subdictionaries. CoRR abs/1107.2509 (2011) - 2010
- [j26]Mathieu Lagrange, Martin Raspaud, Roland Badeau, Gaël Richard:
Explicit modeling of temporal dynamics within musical signals for acoustical unit similarity. Pattern Recognit. Lett. 31(12): 1498-1506 (2010) - [j25]Emmanuel Ravelli, Gaël Richard, Laurent Daudet:
Audio Signal Representations for Indexing in the Transform Domain. IEEE Trans. Speech Audio Process. 18(3): 434-446 (2010) - [j24]Jean-Louis Durrieu, Gaël Richard, Bertrand David, Cédric Févotte:
Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals. IEEE Trans. Speech Audio Process. 18(3): 564-575 (2010) - [c60]Simon Bozonnet, Félicien Vallet, Nicholas W. D. Evans, Slim Essid, Gaël Richard, Jean Carrive:
A multimodal approach to initialisation for top-down speaker diarization of television shows. EUSIPCO 2010: 581-585 - [c59]Antoine Liutkus, Roland Badeau, Gaël Richard:
Informed Source Separation Using Latent Components. LVA/ICA 2010: 498-505 - [c58]Elsa Dupraz, Gaël Richard:
Robust frequency-based Audio Fingerprinting. ICASSP 2010: 281-284 - [c57]Mathieu Lagrange, Roland Badeau, Gaël Richard:
Robust similarity metrics between audio signals based on asymmetrical spectral envelope matching. ICASSP 2010: 405-408 - [c56]Cyril Joder, Slim Essid, Gaël Richard:
A comparative study of tonal acoustic features for a symbolic level music-to-score alignment. ICASSP 2010: 409-412 - [c55]Rémi Foucard, Jean-Louis Durrieu, Mathieu Lagrange, Gaël Richard:
Multimodal similarity between musical streams for cover version detection. ICASSP 2010: 5514-5517 - [c54]Félicien Vallet, Slim Essid, Jean Carrive, Gaël Richard:
Robust visual features for the multimodal identification of unregistered speakers in TV talk-shows. ICIP 2010: 1469-1472 - [c53]Cyril Joder, Slim Essid, Gaël Richard:
An Improved Hierarchical Approach for Music-to-symbolic Score Alignment. ISMIR 2010: 39-45 - [c52]Benoît Mathieu, Slim Essid, Thomas Fillon, Jacques Prado, Gaël Richard:
YAAFE, an Easy to Use and Efficient Audio Feature Extraction Software. ISMIR 2010: 441-446 - [c51]Cyril Joder, Slim Essid, Gaël Richard:
A conditional random field viewpoint of symbolic audio-to-score matching. ACM Multimedia 2010: 871-874
2000 – 2009
- 2009
- [j23]Cyril Joder, Slim Essid, Gaël Richard:
Temporal Integration for Audio Classification With Application to Musical Instrument Classification. IEEE Trans. Speech Audio Process. 17(1): 174-186 (2009) - [c50]Jean-Louis Durrieu, Alexey Ozerov, Cédric Févotte, Gaël Richard, Bertrand David:
Main instrument separation from stereophonic audio signals using a source/filter model. EUSIPCO 2009: 15-19 - [c49]Mathieu Ramona, Gaël Richard:
Comparison of different strategies for a SVM-based audio segmentation. EUSIPCO 2009: 20-24 - [c48]Jean-Louis Durrieu, Gaël Richard, Bertrand David:
An iterative approach to monaural musical mixture de-soloing. ICASSP 2009: 105-108 - [c47]Maxime Lardeur, Slim Essid, Gaël Richard, Martin Haller, Thomas Sikora:
Incorporating prior knowledge on the digital media creation process into audio classifiers. ICASSP 2009: 1653-1656 - [c46]Jan Weil, Thomas Sikora, Jean-Louis Durrieu, Gaël Richard:
Automatic Generation of Lead Sheets from Polyphonic Music Signals. ISMIR 2009: 603-608 - [r1]Gaël Richard:
Audio Indexing. Encyclopedia of Data Warehousing and Mining 2009: 104-109 - 2008
- [j22]Chloé Clavel, Ioana Vasilescu, Laurence Devillers, Gaël Richard, Thibaut Ehrette:
Fear-type emotion recognition for future audio-based surveillance systems. Speech Commun. 50(6): 487-503 (2008) - [j21]Pierre Leveau, Emmanuel Vincent, Gaël Richard, Laurent Daudet:
Instrument-Specific Harmonic Atoms for Mid-Level Music Representation. IEEE Trans. Speech Audio Process. 16(1): 116-128 (2008) - [j20]Olivier Gillet, Gaël Richard:
Transcription and Separation of Drum Signals From Polyphonic Music. IEEE Trans. Speech Audio Process. 16(3): 529-540 (2008) - [j19]Emmanuel Ravelli, Gaël Richard, Laurent Daudet:
Union of MDCT Bases for Audio Coding. IEEE Trans. Speech Audio Process. 16(8): 1361-1372 (2008) - [j18]Olivier Derrien, Gaël Richard:
A New Model-Based Algorithm for Optimizing the MPEG-AAC in MS-Stereo. IEEE Trans. Speech Audio Process. 16(8): 1373-1382 (2008) - [j17]Roland Badeau, Gaël Richard, Bertrand David:
Performance of ESPRIT for Estimating Mixtures of Complex Exponentials Modulated by Polynomials. IEEE Trans. Signal Process. 56(2): 492-504 (2008) - [j16]Michaël Betser, Patrice Collen, Gaël Richard, Bertrand David:
Estimation of Frequency for AM/FM Models Using the Phase Vocoder Framework. IEEE Trans. Signal Process. 56(2): 505-517 (2008) - [j15]Roland Badeau, Gaël Richard, Bertrand David:
Fast and Stable YAST Algorithm for Principal and Minor Subspace Tracking. IEEE Trans. Signal Process. 56(8-1): 3437-3446 (2008) - [j14]Roland Badeau, Bertrand David, Gaël Richard:
CramÉr-Rao Bounds for Multiple Poles and Coefficients of Quasi-Polynomials in Colored Noise. IEEE Trans. Signal Process. 56(8-1): 3458-3467 (2008) - [c45]Cyril Joder, Slim Essid, Gaël Richard:
Alignment kernels for audio classification with application to music instrument recognition. EUSIPCO 2008: 1-5 - [c44]Emmanuel Ravelli, Gaël Richard, Laurent Daudet:
Matching pursuit in adaptive dictionaries for scalable audio coding. EUSIPCO 2008: 1-5 - [c43]Sebastian Wegener, Martin Haller, Juan José Burred, Thomas Sikora, Slim Essid, Gaël Richard:
On the robustness of audio features for musical instrument classification. EUSIPCO 2008: 1-5 - [c42]Jean-Louis Durrieu, Gaël Richard, Bertrand David:
Singer melody extraction in polyphonic signals using source separation methods. ICASSP 2008: 169-172 - [c41]Mathieu Ramona, Gaël Richard, Bertrand David:
Vocal detection in music with support vector machines. ICASSP 2008: 1885-1888 - [c40]Emmanuel Ravelli, Gaël Richard, Laurent Daudet:
Fast MIR in a Sparse Transform Domain. ISMIR 2008: 527-532 - 2007
- [j13]Miguel A. Alonso, Gaël Richard, Bertrand David:
Accurate tempo estimation based on harmonic + noise decomposition. EURASIP J. Adv. Signal Process. 2007 (2007) - [j12]Olivier Gillet, Slim Essid, Gaël Richard:
On the Correlation of Automatic Audio and Visual Segmentations of Music Videos. IEEE Trans. Circuits Syst. Video Technol. 17(3): 347-355 (2007) - [c39]Kevin McGuinness, Olivier Gillet, Noel E. O'Connor, Gaël Richard:
Visual analysis for drum sequence transcription. EUSIPCO 2007: 312-316 - [c38]Chloé Clavel, Laurence Devillers, Gaël Richard, Ioana Vasilescu, Thibaut Ehrette:
Detection and Analysis of Abnormal Situations Through Fear-Type Acoustic Manifestations. ICASSP (4) 2007: 21-24 - [c37]Nancy Bertin, Roland Badeau, Gaël Richard:
Blind Signal Decompositions for Automatic Transcription of Polyphonic Music: NMF and K-SVD on the Benchmark. ICASSP (1) 2007: 65-68 - [c36]Gaël Richard, Mathieu Ramona, Slim Essid:
Combined Supervised and Unsupervised Approaches for Automatic Segmentation of Radiophonic Audio Streams. ICASSP (2) 2007: 461-464 - [c35]Roland Badeau, Bertrand David, Gaël Richard:
Conjugate Gradient Algorithms for Minor Subspace Analysis. ICASSP (3) 2007: 1013-1016 - [c34]Olivier Gillet, Gaël Richard:
Supervised and Unsupervised Sequence Modelling for Drum Transcription. ISMIR 2007: 219-224 - 2006
- [j11]Chloé Clavel, Ioana Vasilescu, Gaël Richard, Laurence Devillers:
De la construction du corpus émotionnel au système de détection. Le point de vue applicatif de la surveillance dans les lieux publics. Rev. d'Intelligence Artif. 20(4-5): 529-551 (2006) - [j10]Slim Essid, Gaël Richard, Bertrand David:
Instrument recognition in polyphonic music based on automatic taxonomies. IEEE Trans. Speech Audio Process. 14(1): 68-80 (2006) - [j9]Olivier Derrien, Pierre Duhamel, Maurice Charbit, Gaël Richard:
A new quantization optimization algorithm for the MPEG advanced audio coder using a statistical subband model of the quantization noise. IEEE Trans. Speech Audio Process. 14(4): 1328-1339 (2006) - [j8]Slim Essid, Gaël Richard, Bertrand David:
Musical instrument recognition by pairwise classification strategies. IEEE Trans. Speech Audio Process. 14(4): 1401-1412 (2006) - [j7]Roland Badeau, Bertrand David, Gaël Richard:
A new perturbation analysis for signal enumeration in rotational invariance techniques. IEEE Trans. Signal Process. 54(2): 450-458 (2006) - [j6]Roland Badeau, Bertrand David, Gaël Richard:
High-resolution spectral analysis of mixtures of complex exponentials modulated by polynomials. IEEE Trans. Signal Process. 54(4): 1341-1350 (2006) - [c33]Michaël Betser, Patrice Collen, Gaël Richard:
Frequency estimation based on adjacent DFT bins. EUSIPCO 2006: 1-5 - [c32]Olivier Gillet, Gaël Richard:
Comparing Audio and Video Segmentations for Music Videos Indexing. ICASSP (5) 2006: 21-24 - [c31]Bertrand David, Roland Badeau, Gaël Richard:
Hrhatrac Algorithm for Spectral Line Tracking of Musical Signals. ICASSP (3) 2006: 45-48 - [c30]Roland Badeau, Bertrand David, Gaël Richard:
Yast Algorithm for Minor Subspace Tracking. ICASSP (3) 2006: 552-555 - [c29]Slim Essid, Gaël Richard, Bertrand David:
Hierarchical Classification of Musical Instruments on Solo Recordings. ICASSP (5) 2006: 817-820 - [c28]Olivier Gillet, Gaël Richard:
ENST-Drums: an extensive audio-visual database for drum signals processing. ISMIR 2006: 156-159 - [c27]Chloé Clavel, Ioana Vasilescu, Laurence Devillers, Thibaut Ehrette, Gaël Richard:
Fear-type emotions of the SAFE Corpus: annotation issues. LREC 2006: 1099-1104 - 2005
- [j5]Olivier Gillet, Gaël Richard:
Drum Loops Retrieval from Spoken Queries. J. Intell. Inf. Syst. 24(2-3): 159-177 (2005) - [j4]Roland Badeau, Bertrand David, Gaël Richard:
Fast approximated power iteration subspace tracking. IEEE Trans. Signal Process. 53(8-1): 2931-2941 (2005) - [c26]Olivier Gillet, Gaël Richard:
Automatic transcription of drum sequences using audiovisual features. ICASSP (3) 2005: 205-208 - [c25]Slim Essid, Gaël Richard, Bertrand David:
Instrument recognition in polyphonic music. ICASSP (3) 2005: 245-248 - [c24]Mathieu Franck Guillaume, Yves Grenier, Gaël Richard:
Iterative algorithms for multichannel equalization in sound reproduction systems. ICASSP (3) 2005: 269-272 - [c23]Roland Badeau, Bertrand David, Gaël Richard:
Yet another subspace tracker. ICASSP (4) 2005: 329-332 - [c22]Miguel A. Alonso, Gaël Richard, Bertrand David:
Extracting note onsets from musical recordings. ICME 2005: 896-899 - [c21]Chloé Clavel, Thibaut Ehrette, Gaël Richard:
Events Detection for an Audio-Based Surveillance System. ICME 2005: 1306-1309 - [c20]Olivier Gillet, Gaël Richard:
Drum Track Transcription of Polyphonic Music Using Noise Subspace Projection. ISMIR 2005: 92-99 - [c19]Slim Essid, Gaël Richard, Bertrand David:
Inferring Efficient Hierarchical Taxonomies for MIR Tasks: Application to Musical Instruments. ISMIR 2005: 324-328 - 2004
- [j3]Roland Badeau, Gaël Richard, Bertrand David:
Sliding window adaptive SVD algorithms. IEEE Trans. Signal Process. 52(1): 1-10 (2004) - [c18]Slim Essid, Gaël Richard, Bertrand David:
Musical instrument recognition on solo performances. EUSIPCO 2004: 1289-1292 - [c17]Olivier Gillet, Gaël Richard:
Automatic transcription of drum loops. ICASSP (4) 2004: 269-272 - [c16]Roland Badeau, Bertrand David, Gaël Richard:
Selecting the modeling order for the ESPRIT high resolution method: an alternative approach. ICASSP (2) 2004: 1025-1028 - [c15]Miguel A. Alonso, Gaël Richard, Bertrand David:
Tempo And Beat Estimation Of Musical Signals. ISMIR 2004 - [c14]Laurent Daudet, Gaël Richard, Pierre Leveau:
Methodology and Tools for the evaluation of automatic onset detection algorithms in music. ISMIR 2004 - [c13]Slim Essid, Gaël Richard, Bertrand David:
Musical instrument recognition based on class pairwise feature selection. ISMIR 2004 - 2003
- [c12]Roland Badeau, Gaël Richard, Bertrand David:
Adaptive ESPRIT algorithm based on the PAST subspace tracker. ICASSP (6) 2003: 229-232 - [c11]Roland Badeau, Karim Abed-Meraim, Gaël Richard, Bertrand David:
Sliding window orthonormal PAST algorithm. ICASSP (5) 2003: 261-264 - [c10]Olivier Gillet, Gaël Richard:
Automatic labeling of tabla signals. ISMIR 2003 - [c9]Roland Badeau, Gaël Richard, Bertrand David, Karim Abed-Meraim:
Approximated power iterations for fast subspace tracking. ISSPA (2) 2003: 583-586 - 2001
- [j2]Henk van den Heuvel, Louis Boves, Asunción Moreno, Maurizio Omologo, Gaël Richard, Eric Sanders:
Annotation in the SpeechDat Projects. Int. J. Speech Technol. 4(2): 127-143 (2001) - 2000
- [c8]Asunción Moreno, Børge Lindberg, Christoph Draxler, Gaël Richard, Khalid Choukri, Stephan Euler, Jeffrey Allen:
SPEECHDAT-CAR. A Large Speech Database for Automotive Environments. LREC 2000
1990 – 1999
- 1999
- [c7]Anastasios Tefas, Yann Menguy, Constantine Kotropoulos, Gaël Richard, Ioannis Pitas, Philip Lockwood:
Compensating for variable recording conditions in frontal face authentication algorithms. ICASSP 1999: 3561-3564 - [c6]Gaël Richard, Yann Menguy, I. Guis, N. Suaudeau, Jérôme Boudy, Philip Lockwood, C. Fernández, Fernando Fernández, Constantine Kotropoulos, Anastasios Tefas, Ioannis Pitas, R. Heimgartner, Peter Ryser, Charles Beumier, Patrick Verlinde, Steven Pigeon, George Matas, Josef Kittler, Josef Bigün, Yousri Abdeljaoued, Eric Meurville, Laurent Besacier, Michael Ansorge, Gilbert Maître, Juergen Luettin, S. Ben-Yacoub, Belén Ruíz, K. Aldama, J. Cortes:
Multi Modal Verification for Teleservices and Security Applications (M2VTS). ICMCS, Vol. 2 1999: 1061-1064 - [c5]Henk van den Heuvel, Jérôme Boudy, Robrecht Comeyne, Stephan Euler, Asunción Moreno, Gaël Richard:
The speechdat-car multilingual speech databases for in-car applications: some first validation results. EUROSPEECH 1999 - 1997
- [c4]Samir Chennoukh, Daniel J. Sinder, Gaël Richard, James L. Flanagan:
Voice mimic system using an articulatory codebook for estimation of vocal tract shape. EUROSPEECH 1997: 429-432 - 1996
- [j1]Gaël Richard, Christophe d'Alessandro:
Analysis/synthesis and modification of the speech aperiodic component. Speech Commun. 19(3): 221-244 (1996) - 1995
- [c3]Gaël Richard, M. Liu, D. Snider, H. Duncan, Qiguang Lin, James L. Flanagan, Stephen E. Levinson, Donald Davis, Scott Slimon:
Numerical simulations of fluid flow in the vocal tract. EUROSPEECH 1995: 1297-1300 - 1994
- [c2]Gaël Richard, Christophe d'Alessandro:
Time-domain analysis/synthesis of the aperiodic component of speech signals. SSW 1994: 5-8 - 1993
- [c1]Sophie Grau, Christophe d'Alessandro, Gaël Richard:
A speech formant synthesizer based on harmonic + random formant-waveforms representations. EUROSPEECH 1993: 1697-1700
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-01-27 00:50 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint