CMU Sphinx

CMU Sphinx is a collection of speech recognition systems developed at Carnegie Mellon University, including Sphinx 2 to Sphinx 4 and SphinxTrain. The systems utilize various acoustic models and have been made open-source, with the latest developments focusing on flexibility and advanced recognition techniques. PocketSphinx is a version designed for embedded systems, while Sphinx 4 represents a complete rewrite aimed at research and development in speech recognition.

Uploaded by

rhea.stuart.russell

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

26 views3 pages

CMU Sphinx

Uploaded by

rhea.stuart.russell

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 3

CMU Sphinx

CMU Sphinx, also called Sphinx for short, is the

Sphinx4
general term to describe a group of speech recognition
systems developed at Carnegie Mellon University. Stable release 5-prealpha / August 3,
These include a series of speech recognizers (Sphinx 2 2015
- 4) and an acoustic model trainer (SphinxTrain). Written in Java

In 2000, the Sphinx group at Carnegie Mellon Operating system Cross-platform

committed to open source several speech recognizer Type Image library
components, including Sphinx 2 and later Sphinx 3 (in License BSD-style[1]
2001). The speech decoders come with acoustic Website cmusphinx.github.io/wiki/
models and sample applications. The available (https://cmusphinx.github.i
resources include in addition software for acoustic o/wiki/)
model training, language model compilation and a
public domain pronunciation dictionary, cmudict.
Pocketsphinx
Sphinx encompasses a number of software systems, Stable release 5-prealpha / August 5,
described below. 2015
Written in C

Sphinx Operating system Cross-platform

Type Image library
Sphinx is a continuous-speech, speaker-independent License BSD-style
recognition system making use of hidden Markov
Website cmusphinx.github.io/wiki/
acoustic models (HMMs) and an n-gram statistical
(https://cmusphinx.github.i
language model. It was developed by Kai-Fu Lee.
o/wiki/)
Sphinx featured feasibility of continuous-speech,
speaker-independent large-vocabulary recognition, the
possibility of which was in dispute at the time (1986).[2]

Sphinx is of historical interest only; it has been superseded in performance by subsequent versions.

Sphinx 2
A fast performance-oriented recognizer, originally developed by Xuedong Huang at Carnegie Mellon and
released as open-source with a BSD-style license on SourceForge by Kevin Lenzo at LinuxWorld in
2000. Sphinx 2 focuses on real-time recognition suitable for spoken language applications. As such it
incorporates functionality such as end-pointing, partial hypothesis generation, dynamic language model
switching and so on. It is used in dialog systems and language learning systems. It can be used in
computer based PBX systems such as Asterisk. Sphinx 2 code has also been incorporated into a number
of commercial products. It is no longer under active development (other than for routine maintenance).
Current real-time decoder development is taking place in the Pocket Sphinx project.[3]

Sphinx 3
Sphinx 2 used a semi-continuous representation for acoustic modeling (i.e., a single set of Gaussians is
used for all models, with individual models represented as a weight vector over these Gaussians). Sphinx
3 adopted the prevalent continuous HMM representation and has been used primarily for high-accuracy,
non-real-time recognition. Recent developments (in algorithms and in hardware) have made Sphinx 3
"near" real-time, although not yet suitable for critical interactive applications. Sphinx 3 is under active
development and in conjunction with SphinxTrain provides access to a number of modern modeling
techniques, such as LDA/MLLT, MLLR and VTLN, that improve recognition accuracy (see the article on
Speech Recognition for descriptions of these techniques).

Sphinx 4
Sphinx 4 is a complete rewrite of the Sphinx engine with the goal of providing a more flexible framework
for research in speech recognition, written entirely in the Java programming language. Sun Microsystems
supported the development of Sphinx 4 and contributed software engineering expertise to the project.
Participants included individuals at MERL, MIT and CMU. (Currently supported languages are C, C++,
C#, Python, Ruby, Java, and JavaScript.)

Current development goals include:

developing a new (acoustic model) trainer

implementing speaker adaptation (e.g. MLLR)
improving configuration management
creating a graph-based UI for graphical system design

PocketSphinx
A version of Sphinx that can be used in embedded systems (e.g., based on an ARM processor).
PocketSphinx is under active development and incorporates features such as fixed-point arithmetic and
efficient algorithms for GMM computation.

See also
Speech recognition software for Linux
List of speech recognition software
Project LISTEN

References
1. http://www.speech.cs.cmu.edu/sphinx
2. Lee, K.-F.; Hon, H.-W.; Reddy, R. (January 1990). "An overview of the SPHINX speech
recognition system" (https://ieeexplore.ieee.org/document/45616). IEEE Transactions on
Acoustics, Speech, and Signal Processing. 38 (1): 35–45. doi:10.1109/29.45616 (https://doi.
org/10.1109%2F29.45616).
3. Huang, Xuedong; Alleva, Fileno; Hwang, Mei-Yuh; Rosenfeld, Ronald (1993). "An overview
of the SPHINX-II speech recognition system" (https://dx.doi.org/10.3115/1075671.1075690).
Proceedings of the Workshop on Human Language Technology - HLT '93. Morristown, NJ,
USA: Association for Computational Linguistics: 81. doi:10.3115/1075671.1075690 (https://d
oi.org/10.3115%2F1075671.1075690). ISBN 1-55860-324-7.

External links
Sphinx developers recommend Vosk now (https://alphacephei.com/vosk/)
CMU Sphinx homepage (https://cmusphinx.github.io/wiki/)
Sphinx' repository (https://github.com/cmusphinx) on GitHub should be considered the
definitive source for code
SourceForge (http://sourceforge.net/projects/cmusphinx) hosts older releases and files
NeXT on Campus Fall 1990 (https://web.archive.org/web/20170324083105/http://nextstuff.in
fo/mirrors/otto/html/pub/Documents/user-groups/OnCampus/NOCFall90/NOCFall90Text.ps.
gz) (This document is postscript format compressed with gzip.) Carnegie Mellon University -
Breakthroughs in speech recognition and document management, pgs. 12-13

Retrieved from "https://en.wikipedia.org/w/index.php?title=CMU_Sphinx&oldid=1263415119"

Final Python
No ratings yet
Final Python
718 pages
Voice Recognition
60% (5)
Voice Recognition
31 pages
Programming Logic Concepts
No ratings yet
Programming Logic Concepts
81 pages
Text To Speech Converter Documentation
50% (4)
Text To Speech Converter Documentation
28 pages
Chapter 1 - Introduction: Dept. of Electronics and Communication Engineering 1
0% (1)
Chapter 1 - Introduction: Dept. of Electronics and Communication Engineering 1
38 pages
Java Voice Assistant Paper
No ratings yet
Java Voice Assistant Paper
4 pages
Sign Language Translator
100% (1)
Sign Language Translator
4 pages
Ranjith S - Mini Project
No ratings yet
Ranjith S - Mini Project
72 pages
Lec01 Introduction
No ratings yet
Lec01 Introduction
65 pages
Voice Response System
0% (1)
Voice Response System
74 pages
Introduction To Python Programming: Text Book: Core Python Programming, Wesley J. Chun, Second Edition, Pearson
No ratings yet
Introduction To Python Programming: Text Book: Core Python Programming, Wesley J. Chun, Second Edition, Pearson
28 pages
Pocket Sphinx
No ratings yet
Pocket Sphinx
31 pages
Speech Integration in SH1
No ratings yet
Speech Integration in SH1
36 pages
Java Programming Using Voice Input:: Adding Java Support To Voicecode
No ratings yet
Java Programming Using Voice Input:: Adding Java Support To Voicecode
16 pages
Oops 1 Notes
No ratings yet
Oops 1 Notes
39 pages
Viva Speech
100% (1)
Viva Speech
4 pages
Sphinx 4
No ratings yet
Sphinx 4
18 pages
Complete Python Notes
No ratings yet
Complete Python Notes
42 pages
Development of Speech Recognition System Based On CMUSphinx For Khmer Language
No ratings yet
Development of Speech Recognition System Based On CMUSphinx For Khmer Language
6 pages
Lesson 1
No ratings yet
Lesson 1
53 pages
Logic Chapter04 Predicate Logic
No ratings yet
Logic Chapter04 Predicate Logic
11 pages
Speech Recognition For Mobile Systems: BY: Pratibha Channamsetty Shruthi Sambasivan
No ratings yet
Speech Recognition For Mobile Systems: BY: Pratibha Channamsetty Shruthi Sambasivan
36 pages
Enfoques Programacion
No ratings yet
Enfoques Programacion
12 pages
Automatic Urdu Speech Recognition Using
No ratings yet
Automatic Urdu Speech Recognition Using
5 pages
HISTORY OF PROGRAMMING LANGUAGES Ali Zaidi - 049
No ratings yet
HISTORY OF PROGRAMMING LANGUAGES Ali Zaidi - 049
9 pages
Introduction To Computing (Using Python) : Evolution of Programming Languages, Software Requirements For Programming
No ratings yet
Introduction To Computing (Using Python) : Evolution of Programming Languages, Software Requirements For Programming
30 pages
Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System For Hand-Held Devices
No ratings yet
Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System For Hand-Held Devices
4 pages
Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System For Hand-Held Devices
No ratings yet
Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System For Hand-Held Devices
4 pages
Sign Language Converter
No ratings yet
Sign Language Converter
4 pages
PYTHON
No ratings yet
PYTHON
68 pages
Python Lesson1
No ratings yet
Python Lesson1
23 pages
Sorting Synopsis
No ratings yet
Sorting Synopsis
10 pages
Speech Recognition System: Surabhi Bansal Ruchi Bahety
No ratings yet
Speech Recognition System: Surabhi Bansal Ruchi Bahety
5 pages
A Greek Voice Recognition Interface For ROV Applications, Using Machine Learning Technologies and The CMU Sphinx Platform
No ratings yet
A Greek Voice Recognition Interface For ROV Applications, Using Machine Learning Technologies and The CMU Sphinx Platform
11 pages
3 Features of Python
No ratings yet
3 Features of Python
16 pages
Building An Application With Sphinx4
No ratings yet
Building An Application With Sphinx4
7 pages
PPL Asg
No ratings yet
PPL Asg
16 pages
Sharika Malayalam Speech Recognition System: Shyam.k MES College of Engineering, Kuttipuram
No ratings yet
Sharika Malayalam Speech Recognition System: Shyam.k MES College of Engineering, Kuttipuram
4 pages
CMU Sphinx
No ratings yet
CMU Sphinx
3 pages
Voice - Assistant - Research Paper
No ratings yet
Voice - Assistant - Research Paper
6 pages
Science Mathematics Engineering Art Software Engineering
No ratings yet
Science Mathematics Engineering Art Software Engineering
3 pages
CMU Sphinx: Speech Recognition Toolkit
No ratings yet
CMU Sphinx: Speech Recognition Toolkit
1 page
Dreu Paper
No ratings yet
Dreu Paper
5 pages
Computer Basics For Wouldbe Programmers
No ratings yet
Computer Basics For Wouldbe Programmers
37 pages
Lecture1 Introduction To Python 3 June 2024
No ratings yet
Lecture1 Introduction To Python 3 June 2024
55 pages
Sphinx Speech Recognition
No ratings yet
Sphinx Speech Recognition
5 pages
Ijseas On Audio To Sign
No ratings yet
Ijseas On Audio To Sign
6 pages
Speech Recognition As Emerging Revolutionary Technology
No ratings yet
Speech Recognition As Emerging Revolutionary Technology
4 pages
Introduction To Speech Recognition
No ratings yet
Introduction To Speech Recognition
3 pages
The Source Code
No ratings yet
The Source Code
3 pages
IT Paper
No ratings yet
IT Paper
3 pages
Ignition Scripting
0% (1)
Ignition Scripting
152 pages
BI Trouble Shotting (364547.1)
No ratings yet
BI Trouble Shotting (364547.1)
13 pages
4 Basic PLC Programming
100% (1)
4 Basic PLC Programming
30 pages
DevOps Roadmap by CloudChamp
No ratings yet
DevOps Roadmap by CloudChamp
18 pages
Smart Money Concept Indicator
No ratings yet
Smart Money Concept Indicator
11 pages
Numerical Method Using Python: (MCSC-202)
No ratings yet
Numerical Method Using Python: (MCSC-202)
41 pages
Data Structure Using C Laboratory Manual
No ratings yet
Data Structure Using C Laboratory Manual
12 pages
ABAP General Naming Standards Quick Reference
No ratings yet
ABAP General Naming Standards Quick Reference
3 pages
Python Lab Record
No ratings yet
Python Lab Record
81 pages
Siebel Interview Question
No ratings yet
Siebel Interview Question
14 pages
Arm Assembly Language Programming
100% (1)
Arm Assembly Language Programming
9 pages
MCA Syllabus 2018
No ratings yet
MCA Syllabus 2018
14 pages
Automated Testing
No ratings yet
Automated Testing
56 pages
Blockchain & Smart Contract Security
No ratings yet
Blockchain & Smart Contract Security
52 pages
Design Verification With SystemVerilog - UVM - Udemy
0% (1)
Design Verification With SystemVerilog - UVM - Udemy
6 pages
Stockfish (Chess)
No ratings yet
Stockfish (Chess)
28 pages
May 2010 Infor Tech Past Paper 2
No ratings yet
May 2010 Infor Tech Past Paper 2
12 pages
Project Report G3
No ratings yet
Project Report G3
51 pages
Frrouting Developers Guide
No ratings yet
Frrouting Developers Guide
321 pages
Lect 6 Programming Logic Using C' Data Types
No ratings yet
Lect 6 Programming Logic Using C' Data Types
30 pages
Info Resume
No ratings yet
Info Resume
2 pages
Mycin
No ratings yet
Mycin
5 pages
Matheamatical Libray Classes
No ratings yet
Matheamatical Libray Classes
15 pages
Minds DB
No ratings yet
Minds DB
4 pages
PARRY
No ratings yet
PARRY
2 pages
Chinese Room
No ratings yet
Chinese Room
28 pages
Jabberwacky
No ratings yet
Jabberwacky
2 pages
Aman Babu S Resume
No ratings yet
Aman Babu S Resume
1 page
Modern C++ For Absolute Beginners: A Friendly Introduction To C++ Programming Language and C++11 To C++20 Standards 1st Edition Slobodan Dmitrovi
No ratings yet
Modern C++ For Absolute Beginners: A Friendly Introduction To C++ Programming Language and C++11 To C++20 Standards 1st Edition Slobodan Dmitrovi
49 pages
Comparison of Deep Learning Software - Wikipedia
No ratings yet
Comparison of Deep Learning Software - Wikipedia
4 pages
CPIT110 - Chapter 5
No ratings yet
CPIT110 - Chapter 5
227 pages
Web Services Core Programming Guide
No ratings yet
Web Services Core Programming Guide
24 pages
Data Applied
No ratings yet
Data Applied
1 page
Artificial Intelligence Markup Language
No ratings yet
Artificial Intelligence Markup Language
4 pages
Synthetic Environment For Analysis and Simulations
No ratings yet
Synthetic Environment For Analysis and Simulations
3 pages
Free HAL
No ratings yet
Free HAL
2 pages
List of Artificial Intelligence Projects
No ratings yet
List of Artificial Intelligence Projects
12 pages
Daa File
No ratings yet
Daa File
22 pages
Python作业2
No ratings yet
Python作业2
5 pages
Advanced Bash Shell Scripting Guide - Reference Cards
No ratings yet
Advanced Bash Shell Scripting Guide - Reference Cards
5 pages
Kartik Chauhan Resume
No ratings yet
Kartik Chauhan Resume
1 page
DP - Report of Inventory Managment System
No ratings yet
DP - Report of Inventory Managment System
7 pages
UNIX Shell Scripting Interview Questions, Answers, and Explanations: UNIX Shell Certification Review
From Everand
UNIX Shell Scripting Interview Questions, Answers, and Explanations: UNIX Shell Certification Review
Equity Press
4.5/5 (4)
Shell Scripting: Expert Recipes for Linux, Bash, and more
From Everand
Shell Scripting: Expert Recipes for Linux, Bash, and more
Steve Parker
No ratings yet
Developing Apps with Python and Flet
From Everand
Developing Apps with Python and Flet
Williams Asiedu
No ratings yet
Linux Explained
From Everand
Linux Explained
Sebastien Bronchard
No ratings yet
Mastering Linux Administration: A Comprehensive Guide: The IT Collection
From Everand
Mastering Linux Administration: A Comprehensive Guide: The IT Collection
Christopher Ford
5/5 (1)
Linux: A Comprehensive Guide to Linux Operating System and Command Line
From Everand
Linux: A Comprehensive Guide to Linux Operating System and Command Line
Sam Griffin
No ratings yet
Rust for Beginners
From Everand
Rust for Beginners
Hernando Abella
No ratings yet
Speech Recognition: Fundamentals and Applications
From Everand
Speech Recognition: Fundamentals and Applications
Fouad Sabry
No ratings yet

CMU Sphinx

Uploaded by

CMU Sphinx

Uploaded by

CMU Sphinx

CMU Sphinx, also called Sphinx for short, is the

In 2000, the Sphinx group at Carnegie Mellon Operating system Cross-platform

Sphinx Operating system Cross-platform

Current development goals include:

developing a new (acoustic model) trainer

Retrieved from "https://en.wikipedia.org/w/index.php?title=CMU_Sphinx&oldid=1263415119"

You might also like