CMU Sphinx, the Glossary
CMU Sphinx, also called Sphinx for short, is the general term to describe a group of speech recognition systems developed at Carnegie Mellon University.[1]
Table of Contents
28 relations: Acoustic model, ARM architecture family, Asterisk (PBX), Berkeley Software Distribution, BSD licenses, C (programming language), Carnegie Mellon University, CMU Pronouncing Dictionary, ConfDesigner, Cross-platform software, Hidden Markov model, Java (programming language), Kai-Fu Lee, Kevin Lenzo, Language model, Library (computing), List of speech recognition software, Massachusetts Institute of Technology, Mixture model, N-gram, Open-source software, Project LISTEN, Public domain, SourceForge, Speech recognition, Speech recognition software for Linux, Sun Microsystems, Xuedong Huang.
- Speech recognition software
Acoustic model
An acoustic model is used in automatic speech recognition to represent the relationship between an audio signal and the phonemes or other linguistic units that make up speech.
See CMU Sphinx and Acoustic model
ARM architecture family
ARM (stylised in lowercase as arm, formerly an acronym for Advanced RISC Machines and originally Acorn RISC Machine) is a family of RISC instruction set architectures (ISAs) for computer processors.
See CMU Sphinx and ARM architecture family
Asterisk (PBX)
Asterisk is a software implementation of a private branch exchange (PBX).
See CMU Sphinx and Asterisk (PBX)
Berkeley Software Distribution
The Berkeley Software Distribution or Berkeley Standard Distribution (BSD) is a discontinued operating system based on Research Unix, developed and distributed by the Computer Systems Research Group (CSRG) at the University of California, Berkeley.
See CMU Sphinx and Berkeley Software Distribution
BSD licenses
BSD licenses are a family of permissive free software licenses, imposing minimal restrictions on the use and distribution of covered software.
See CMU Sphinx and BSD licenses
C (programming language)
C (pronounced – like the letter c) is a general-purpose programming language.
See CMU Sphinx and C (programming language)
Carnegie Mellon University
Carnegie Mellon University (CMU) is a private research university in Pittsburgh, Pennsylvania.
See CMU Sphinx and Carnegie Mellon University
CMU Pronouncing Dictionary
The CMU Pronouncing Dictionary (also known as CMUdict) is an open-source pronouncing dictionary originally created by the Speech Group at Carnegie Mellon University (CMU) for use in speech recognition research. CMU Sphinx and CMU Pronouncing Dictionary are software using the BSD license.
See CMU Sphinx and CMU Pronouncing Dictionary
ConfDesigner
ConfDesigner is a graphical environment written in Java, which eases the design of complex system configurations.
See CMU Sphinx and ConfDesigner
Cross-platform software
In computing, cross-platform software (also called multi-platform software, platform-agnostic software, or platform-independent software) is computer software that is designed to work in several computing platforms.
See CMU Sphinx and Cross-platform software
A hidden Markov model (HMM) is a Markov model in which the observations are dependent on a latent (or "hidden") Markov process (referred to as X). An HMM requires that there be an observable process Y whose outcomes depend on the outcomes of X in a known way.
See CMU Sphinx and Hidden Markov model
Java (programming language)
Java is a high-level, class-based, object-oriented programming language that is designed to have as few implementation dependencies as possible.
See CMU Sphinx and Java (programming language)
Kai-Fu Lee
Kai-Fu Lee (born December 3, 1961) is a Taiwanese businessman, computer scientist, investor, and writer.
Kevin Lenzo
Kevin Lenzo (born 1967) is an American computer scientist.
See CMU Sphinx and Kevin Lenzo
Language model
A language model is a probabilistic model of a natural language.
See CMU Sphinx and Language model
Library (computing)
In computer science, a library is a collection of read-only resources that is leveraged during software development to implement a computer program.
See CMU Sphinx and Library (computing)
List of speech recognition software
Speech recognition software is available for many computing platforms, operating systems, use models, and software licenses. CMU Sphinx and List of speech recognition software are speech recognition software.
See CMU Sphinx and List of speech recognition software
Massachusetts Institute of Technology
The Massachusetts Institute of Technology (MIT) is a private land-grant research university in Cambridge, Massachusetts.
See CMU Sphinx and Massachusetts Institute of Technology
Mixture model
In statistics, a mixture model is a probabilistic model for representing the presence of subpopulations within an overall population, without requiring that an observed data set should identify the sub-population to which an individual observation belongs.
See CMU Sphinx and Mixture model
N-gram
An n-gram is a sequence of n adjacent symbols in particular order.
Open-source software
Open-source software (OSS) is computer software that is released under a license in which the copyright holder grants users the rights to use, study, change, and distribute the software and its source code to anyone and for any purpose.
See CMU Sphinx and Open-source software
Project LISTEN
Project LISTEN (Literacy Innovation that Speech Technology ENables) was a 25-year research project at Carnegie Mellon University to improve children's reading skills.
See CMU Sphinx and Project LISTEN
Public domain
The public domain (PD) consists of all the creative work to which no exclusive intellectual property rights apply.
See CMU Sphinx and Public domain
SourceForge
SourceForge is a web service that offers software consumers a centralized online location to control and manage open-source software projects and research business software.
See CMU Sphinx and SourceForge
Speech recognition
Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers.
See CMU Sphinx and Speech recognition
Speech recognition software for Linux
As of the early 2000s, several speech recognition (SR) software packages exist for Linux.
See CMU Sphinx and Speech recognition software for Linux
Sun Microsystems
Sun Microsystems, Inc. (Sun for short) was an American technology company that sold computers, computer components, software, and information technology services and created the Java programming language, the Solaris operating system, ZFS, the Network File System (NFS), and SPARC microprocessors.
See CMU Sphinx and Sun Microsystems
Xuedong Huang
Xuedong David Huang (born October 20, 1962) is a Chinese American computer scientist and technology executive who has made contributions to spoken language processing and artificial intelligence, including Azure AI Services.
See CMU Sphinx and Xuedong Huang
See also
Speech recognition software
- Braina
- CMU Sphinx
- CallMiner
- Common Voice
- Dragon Dictation
- Dragon NaturallySpeaking
- DragonDictate
- HTK (software)
- IBM Shoebox
- IBM ViaVoice
- Janus Recognition Toolkit
- Jott
- Julius (software)
- Kaldi (software)
- Lessac Technologies
- List of speech recognition software
- MacSpeech
- MacSpeech Scribe
- Microsoft Office 2003
- Microsoft Office XP
- Microsoft Voice Command
- Modular Audio Recognition Framework
- Nortel Speech Server
- Oasis (software)
- Pronunciation Lexicon Specification
- QuickFuse
- RWTH ASR
- Regulus Grammar Compiler
- SILVIA
- Shrutlekhan-Rajbhasha
- Speechmatics
- Tazti
- Voice Finger
- Voice Navigator
- VoxForge
- Whisper (speech recognition system)
- Windows Speech Recognition
- WordQ+SpeakQ
- Zeroth (software)