en.unionpedia.org

CMU Sphinx, the Glossary

Index CMU Sphinx

CMU Sphinx, also called Sphinx for short, is the general term to describe a group of speech recognition systems developed at Carnegie Mellon University.[1]

Table of Contents

  1. 28 relations: Acoustic model, ARM architecture family, Asterisk (PBX), Berkeley Software Distribution, BSD licenses, C (programming language), Carnegie Mellon University, CMU Pronouncing Dictionary, ConfDesigner, Cross-platform software, Hidden Markov model, Java (programming language), Kai-Fu Lee, Kevin Lenzo, Language model, Library (computing), List of speech recognition software, Massachusetts Institute of Technology, Mixture model, N-gram, Open-source software, Project LISTEN, Public domain, SourceForge, Speech recognition, Speech recognition software for Linux, Sun Microsystems, Xuedong Huang.

  2. Speech recognition software

Acoustic model

An acoustic model is used in automatic speech recognition to represent the relationship between an audio signal and the phonemes or other linguistic units that make up speech.

See CMU Sphinx and Acoustic model

ARM architecture family

ARM (stylised in lowercase as arm, formerly an acronym for Advanced RISC Machines and originally Acorn RISC Machine) is a family of RISC instruction set architectures (ISAs) for computer processors.

See CMU Sphinx and ARM architecture family

Asterisk (PBX)

Asterisk is a software implementation of a private branch exchange (PBX).

See CMU Sphinx and Asterisk (PBX)

Berkeley Software Distribution

The Berkeley Software Distribution or Berkeley Standard Distribution (BSD) is a discontinued operating system based on Research Unix, developed and distributed by the Computer Systems Research Group (CSRG) at the University of California, Berkeley.

See CMU Sphinx and Berkeley Software Distribution

BSD licenses

BSD licenses are a family of permissive free software licenses, imposing minimal restrictions on the use and distribution of covered software.

See CMU Sphinx and BSD licenses

C (programming language)

C (pronounced – like the letter c) is a general-purpose programming language.

See CMU Sphinx and C (programming language)

Carnegie Mellon University

Carnegie Mellon University (CMU) is a private research university in Pittsburgh, Pennsylvania.

See CMU Sphinx and Carnegie Mellon University

CMU Pronouncing Dictionary

The CMU Pronouncing Dictionary (also known as CMUdict) is an open-source pronouncing dictionary originally created by the Speech Group at Carnegie Mellon University (CMU) for use in speech recognition research. CMU Sphinx and CMU Pronouncing Dictionary are software using the BSD license.

See CMU Sphinx and CMU Pronouncing Dictionary

ConfDesigner

ConfDesigner is a graphical environment written in Java, which eases the design of complex system configurations.

See CMU Sphinx and ConfDesigner

Cross-platform software

In computing, cross-platform software (also called multi-platform software, platform-agnostic software, or platform-independent software) is computer software that is designed to work in several computing platforms.

See CMU Sphinx and Cross-platform software

A hidden Markov model (HMM) is a Markov model in which the observations are dependent on a latent (or "hidden") Markov process (referred to as X). An HMM requires that there be an observable process Y whose outcomes depend on the outcomes of X in a known way.

See CMU Sphinx and Hidden Markov model

Java (programming language)

Java is a high-level, class-based, object-oriented programming language that is designed to have as few implementation dependencies as possible.

See CMU Sphinx and Java (programming language)

Kai-Fu Lee

Kai-Fu Lee (born December 3, 1961) is a Taiwanese businessman, computer scientist, investor, and writer.

See CMU Sphinx and Kai-Fu Lee

Kevin Lenzo

Kevin Lenzo (born 1967) is an American computer scientist.

See CMU Sphinx and Kevin Lenzo

Language model

A language model is a probabilistic model of a natural language.

See CMU Sphinx and Language model

Library (computing)

In computer science, a library is a collection of read-only resources that is leveraged during software development to implement a computer program.

See CMU Sphinx and Library (computing)

List of speech recognition software

Speech recognition software is available for many computing platforms, operating systems, use models, and software licenses. CMU Sphinx and List of speech recognition software are speech recognition software.

See CMU Sphinx and List of speech recognition software

Massachusetts Institute of Technology

The Massachusetts Institute of Technology (MIT) is a private land-grant research university in Cambridge, Massachusetts.

See CMU Sphinx and Massachusetts Institute of Technology

Mixture model

In statistics, a mixture model is a probabilistic model for representing the presence of subpopulations within an overall population, without requiring that an observed data set should identify the sub-population to which an individual observation belongs.

See CMU Sphinx and Mixture model

N-gram

An n-gram is a sequence of n adjacent symbols in particular order.

See CMU Sphinx and N-gram

Open-source software

Open-source software (OSS) is computer software that is released under a license in which the copyright holder grants users the rights to use, study, change, and distribute the software and its source code to anyone and for any purpose.

See CMU Sphinx and Open-source software

Project LISTEN

Project LISTEN (Literacy Innovation that Speech Technology ENables) was a 25-year research project at Carnegie Mellon University to improve children's reading skills.

See CMU Sphinx and Project LISTEN

Public domain

The public domain (PD) consists of all the creative work to which no exclusive intellectual property rights apply.

See CMU Sphinx and Public domain

SourceForge

SourceForge is a web service that offers software consumers a centralized online location to control and manage open-source software projects and research business software.

See CMU Sphinx and SourceForge

Speech recognition

Speech recognition is an interdisciplinary subfield of computer science and computational linguistics that develops methodologies and technologies that enable the recognition and translation of spoken language into text by computers.

See CMU Sphinx and Speech recognition

Speech recognition software for Linux

As of the early 2000s, several speech recognition (SR) software packages exist for Linux.

See CMU Sphinx and Speech recognition software for Linux

Sun Microsystems

Sun Microsystems, Inc. (Sun for short) was an American technology company that sold computers, computer components, software, and information technology services and created the Java programming language, the Solaris operating system, ZFS, the Network File System (NFS), and SPARC microprocessors.

See CMU Sphinx and Sun Microsystems

Xuedong Huang

Xuedong David Huang (born October 20, 1962) is a Chinese American computer scientist and technology executive who has made contributions to spoken language processing and artificial intelligence, including Azure AI Services.

See CMU Sphinx and Xuedong Huang

See also

Speech recognition software

References

[1] https://en.wikipedia.org/wiki/CMU_Sphinx