patents.google.com

JP2003195890A - Speech operating device - Google Patents

️Wed Jul 09 2003

【発明の詳細な説明】Detailed Description of the Invention

【０００１】[0001]

【発明の属する技術分野】本発明は、音声操作装置に関
するものであり、特に複数の機器を音声で操作する音声
操作装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a voice operating device, and more particularly to a voice operating device for operating a plurality of devices by voice.

【０００２】[0002]

【従来の技術】従来より、利用者の音声指令を認識し
て、機器を操作する音声操作装置が種々提案され、例え
ば特開２００１−３３７６８７号公報には車両搭載用機
器を操作する音声操作装置が開示されている。斯かる音
声操作装置は、マイク，音声認識部及び機器操作制御部
等から構成され、利用者が発した音声指令を前記音声認
識部で認識して、オーディオシステム，ナビゲーション
システム等の機器を音声で操作するものである。2. Description of the Related Art Conventionally, various voice operation devices for recognizing a voice command of a user and operating a device have been proposed. For example, Japanese Patent Laid-Open No. 2001-337687 discloses a voice operation device for operating a vehicle-mounted device. Is disclosed. Such a voice operation device is composed of a microphone, a voice recognition unit, a device operation control unit, and the like. The voice recognition unit recognizes a voice command issued by a user, and a device such as an audio system or a navigation system is voiced. It is something to operate.

【０００３】車両用音声操作装置は、音声指令をマイク
ロフォン（以下、マイクと記す）で入力し、マイクから
出力された音声データを音声認識部で認識し、この認識
結果に基づいて、機器操作制御部を介して機器を操作す
るものである。音声認識部は、機器を操作するための複
数の指令語データからなる指令語データ群を記憶した指
令語記憶部を有しており、マイクから出力された音声デ
ータと、指令語記憶部に記憶されている指令語データと
を各々比較して、類似度が最も高い指令語データを音声
指令として認識する。The voice operation device for a vehicle inputs a voice command with a microphone (hereinafter referred to as a microphone), recognizes voice data output from the microphone with a voice recognition unit, and controls equipment operation based on the recognition result. The device is operated via the unit. The voice recognition unit has a command word storage unit that stores a command word data group composed of a plurality of command word data for operating the device, and stores the voice data output from the microphone and the command word storage unit. The command word data having the highest degree of similarity is recognized as a voice command by comparing each of the command word data.

【０００４】例えば、日本次郎のＣＤ（Compact Disc）
を聴く場合、利用者が「ニホンジロウ」という音声指令
を発すると、その音声指令を入力したマイクが音声デー
タを音声認識部に出力し、音声認識部は指令語記憶部に
記憶されている多数の指令語データを音声データと夫々
比較し、類似度が最も高い指令語データを音声指令とし
て認識して、機器操作制御部がオーディオシステムに日
本次郎のＣＤを再生させる。For example, Jiro Nihon's CD (Compact Disc)
When listening to, when the user issues a voice command "Japanese jiro", the microphone that inputs the voice command outputs voice data to the voice recognition unit, and the voice recognition unit stores a large number of words stored in the command word storage unit. The command word data is compared with the voice data, respectively, and the command word data having the highest degree of similarity is recognized as a voice command, and the device operation control unit causes the audio system to reproduce the CD of Jiro Nihonro.

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、音声操
作装置で操作する機器が複数になると、音声認識部が音
声指令を誤認識する虞があった。例えば、利用者が「日
本四郎」の家の位置をナビゲーションシステムに記憶さ
せていた場合、指令語記憶部にはＣＤを再生するための
「ニホンジロウ」という指令語データと、日本四郎の家
に車両を案内するための「ニホンシロウ」という指令語
データが記憶されているため、「日本次郎」のＣＤを聴
こうと思って、「ニホンジロウ」という音声指令を発し
たときに、前記「ニホンジロウ」という音声指令を「ニ
ホンシロウ」という音声指令であると誤認識する可能性
が高い。However, when a plurality of devices are operated by the voice operation device, the voice recognition unit may erroneously recognize the voice command. For example, if the user has stored the position of "Nihonshiro"'s house in the navigation system, the command word storage section stores the command word data "Nihonjirou" for playing a CD and the vehicle in the Nishiro's house. Since the command word data “Japanese Shiro” for guiding the player is stored, when the voice command “Japanese Jiro” is issued while thinking that the CD of “Nihonjiro” is to be heard, the voice “Japanese Jiro” There is a high possibility that the command is erroneously recognized as a voice command "Japanese Shiro".

【０００６】このような誤認識を低減するために、利用
者が操作する機器も音声で指令することが考えられる。
例えば、まず「オーディオ」という音声指令を発して、
オーディオの操作であることを認識させた後に、「ニホ
ンジロウ」という音声指令を発することにより、ナビゲ
ーションシステムのための指令語である「ニホンシロ
ウ」という指令語データは、「ニホンジロウ」という音
声データと比較される候補ではなくなるため、誤認識を
低減できる。しかし、１つの操作のために複数の音声指
令、即ち「オーディオ」という音声指令と、「ニホンジ
ロウ」という音声指令を入力する必要があり、利用者が
煩わしさを感じるという問題があった。また、音声認識
処理も煩雑となり、プログラム設計が難しくなり、コス
トアップにつながる。本発明は、この問題に鑑みなされ
たものであり、複数の機器を音声で操作することがで
き、且つ、誤認識が少ない音声操作装置を提供するもの
である。また、複数の機器を音声で操作する場合であっ
ても、誤認識が少なく、音声認識処理が簡単な音声操作
装置を提供するものである。In order to reduce such erroneous recognition, it is conceivable that a device operated by the user may also be instructed by voice.
For example, first issue a voice command "audio",
After recognizing that it is an audio operation, by issuing a voice command "Japanese Jiro", the command word data "Japanese Shiro" which is a command word for the navigation system is compared with the voice data "Japanese Jiro". False recognition can be reduced because it is no longer a candidate. However, it is necessary to input a plurality of voice commands, that is, a voice command "audio" and a voice command "Japanese jiro" for one operation, which causes a problem that the user feels troublesome. In addition, the voice recognition process becomes complicated, and the program design becomes difficult, which leads to an increase in cost. The present invention has been made in view of this problem, and provides a voice operation device capable of operating a plurality of devices by voice and causing less erroneous recognition. Further, the present invention provides a voice operation device in which misrecognition is small and voice recognition processing is simple even when a plurality of devices are operated by voice.

【０００７】[0007]

【課題を解決するための手段】本発明は、前記課題を解
決するため、音声指令を入力し音声データに変換する音
声データ出力手段１を備え、前記音声データに基づいて
複数の機器２０，３０，４０，５０を操作する音声操作
装置であって、操作する前記機器２０，３０，４０，５
０を選択する機器選択手段２と、前記機器２０，３０，
４０，５０を操作するための複数の指令語データからな
る指令語データ群８ａ，８ｂ，８ｃ，８ｄを前記機器別
に分類して記憶している指令語記憶部８を有し、前記機
器選択手段２で選択された機器２０，３０，４０，５０
に応じた前記指令語データ群８ａ，８ｂ，８ｃ，８ｄの
中から前記音声指令を認識する音声認識手段３と、を有
するものである。In order to solve the above-mentioned problems, the present invention comprises a voice data output means 1 for inputting a voice command and converting it into voice data, and a plurality of devices 20, 30 based on the voice data. , 40, 50 for operating a voice operating device, the operating device 20, 30, 40, 5
Device selecting means 2 for selecting 0, and the devices 20, 30,
The command word storage unit 8 stores command word data groups 8a, 8b, 8c, and 8d composed of a plurality of command word data for operating 40 and 50 by classifying them by the device and storing the device. Device 20, 30, 40, 50 selected in 2
And a voice recognition means 3 for recognizing the voice command from the command word data group 8a, 8b, 8c, 8d according to the above.

【０００８】また、本発明は、前記機器選択手段２は、
前記機器２０，３０，４０，５０に対応する複数の手動
操作スイッチ２ａ，２ｂ，２ｃ，２ｄを有するものであ
る。According to the present invention, the device selection means 2 is
It has a plurality of manually operated switches 2a, 2b, 2c, 2d corresponding to the devices 20, 30, 40, 50.

【０００９】また、本発明は、前記音声認識手段３は、
前記手動操作スイッチ２ａ，２ｂ，２ｃ，２ｄがオンさ
れてから所定時間だけ、前記音声指令を認識するもので
ある。According to the present invention, the voice recognition means 3 is
The voice command is recognized only for a predetermined time after the manual operation switches 2a, 2b, 2c, 2d are turned on.

【００１０】また、本発明は、前記音声認識手段３は、
前記音声データと前記指令語データとの類似度を比較し
て前記音声指令を認識するものである。According to the present invention, the voice recognition means 3 is
The voice command is recognized by comparing the degree of similarity between the voice data and the command word data.

【００１１】また、本発明は、前記機器２０，３０，４
０，５０は車両搭載用機器であり、前記手動操作スイッ
チ２ａ，２ｂ，２ｃ，２ｄはステアリングホイール５に
設けられているものである。Further, the present invention provides the above-mentioned devices 20, 30, 4
Reference numerals 0 and 50 are equipments mounted on a vehicle, and the manual operation switches 2a, 2b, 2c and 2d are provided on the steering wheel 5.

【００１２】[0012]

【発明の実施の形態】以下、添付の図面に基づいて、車
両に搭載される音声操作装置１０に本発明を適用した一
実施形態を説明する。BEST MODE FOR CARRYING OUT THE INVENTION An embodiment in which the present invention is applied to a voice operating device 10 mounted on a vehicle will be described below with reference to the accompanying drawings.

【００１３】音声操作装置１０は、マイク１（音声デー
タ出力手段），機器選択スイッチ２（機器選択手段），
音声認識部３（音声認識手段）及び機器操作制御部４か
らなり、車両に搭載されたオーディオシステム２０（機
器），電子メールシステム３０（機器），自動車電話シ
ステム４０（機器）及びナビゲーションシステム５０
（機器）を操作するものである。オーディオシステム２
０は、ＣＤチェンジャー２１，ＭＤデッキ２２，ラジオ
２３を有している。The voice operating device 10 includes a microphone 1 (voice data output means), a device selection switch 2 (device selection means),
An audio system 20 (equipment), an electronic mail system 30 (equipment), a car telephone system 40 (equipment), and a navigation system 50, which include a voice recognition unit 3 (voice recognition means) and a device operation control unit 4, and are installed in a vehicle.
(Device) is operated. Audio system 2
0 has a CD changer 21, an MD deck 22, and a radio 23.

【００１４】マイク１は、車室内のバイザー，フロント
ピラー，インストゥルメントパネル等に配設されてお
り、利用者の音声指令を入力する。マイク１は、入力し
た音声指令を音声データに変換して、この音声データを
音声認識部３に出力する。機器選択スイッチ２は、４個
の押ボタンスイッチ２ａ，２ｂ，２ｃ，２ｄ（手動操作
スイッチ）を有している。押ボタンスイッチ２ａ，２
ｂ，２ｃ，２ｄをオンすると、ボタン操作信号が音声認
識部３に出力される。The microphone 1 is arranged in a visor, a front pillar, an instrument panel, etc. in the vehicle compartment, and inputs a voice command from the user. The microphone 1 converts the input voice command into voice data and outputs this voice data to the voice recognition unit 3. The device selection switch 2 has four push button switches 2a, 2b, 2c, 2d (manual operation switches). Push button switches 2a, 2
When b, 2c and 2d are turned on, a button operation signal is output to the voice recognition unit 3.

【００１５】図２に示すように、押ボタンスイッチ２
ａ，２ｂ，２ｃ，２ｄは、ステアリングホイール５に設
けられており、各押ボタンスイッチ２ａ，２ｂ，２ｃ，
２ｄは円弧状に配列されている。従って、運転者は、ス
テアリングホイール５を握ったままの状態で、機器選択
スイッチ２を操作することができ、例えば左手の親指で
各押ボタンスイッチ２ａ，２ｂ，２ｃ，２ｄを押すこと
ができる。As shown in FIG. 2, the push button switch 2
The steering wheel 5 is provided with a, 2b, 2c and 2d, and each push button switch 2a, 2b, 2c,
2d are arranged in an arc shape. Therefore, the driver can operate the device selection switch 2 while holding the steering wheel 5, and can press each push button switch 2a, 2b, 2c, 2d with the thumb of the left hand, for example.

【００１６】音声認識部３は、認識処理部７及び指令語
記憶部８を有している。指令語記憶部８は、フラッシュ
メモリまたはＥＥＰＲＯＭ等の書込み可能な不揮発性メ
モリからなるものであり、この指令語記憶部８には機器
２０，３０，４０，５０を操作するための複数の指令語
データからなる指令語データ群８ａ，８ｂ，８ｃ，８ｄ
が記憶されている。The voice recognition section 3 has a recognition processing section 7 and a command word storage section 8. The command word storage unit 8 is composed of a writable non-volatile memory such as a flash memory or an EEPROM. The command word storage unit 8 has a plurality of command words for operating the devices 20, 30, 40, 50. Command word data group 8a, 8b, 8c, 8d consisting of data
Is remembered.

【００１７】指令語データ群８ａ，８ｂ，８ｃ，８ｄ
は、各機器２０，３０，４０，５０に対応するようにグ
ループ分けされ、指令語記憶部８に記憶されている。な
お、指令語データ群８ａはオーディオシステム２０に対
応し、指令語データ群８ｂは電子メールシステム３０に
対応し、指令語データ群８ｃは自動車電話システム４０
に対応し、指令語データ群８ｄはナビゲーションシステ
ム５０に対応するものである。Command word data group 8a, 8b, 8c, 8d
Are grouped so as to correspond to the respective devices 20, 30, 40, 50 and stored in the command word storage unit 8. The command word data group 8a corresponds to the audio system 20, the command word data group 8b corresponds to the electronic mail system 30, and the command word data group 8c corresponds to the car telephone system 40.
The command word data group 8d corresponds to the navigation system 50.

【００１８】認識処理部７は、ＣＰＵ，ＲＯＭ，ＲＡＭ
等から構成されるマイコンからなるものであり、ＲＯＭ
に記憶された音声認識処理プログラムによって、指令語
記憶部８から指令語データ群８ａ，８ｂ，８ｃ，８ｄを
読み込み、マイク１から出力された音声データと、読み
込まれた複数の指令語データとを各々比較して、類似度
が最も高かった指令語データを認識データとして機器操
作制御部４に出力する。機器操作制御部４は、音声認識
部３から入力した認識データに基づいて機器操作データ
を機器２０，３０，４０，５０に出力し、各機器２０，
３０，４０，５０を操作する。The recognition processing section 7 includes a CPU, a ROM and a RAM.
ROM consisting of a microcomputer composed of
The command recognition data storage unit 8 reads the command word data groups 8a, 8b, 8c, 8d from the command word storage unit 8 and stores the voice data output from the microphone 1 and the read command word data. The command word data having the highest degree of similarity in comparison with each other is output to the device operation control unit 4 as recognition data. The device operation control unit 4 outputs the device operation data to the devices 20, 30, 40, and 50 based on the recognition data input from the voice recognition unit 3, and each device 20,
Operate 30, 40, 50.

【００１９】次に、図３に基づいて、音声認識部３の認
識処理部７による音声認識処理について詳述する。Next, the voice recognition processing by the recognition processing unit 7 of the voice recognition unit 3 will be described in detail with reference to FIG.

【００２０】ステップＳ１では、押ボタンスイッチ２ａ
がオンされたか否かを判別し、オンされていない場合は
ステップＳ２に進む。押ボタンスイッチ２ａがオンされ
た場合はステップＳ５に進む。ステップＳ２では、押ボ
タンスイッチ２ｂがオンされたか否かを判別し、オンさ
れていない場合はステップＳ３に進む。押ボタンスイッ
チ２ｂがオンされた場合はステップＳ６に進む。In step S1, the push button switch 2a
If it is not turned on, the process proceeds to step S2. When the push button switch 2a is turned on, the process proceeds to step S5. In step S2, it is determined whether or not the push button switch 2b is turned on, and if it is not turned on, the process proceeds to step S3. When the push button switch 2b is turned on, the process proceeds to step S6.

【００２１】ステップＳ３では、押ボタンスイッチ２ｃ
がオンされたか否かを判別し、オンされていない場合は
ステップＳ４に進む。押ボタンスイッチ２ｃがオンされ
た場合はステップＳ７に進む。ステップＳ４では、押ボ
タンスイッチ２ｄがオンされたか否かを判別し、オンさ
れていない場合はステップＳ１に戻る。押ボタンスイッ
チ２ｄがオンされた場合はステップＳ８に進む。In step S3, the push button switch 2c
If it is not turned on, the process proceeds to step S4. If the push button switch 2c is turned on, the process proceeds to step S7. In step S4, it is determined whether or not the push button switch 2d is turned on, and if it is not turned on, the process returns to step S1. When the push button switch 2d is turned on, the process proceeds to step S8.

【００２２】ステップＳ５では、押ボタンスイッチ２ａ
がオンされてから５秒以内に、音声指令があったか否か
（即ち、マイク１から音声データの入力があったか否
か）を判別し、音声指令あった場合はステップＳ９に進
む。５秒以内に音声指令が無かった場合はステップＳ１
に戻る。ステップＳ６では、押ボタンスイッチ２ｂがオ
ンされてから５秒以内に、音声指令があったか否かを判
別し、音声指令あった場合はステップＳ１０に進む。５
秒以内に音声指令が無かった場合はステップＳ１に戻
る。In step S5, the push button switch 2a
Within 5 seconds after the switch is turned on, it is determined whether or not there is a voice command (that is, whether or not voice data is input from the microphone 1), and if there is a voice command, the process proceeds to step S9. If there is no voice command within 5 seconds, step S1
Return to. In step S6, it is determined whether or not there is a voice command within 5 seconds after the push button switch 2b is turned on, and if there is a voice command, the process proceeds to step S10. 5
If there is no voice command within the second, the process returns to step S1.

【００２３】ステップＳ７では、押ボタンスイッチ２ｃ
がオンされてから５秒以内に、音声指令があったか否か
を判別し、音声指令あった場合はステップＳ１１に進
む。５秒以内に音声指令が無かった場合はステップＳ１
に戻る。ステップＳ８では、押ボタンスイッチ２ｄがオ
ンされてから５秒以内に、音声指令があったか否かを判
別し、音声指令あった場合はステップＳ１２に進む。５
秒以内に音声指令が無かった場合はステップＳ１に戻
る。In step S7, the push button switch 2c
Within 5 seconds after the is turned on, it is determined whether or not there is a voice command, and if there is a voice command, the process proceeds to step S11. If there is no voice command within 5 seconds, step S1
Return to. In step S8, it is determined whether or not there is a voice command within 5 seconds after the push button switch 2d is turned on, and if there is a voice command, the process proceeds to step S12. 5
If there is no voice command within the second, the process returns to step S1.

【００２４】ステップＳ９では、指令語記憶部８から指
令語データ群８ａを読み込み、ステップＳ１３に進む。
ステップＳ１０では、指令語記憶部８から指令語データ
群８ｂを読み込み、ステップＳ１３に進む。ステップＳ
１１では、指令語記憶部８から指令語データ群８ｃを読
み込み、ステップＳ１３に進む。ステップＳ１２では、
指令語記憶部８から指令語データ群８ｄを読み込み、ス
テップＳ１３に進む。In step S9, the command word data group 8a is read from the command word storage unit 8, and the process proceeds to step S13.
In step S10, the command word data group 8b is read from the command word storage unit 8, and the process proceeds to step S13. Step S
In 11, the command word data group 8c is read from the command word storage unit 8, and the process proceeds to step S13. In step S12,
The command word data group 8d is read from the command word storage unit 8, and the process proceeds to step S13.

【００２５】ステップＳ１３では、マイク１から入力し
た音声データと、指令語記憶部８から読み込んだ指令語
データ群８ａ，８ｂ，８ｃ，８ｄの各指令語データとを
比較し、ステップ１４に進む。ステップ１４では、読み
込んだ指令語データ群８ａ，８ｂ，８ｃ，８ｄの中から
最も類似度が高かった指令語データを認識データとして
機器操作制御部４に出力し、ステップ１に戻る。In step S13, the voice data input from the microphone 1 is compared with the command word data of the command word data groups 8a, 8b, 8c, 8d read from the command word storage unit 8, and the process proceeds to step 14. In step 14, the command word data having the highest similarity among the read command word data groups 8a, 8b, 8c, 8d is output to the device operation control unit 4 as recognition data, and the process returns to step 1.

【００２６】以上説明したステップ１〜ステップ１４を
簡潔に説明すると、ステップＳ１〜Ｓ４において、押ボ
タンスイッチ２ａ〜２ｄの何れかがオンされると、ステ
ップＳ５〜ステップＳ８に進み、このとき、押ボタンス
イッチ２ａ，２ｂ，２ｃ，２ｄが押されてから5秒以内
に音声指令があった場合は、オンされた押ボタンスイッ
チ２ａ，２ｂ，２ｃ，２ｄに対応する指令語データ群８
ａ，８ｂ，８ｃ，８ｄが指令語記憶部８から読み込まれ
（ステップＳ９〜Ｓ１２）、その読み込まれた指令語デ
ータ群８ａ，８ｂ，８ｃ，８ｄの指令語データと、マイ
ク１から出力された音声データとを夫々比較し（ステッ
プ１３）、最も類似度が高かった指令語データを指令語
データ群８ａ，８ｂ，８ｃ，８ｄから抽出し、この抽出
した指令語データを認識データとして機器操作制御部４
に出力する（ステップ１４）。The steps 1 to 14 described above will be briefly described. When any of the push button switches 2a to 2d is turned on in steps S1 to S4, the process proceeds to steps S5 to S8. When there is a voice command within 5 seconds after the button switches 2a, 2b, 2c, 2d are pressed, the command word data group 8 corresponding to the pushed push button switches 2a, 2b, 2c, 2d.
a, 8b, 8c, and 8d are read from the command word storage unit 8 (steps S9 to S12), and the command word data of the read command word data groups 8a, 8b, 8c, and 8d and the microphone 1 are output. The voice word data is compared with each other (step 13), the command word data having the highest degree of similarity is extracted from the command word data groups 8a, 8b, 8c, 8d, and the extracted command word data is used as recognition data for device operation control. Part 4
(Step 14).

【００２７】次に、利用者がＣＤを聴く場合における音
声認識処理の具体例を説明する。利用者が「日本次郎」
のＣＤを聴く場合、ＣＤの再生はオーディオシステム２
０の機能であるので、利用者は、まず、押ボタンスイッ
チ２ａをオンし、５秒以内に「ニホンジロウ」という音
声指令を発する。Next, a specific example of the voice recognition process when the user listens to the CD will be described. User is "Japan Jiro"
When listening to a CD, the audio system 2
Since the function is 0, the user first turns on the push button switch 2a and issues a voice command "Japanese jiro" within 5 seconds.

【００２８】マイク１は「ニホンジロウ」という音声指
令を入力し、音声認識部３に音声データを出力する。音
声認識部３の認識処理部７は、押ボタンスイッチ２ａに
対応する指令語データ群８ａを指令語記憶部８から読み
込んで、この指令語データ群８ａの指令語データをマイ
ク１入力した音声データと各々比較し、最も類似度が高
かった指令語データを指令語データ群から抽出し、この
抽出した指令ごデータを認識データとして機器操作制御
部４に出力する。機器操作制御部４はオーディオシステ
ム２０に操作データを出力し、オーディオシステム２０
はＣＤチェンジャー２１の中から日本次郎のＣＤを選ん
で再生させる。The microphone 1 inputs a voice command "Japanese jiro" and outputs voice data to the voice recognition unit 3. The recognition processing unit 7 of the voice recognition unit 3 reads the command word data group 8a corresponding to the push button switch 2a from the command word storage unit 8, and inputs the command word data of the command word data group 8a into the microphone 1 as voice data. Then, the command word data having the highest degree of similarity is extracted from the command word data group, and the extracted command data is output to the device operation control unit 4 as recognition data. The device operation control unit 4 outputs operation data to the audio system 20,
Selects the CD of Jiro Nihon from the CD changer 21 and plays it.

【００２９】上述の例において、利用者が「日本四郎」
の家の位置をナビゲーションシステム５０に記憶させて
いた場合であっても、「ニホンシロウ」という指令語デ
ータを含む指令語データ群８ｄは認識処理部に読み込ま
れなく、オンされた押ボタンスイッチ２ａに対応する指
令語データ群８ａのみが指令語記憶部８から認識処理部
７に読み込まれるため、「ニホンジロウ」という音声指
令を「ニホンシロウ」という音声指令と誤認識する虞が
なくい。また、押ボタンスイッチの選択によって指令語
記憶部８に記憶されている指令語データの絞り込みが行
われ、この絞り込まれた指令語データの中から入力した
音声データに対応する指令語データを抽出することが可
能であることから、抽出処理に伴なうプログラム処理が
簡素化され、抽出処理速度を高めることが可能となる。In the above example, the user is "Shiro Nihon".
Even if the navigation system 50 stores the position of the house, the command word data group 8d including the command word data "Japanese Shiro" is not read by the recognition processing unit, and the push button switch 2a that is turned on does not. Since only the corresponding command word data group 8a is read from the command word storage unit 8 into the recognition processing unit 7, there is no risk of erroneously recognizing the voice command "Japanese Jiro" as the voice command "Japanese Shiro". Further, the command word data stored in the command word storage unit 8 is narrowed down by selecting the push button switch, and the command word data corresponding to the input voice data is extracted from the narrowed down command word data. Therefore, the program processing accompanying the extraction processing can be simplified and the extraction processing speed can be increased.

【００３０】また、同様に、電子メールシステム３０に
「日本四郎」のメールアドレスが記憶されていた場合で
あっても、押ボタンスイッチ２ａをオンしたときは指令
語データ群８ａだけが読み込まれ、「ニホンシロウ」と
いう指令語データを含む指令語データ群８ｂは認識処理
部７に読み込まれないため、「ニホンジロウ」という音
声指令を「ニホンシロウ」という音声指令と誤認識する
虞がない。Similarly, even if the e-mail system 30 stores the mail address of "Japan Shiro", when the push button switch 2a is turned on, only the command word data group 8a is read, Since the command word data group 8b including the command word data “Japanese Shiro” is not read by the recognition processing unit 7, there is no possibility that the voice command “Japanese Jiro” is erroneously recognized as the voice command “Japanese Shiro”.

【００３１】本実施形態は、操作する機器毎に分類して
指令語データ群８ａ，８ｂ，８ｃ，８ｄを指令語記憶部
８に記憶させ、オンされた押ボタンスイッチ２ａ，２
ｂ，２ｃ，３ｄに対応する指令語データ群８ａ，８ｂ，
８ｃ，８ｄだけを認識処理部７に読み込むため、誤認識
する虞が少ない。また、機器選択スイッチ２はステアリ
ングホイール５に設けられているため、利用者は、ステ
アリングホイール５を握ったままの状態で、機器選択ス
イッチ２を操作することができる。In this embodiment, the command word data groups 8a, 8b, 8c and 8d are classified into the operated devices and stored in the command word storage unit 8, and the push button switches 2a and 2 that are turned on are stored.
command word data groups 8a, 8b corresponding to b, 2c, 3d,
Since only 8c and 8d are read into the recognition processing unit 7, there is little risk of erroneous recognition. Further, since the device selection switch 2 is provided on the steering wheel 5, the user can operate the device selection switch 2 while holding the steering wheel 5.

【００３２】音声指令だけで機器を操作するときに比較
すると、本実施形態は、押ボタンスイッチ２ａ，２ｂ，
２ｃ，３ｄをオンする動作が必要であるが、押ボタンス
イッチ２ａ，２ｂ，２ｃ，３ｄの数が少ないため、利用
者が煩わしさを感じることはないことが発明者の実験に
より確かめられている。In comparison with the case where the device is operated only by the voice command, in the present embodiment, the push button switches 2a, 2b,
Although it is necessary to turn on 2c and 3d, it has been confirmed by the inventor's experiment that the user does not feel bothersome because the number of pushbutton switches 2a, 2b, 2c and 3d is small. .

【００３３】なお、本実施形態の音声操作装置１０が操
作する機器は、車両搭載機器（オーディオシステム２
０，電子メールシステム３０，自動車電話システム４
０，ナビゲーションシステム）であったが、本発明は他
の機器、例えばテレビ，ビデオデッキ，エアコン等の家
庭用電気製品を音声で操作する音声操作装置にも適用で
きることは言うまでもない。The equipment operated by the voice operating device 10 of the present embodiment is an on-vehicle equipment (audio system 2).
0, e-mail system 30, car phone system 4
No. 0, navigation system), but it goes without saying that the present invention can also be applied to other devices, for example, a voice operation device for operating household electric appliances such as a television, a VCR, and an air conditioner by voice.

【００３４】[0034]

【発明の効果】本発明は、音声指令を入力し音声データ
に変換する音声データ出力手段を備え、前記音声データ
に基づいて複数の機器を操作する音声操作装置であっ
て、操作する前記機器を選択する機器選択手段と、前記
音声指令を入力し音声データを出力する音声データ出力
手段と、前記機器を操作するための複数の指令語データ
からなる指令語データ群を前記機器別に分類して記憶し
ている指令語記憶部を有し、前記機器選択手段で選択さ
れた機器に応じた前記指令語データ群の中から前記音声
指令を認識する音声認識手段３と、を有するものであ
り、複数の機器を音声で操作することができ、且つ、誤
認識が少ない。The present invention is a voice operation device for operating a plurality of devices based on the voice data, which includes voice data output means for inputting a voice command and converting the voice data into the voice data. A device selection unit for selecting, a voice data output unit for inputting the voice command and outputting voice data, and a command word data group consisting of a plurality of command word data for operating the device are classified and stored by the device. And a voice recognition unit 3 for recognizing the voice command from the command word data group corresponding to the device selected by the device selection unit. Can be operated by voice, and there are few false recognitions.

【図面の簡単な説明】[Brief description of drawings]

【図１】本発明の実施形態を示すブロック図。FIG. 1 is a block diagram showing an embodiment of the present invention.

【図２】同上実施形態を示すステアリングホイールの
正面図。FIG. 2 is a front view of a steering wheel showing the embodiment.

【図３】同上実施形態を示すフロー図。FIG. 3 is a flowchart showing the embodiment.

【符号の説明】[Explanation of symbols]

１マイク（音声データ出力手段）２機器選択スイッチ（機器選択手段）２ａ押ボタンスイッチ（手動操作スイッチ）２ｂ押ボタンスイッチ（手動操作スイッチ）２ｃ押ボタンスイッチ（手動操作スイッチ）２ｄ押ボタンスイッチ（手動操作スイッチ）３音声認識部７認識処理部８指令語記憶部８ａ指令語データ群８ｂ指令語データ群８ｃ指令語データ群８ｄ指令語データ群２０オーディオシステム（機器）３０電子メールシステム（機器）４０自動車電話システム（機器）５０ナビゲーションシステム（機器） 1 Microphone (voice data output means) 2 Device selection switch (device selection means) 2a Push button switch (manual operation switch) 2b Push button switch (manual operation switch) 2c Push button switch (manual operation switch) 2d push button switch (manual operation switch) 3 Speech recognition section 7 Recognition processing unit 8 Command word storage 8a Command word data group 8b Command word data group 8c Command word data group 8d Command word data group 20 Audio system (equipment) 30 E-mail system (equipment) 40 Car phone system (equipment) 50 Navigation system (equipment)

───────────────────────────────────────────────────── フロントページの続き (51)Int.Cl.⁷ 識別記号ＦＩテーマコート゛(参考）Ｇ１０Ｌ 15/28 Ｇ１０Ｌ 3/00 ５７１Ｋ (72)発明者佐原祐介新潟県長岡市藤橋１丁目190番地１日本精機株式会社アールアンドデイセンター内Ｆターム(参考） 3D030 DB13 5D015 BB01 HH11 HH13 KK01 LL08─────────────────────────────────────────────────── ─── Continuation of the front page (51) Int.Cl. ⁷ Identification code FI theme code (reference) G10L 15/28 G10L 3/00 571K (72) Inventor Yusuke Sahara 1 190-1, Fujihashi, Nagaoka-shi, Niigata Japan Seiki Co., Ltd. R & D Center F term (reference) 3D030 DB13 5D015 BB01 HH11 HH13 KK01 LL08