cto是什么职位| 出来混迟早要还的什么意思| 六度万行 是什么意思| 1980年属什么生肖| 一直发烧不退是什么原因| 外阴又疼又痒用什么药| 12月16是什么星座| cfu是什么单位| 感冒全身酸痛吃什么药| lf是什么牌子| 荨麻疹为什么晚上起| 办准生证需要什么资料| 天空又什么又什么| 长期口臭吃什么药| 鹦鹉喜欢吃什么食物| 拉姆什么意思| 肺动脉高压用什么药| 百无一用是什么意思| 白细胞数目偏高是什么意思| 立棍是什么意思| 头发沙发是什么意思| 蒲公英什么功效| da是什么意思| 工作是为了什么| ga是什么牌子| array是什么意思| 口腔溃疡长什么样| 世界上最大的蛇是什么蛇| 膝盖跪着疼是什么原因| 胃间质瘤是什么性质的瘤| 什么是原发性高血压和继发性高血压| 此起彼落是什么意思| 为什么总是头疼| 病是什么结构| 配送是什么意思| 大保健是什么| 梅花三弄是什么意思| 鬼一般找什么人压床| 海为什么是蓝色| 无声无息是什么意思| 凌乱是什么意思| 单核细胞偏高是什么原因| 苯是什么| 为什么晚上血压比白天高| 颜字五行属什么| 减肥吃什么好| 膝盖怕冷是什么原因| 兽性大发是什么生肖| 苏联什么时候解体| 预设是什么意思| 舌裂是什么原因造成的| 树根有什么作用| 专一是什么意思| 葫芦代表什么寓意| 破损是什么意思| 守株待兔是什么生肖| bbr是什么牌子| 法西斯战争是什么意思| 高血压用什么药最好| 无冕之王是什么意思| 手会发抖是什么原因| 蟑螂吃什么东西| 尿酸高会得什么病| 麻风病是什么病| 摧残是什么意思| modern是什么牌子| 洗耳恭听是什么意思| timing什么意思| 后背容易出汗是什么原因| funfair是什么意思| 萎缩性胃炎吃什么水果好| 舌根苔白厚腻是什么原因| 贲门不舒服有什么症状| 斐乐是什么档次| 眼黄瘤什么方法治疗最好| 为什么会胆汁反流| beacon什么意思| 莞尔一笑什么意思| 早起胃疼是什么原因导致的| 水钠潴留什么意思| wiggle是什么意思| 姑妈的老公叫什么| 大好河山是什么生肖| 直肠ca是什么意思| 1979年出生属什么生肖| 经常吐口水是什么原因| 死皮是什么| 美国为什么不敢打伊朗| 结婚证需要什么资料| 斗破苍穹什么时候出的| 鼻涕由清变黄说明什么| 优越感是什么意思| 阑尾炎属于什么科室| 华丽的近义词是什么| 迟脉是什么意思| 高密度脂蛋白胆固醇高是什么意思| 外痔用什么药可以消除| 夏枯草是什么样子| 麻油跟香油什么区别| 健脾祛湿吃什么药| 怂恿是什么意思| 血液由什么组成| 25分贝相当于什么声音| 临终关怀的目的是什么| 为什么睡觉会突然抖一下| 处女座是什么星座| 贫血貌是什么表现| 男人吃什么大补| 施食是什么意思| 屈髋是什么姿势| 狗狗为什么会得细小| 呵呵什么意思| 什么的歌声| 胸腺癌早期有什么症状| 什么人不能吃玉米| 益生菌什么时候吃最好| 梦见抓鱼是什么预兆| 一个虫一个夫念什么| 流产后吃什么食物| 蚂蚱吃什么| 恐龙什么时候灭绝的| 骨骼肌率是什么意思| 妇乐颗粒的功效能治什么病| 什么药可以消肿| 幽门螺旋杆菌感染吃什么药| 世界上最长的英语单词是什么| 牛奶为什么能解辣| 晕车药吃多了有什么副作用| 中招是什么意思| 记仇的人是什么性格| 外阴白斑是什么原因| 金鱼沉底不动什么原因| 展开的近义词是什么| 99什么意思| 槊是什么意思| 蚱蜢吃什么食物| 布病是什么病| 甲状腺应该多吃什么| 707是什么意思| 鹿角有什么功效和作用| 活塞是什么意思| 吃银耳有什么功效和作用| 柏油样便见于什么病| 天德月德是什么意思| 天行健的下一句是什么| 吃榴莲对身体有什么好处| 1999年五行属什么| 高回声结节是什么意思| 干净的近义词是什么| 动脉硬化吃什么| 羊眼圈是什么| 缩量横盘意味着什么| 洗面奶什么好| 榴莲有什么作用| 梦见偷别人东西是什么意思| hcg下降是什么原因| 呼吸道感染挂什么科| 继发性高血压是什么意思| 身体寒湿重吃什么好| 中指戴戒指是什么意思| 胰子是什么意思| 心经讲的是什么| otc属于什么药| 沱茶属于什么茶| 弹力棉是什么面料| 潮起潮落是什么意思| 便秘吃什么菜有助排便| 香港特首什么级别| 孕妇梦见鱼是什么意思| 贫血喝什么口服液| 肌红蛋白是什么意思| 泌尿系统感染什么症状| 毛主席什么时候去世| 股癣是什么原因引起的| 圆谎是什么意思| 早上打嗝是什么原因呢| 喝牛奶拉肚子是什么原因| 胃底腺息肉什么意思| 纤维灶是什么意思| 检察院是做什么的| 胎停了有什么明显症状| 胃酸分泌过多是什么原因造成的| 晚上难入睡是什么原因| 血压低是什么情况| sinoer是什么牌子| 谈恋爱是为了什么| ad和d3有什么区别| 头孢喝酒有什么反应| 眉毛旁边长痘痘是什么原因| 什么长| 震慑是什么意思| mophie是什么牌子| 桑葚什么季节成熟| 蚊子咬了为什么会痒| 共产主义社会是什么样的社会| 怀孕前壁和后壁有什么区别| 西瓜像什么比喻句| 排浊是什么意思| 中药天龙又叫什么| 谈什么色变| 50至60岁吃什么钙片好| 猫叫是什么意思| 清洁度二度是什么意思| 什么是胃肠型更年期| 步兵什么意思| 颈椎病头晕吃什么药好| 用什么能把牙齿洗白| 左侧附件区囊性回声是什么意思| 为什么瘦不下来| 晕车吃什么药| 盆腔炎是什么原因引起的| 大便拉不出来什么原因| 什么动物不怕热| 胃酸烧心吃什么药| 糜烂性胃炎吃什么药效果好| 伤风胶囊又叫什么| 脚气病缺什么维生素| 榴莲什么季节成熟| 口腔医学和口腔医学技术有什么区别| 乙酰氨基葡萄糖苷酶阳性什么意思| 什么手| 什么是道家| 陈皮泡水喝有什么作用| 右手无名指戴戒指代表什么| 儿童结膜炎用什么眼药水| 什么是全麦面粉| 拉稀屎是什么原因| 孕期什么时候补充dha| 阑尾炎吃什么消炎药| 查血型挂什么科| 奀是什么意思| 为什么会有荨麻疹| 赛字五行属什么| 螺子黛是什么| 22度穿什么衣服合适| 每次来月经都会痛经什么原因| 新店开业送什么好| 痈疡是什么意思| 组织委员的职责是什么| 检点是什么意思| 子宫钙化灶是什么意思| 坐月子什么不可以吃| nf是什么单位| mri检查是什么意思| 喝葡萄糖有什么功效与作用| 细菌性阴道炎用什么药效果最好| 脾不统血吃什么中成药| 八字缺什么怎么算| 龟头炎吃什么药| 泡脚用什么东西泡最好| 天蝎座男生喜欢什么样的女生| 孙悟空是个什么样的人| 黄体破裂什么意思| 五行缺什么怎么算| 深覆合是什么样子的| 小ck属于什么档次| 制剂是什么意思| 鲤鱼喜欢吃什么| 小孩用脚尖走路是什么原因| 滴水不漏是什么生肖| 甲状腺结节不能吃什么东西| 色丁布是什么面料| 胃胀嗳气吃什么药最有效| 男占258女占369什么意思| 百度Jump to content

儿童说话晚去医院挂什么科

From Wikipedia, the free encyclopedia
百度   睡不好,不肯睡,该咋治?  习惯晚睡,是一种病!得治!  据说2017年眼罩、隔音耳塞、足贴是最受欢迎的助眠产品TOP3,其中,隔音耳塞是95后的最爱,而且95后还买得更“贵”。

The Speech Application Programming Interface or SAPI is an API developed by Microsoft to allow the use of speech recognition and speech synthesis within Windows applications. To date, a number of versions of the API have been released, which have shipped either as part of a Speech SDK or as part of the Windows OS itself. Applications that use SAPI include Microsoft Office, Microsoft Agent and Microsoft Speech Server.

In general, all versions of the API have been designed such that a software developer can write an application to perform speech recognition and synthesis by using a standard set of interfaces, accessible from a variety of programming languages. In addition, it is possible for a 3rd-party company to produce their own Speech Recognition and Text-To-Speech engines or adapt existing engines to work with SAPI. In principle, as long as these engines conform to the defined interfaces they can be used instead of the Microsoft-supplied engines.

In general, the Speech API is a freely redistributable component which can be shipped with any Windows application that wishes to use speech technology. Many versions (although not all) of the speech recognition and synthesis engines are also freely redistributable.

There have been two main 'families' of the Microsoft Speech API. SAPI versions 1 through 4 are all similar to each other, with extra features in each newer version. SAPI 5, however, was a completely new interface, released in 2000. Since then several sub-versions of this API have been released.

Basic architecture

[edit]

The Speech API can be viewed as an interface or piece of middleware which sits between applications and speech engines (recognition and synthesis). In SAPI versions 1 to 4, applications could directly communicate with engines. The API included an abstract interface definition which applications and engines conformed to. Applications could also use simplified higher-level objects rather than directly call methods on the engines.

In SAPI 5 however, applications and engines do not directly communicate with each other. Instead, each talks to a runtime component (sapi.dll). There is an API implemented by this component which applications use, and another set of interfaces for engines.

Typically in SAPI 5 applications issue calls through the API (for example to load a recognition grammar; start recognition; or provide text to be synthesized). The sapi.dll runtime component interprets these commands and processes them, where necessary calling on the engine through the engine interfaces (for example, the loading of grammar from a file is done in the runtime, but then the grammar data is passed to the recognition engine to actually use in recognition). The recognition and synthesis engines also generate events while processing (for example, to indicate an utterance has been recognized or to indicate word boundaries in the synthesized speech). These pass in the reverse direction, from the engines, through the runtime DLL, and on to an event sink in the application.

In addition to the actual API definition and runtime DLL, other components are shipped with all versions of SAPI to make a complete Speech Software Development Kit. The following components are among those included in most versions of the Speech SDK:

  • API definition files - in MIDL and as C or C++ header files.
  • Runtime components - e.g. sapi.dll.
  • Control Panel applet - to select and configure default speech recognizer and synthesizer.
  • Text-To-Speech engines in multiple languages.
  • Speech Recognition engines in multiple languages.
  • Redistributable components to allow developers to package the engines and runtime with their application code to produce a single installable application.
  • Sample application code.
  • Sample engines - implementations of the necessary engine interfaces but with no true speech processing which could be used as a sample for those porting an engine to SAPI.
  • Documentation.

Versions

[edit]

Xuedong Huang was a key person who led Microsoft's early SAPI efforts.

SAPI 1-4 API family

[edit]

SAPI 1

[edit]

The first version of SAPI was released in 1995, and was supported on Windows 95 and Windows NT 3.51. This version included low-level Direct Speech Recognition and Direct Text To Speech APIs which applications could use to directly control engines, as well as simplified 'higher-level' Voice Command and Voice Talk APIs.

SAPI 3

[edit]

SAPI 3.0 was released in 1997. It added limited support for dictation speech recognition (discrete speech, not continuous), and additional sample applications and audio sources.

SAPI 4

[edit]

SAPI 4.0 was released in 1998. This version of SAPI included both the core COM API; together with C++ wrapper classes to make programming from C++ easier; and ActiveX controls to allow drag-and-drop Visual Basic development. This was shipped as part of an SDK that included recognition and synthesis engines. It also shipped (with synthesis engines only) in Windows 2000.

The main components of the SAPI 4 API (which were all available in C++, COM, and ActiveX flavors) were:

  • Voice Command - high-level objects for command & control speech recognition
  • Voice Dictation - high-level objects for continuous dictation speech recognition
  • Voice Talk - high-level objects for speech synthesis
  • Voice Telephony - objects for writing telephone speech applications
  • Direct Speech Recognition - objects for direct control of recognition engine
  • Direct Text To Speech - objects for direct control of synthesis engine
  • Audio objects - for reading to and from an audio device or file

SAPI 5 API family

[edit]

The Speech SDK version 5.0, incorporating the SAPI 5.0 runtime was released in 2000. This was a complete redesign from previous versions and neither engines nor applications which used older versions of SAPI could use the new version without considerable modification.

The design of the new API included the concept of strictly separating the application and engine so all calls were routed through the runtime sapi.dll. This change was intended to make the API more 'engine-independent', preventing applications from inadvertently depending on features of a specific engine. In addition, this change was aimed at making it much easier to incorporate speech technology into an application by moving some management and initialization code into the runtime.

The new API was initially a pure COM API and could be used easily only from C/C++. Support for VB and scripting languages were added later. Operating systems from Windows 98 and NT 4.0 upwards were supported.

Major features of the API include:

  • Shared Recognizer. For desktop speech recognition applications, a recognizer object can be used that runs in a separate process (sapisvr.exe). All applications using the shared recognizer communicate with this single instance. This allows sharing of resources, removes contention for the microphone and allows for a global UI for control of all speech applications.
  • In-proc recognizer. For applications that require explicit control of the recognition process, the in-proc recognizer object can be used instead of the shared one.
  • Grammar objects. Speech grammars are used to specify the words that the recognizer is listening for. SAPI 5 defines an XML markup for specifying a grammar, as well as mechanisms to create them dynamically in code. Methods also exist for instructing the recognizer to load a built-in dictation language model.
  • Voice object. This performs speech synthesis, producing an audio stream from a text. A markup language (similar to XML, but not strictly XML) can be used for controlling the synthesis process.
  • Audio interfaces. The runtime includes objects for performing speech input from the microphone or speech output to speakers (or any sound device); as well as to and from wave files. It is also possible to write a custom audio object to stream audio to or from a non-standard location.
  • User lexicon object. This allows custom words and pronunciations to be added by a user or application. These are added to the recognition or synthesis engine's built-in lexicons.
  • Object tokens. This is a concept allowing recognition and TTS engines, audio objects, lexicons and other categories of an object to be registered, enumerated and instantiated in a common way.

SAPI 5.0

[edit]

This version shipped in late 2000 as part of the Speech SDK version 5.0, together with version 5.0 recognition and synthesis engines. The recognition engines supported continuous dictation and command & control and were released in U.S. English, Japanese and Simplified Chinese versions. In the U.S. English system, special acoustic models were available for children's speech and telephony speech. The synthesis engine was available in English and Chinese. This version of the API and recognition engines also shipped in Microsoft Office XP in 2001.

SAPI 5.1

[edit]

This version shipped in late 2001 as part of the Speech SDK version 5.1. Automation-compliant interfaces were added to the API to allow use from Visual Basic, scripting languages such as JScript, and managed code. This version of the API and TTS engines were shipped in Windows XP. Windows XP Tablet PC Edition and Office 2003 also include this version but with a substantially improved version 6 recognition engine and Traditional Chinese.

SAPI 5.2

[edit]

This was a special version of the API for use only in the Microsoft Speech Server which shipped in 2004. It added support for SRGS and SSML mark-up languages, as well as additional server features and performance improvements. The Speech Server also shipped with the version 6 desktop recognition engine and the version 7 server recognition engine.

SAPI 5.3

[edit]

This is the version of the API that ships in Windows Vista together with new recognition and synthesis engines. As Windows Speech Recognition is now integrated into the operating system, the Speech SDK and APIs are a part of the Windows SDK. SAPI 5.3 includes the following new features:

  • Support for W3C XML speech grammars for recognition and synthesis. The Speech Synthesis Markup Language (SSML) version 1.0 provides the ability to mark up voice characteristics, speed, volume, pitch, emphasis, and pronunciation.
  • The Speech Recognition Grammar Specification (SRGS) supports the definition of context-free grammars, with two limitations:
    • It does not support the use of SRGS to specify dual-tone modulated-frequency (touch-tone) grammars.
    • It does not support Augmented Backus–Naur form (ABNF).
  • Support for semantic interpretation script within grammars. SAPI 5.3 enables an SRGS grammar to be annotated with JavaScript for semantic interpretation to supplement the recognized text.
  • User-Specified shortcuts in lexicons, which is the ability to add a string to the lexicon and associate it with a shortcut word. When dictating, the user can say the shortcut word and the recognizer will return the expanded string.
  • Additional functionality and ease-of-programming provided by new types.
  • Performance improvements, improved reliability, and security.
  • Version 8 of the speech recognition engine ("Microsoft Speech Recognizer")

SAPI 5.4

[edit]

This is an updated version of the API that ships in Windows 7.

SAPI 5 Voices

[edit]

Microsoft Sam (Speech Articulation Module)[citation needed] is a commonly shipped SAPI 5 voice. In addition, Microsoft Office XP and Office 2003 installed L&H Michael and Michelle voices. The SAPI 5.1 SDK installs 3 more voices, Mike, Mary, and an additional testing voice known as "Sample TTS Voice" that uses prerecorded voice recordings instead of synthesized voices. Windows Vista includes Microsoft Anna which replaces Microsoft Sam and sounds more natural and intelligible. It is also installed on Windows XP by Microsoft Streets & Trips 2006 and later versions. The Chinese version of Vista and later Windows client versions also include a female voice named Microsoft Lili.

Managed code Speech API

[edit]

A managed code API ships as part of the .NET Framework 3.0.[1] It has similar functionality to SAPI 5 but is more suitable to be used by managed code applications. The new API is available on Windows XP, Windows Server 2003, Windows Vista, and Windows Server 2008.

The existing SAPI 5 API can also be used from managed code to a limited extent by creating a COM Interop code (helper code designed to assist in accessing COM interfaces and classes). This works well in some scenarios however the new API should provide a more seamless experience equivalent to using any other managed code library.

However, major obstacle towards transitioning from the COM Interop is the fact that the managed implementation has subtle memory leaks which lead to memory fragmentation and exclude the use of the library in any non-trivial applications. As a workaround, Microsoft has suggested using a different API, which has fewer voices.[2]

Speech functionality in Windows Vista

[edit]

Windows Vista includes a number of new speech-related features including:

  • Speech control of the full Windows GUI and applications
  • New tutorial, microphone wizard, and UI for controlling speech recognition
  • New version of the Speech API runtime: SAPI 5.3
  • Built-in updated Speech Recognition engine (Version 8)
  • New Speech Synthesis engine and SAPI voice Microsoft Anna
  • Managed code speech API (codenamed SpeechFX)
  • Speech recognition support for 8 languages at release time: U.S. English, U.K. English, traditional Chinese, simplified Chinese, Japanese, Spanish, French, and German, with more language to be released later.

Microsoft Agent most notably, and all other Microsoft speech applications use SAPI 5.

Compatibility

[edit]

The Speech API is compatible with the following operating systems:[3][4]

SAPI 5

[edit]

List as of SAPI version 5.1:[3][4]

Later versions of SAPI 5 (e.g. SAPI 5.3 and above) are compatible with the following operating systems:

SAPI 4

[edit]

Major applications using SAPI

[edit]

See also

[edit]

References

[edit]
  1. ^ Michael Dunn. "Speech synthesis and recognition in .NET - Give applications a voice". Redmond Developer News. Retrieved 2025-08-06. Archived 14 January 2010 at the Wayback Machine
  2. ^ System. Speech has a memory leak | Microsoft Connect. Connect.microsoft.com. Retrieved on 2025-08-06.
  3. ^ a b Microsoft Corporation. "SAPI System Requirements". MSDN. Archived from the original on 2025-08-06. Retrieved 2025-08-06.
  4. ^ a b "Welcome to the Microsoft Speech SDK - Microsoft Speech SDK Documentation". documentation.help. Retrieved 2025-08-06.
[edit]
自然生化流产是什么意思 手心脚心发热吃什么药 小麦淀粉可以做什么 1212是什么星座 农历7月25日是什么星座
疝气是什么原因引起的 天珠到底是什么 随餐服用是什么意思 我不知道你在说什么英文 追求是什么意思
骨密度z值是什么意思 什么是日记 一路顺风是什么生肖 昆明是什么城 12月14日是什么星座
欢天喜地是什么生肖 短纤是什么 什么人靠别人的脑袋生活 活珠子是什么 一竖读什么
猝死什么意思hcv9jop6ns1r.cn 为什么脚会有酸臭味hcv9jop3ns0r.cn 肩膀酸痛是什么原因hcv7jop6ns6r.cn 孟买血型是什么意思hcv8jop2ns2r.cn 发烧了吃什么药0297y7.com
佝偻病是什么意思hcv8jop5ns7r.cn 富贵包是什么hcv8jop1ns7r.cn 手心发热吃什么药hcv7jop7ns2r.cn 口臭吃什么中成药ff14chat.com 太平猴魁是什么茶hcv7jop6ns3r.cn
脚干裂用什么药膏hcv8jop2ns1r.cn 人文是什么意思hcv8jop5ns1r.cn hippo什么意思hcv8jop0ns3r.cn 迪丽热巴颜值什么水平hcv8jop8ns8r.cn 血压低有什么症状fenrenren.com
4月25日什么星座hlguo.com 珠海有什么好玩的hcv8jop2ns7r.cn Valentino什么牌子hcv9jop3ns0r.cn 仁慈是什么意思96micro.com 飞水是什么意思hcv8jop3ns3r.cn
百度