Amazon Alexa Intelligent Voice Service

I recently looked into Amazon's Alexa voice service; these are my notes on what I found.

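Alexa is essentially the brain of Echo, Amazon's smart speaker. Take a weather query as an example: the user says “What's the weather in Seattle”, and Echo sends the request to Alexa for processing. Alexa extracts the features “weather” and “Seattle”, works out what the user means, looks up the weather accordingly, and returns the result to Echo, which then reads it out.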
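The flow above can be sketched as a toy pipeline. This is only an illustration of the steps (extract features, interpret, look up, reply): the keyword matching, the city list, and the canned weather data are all made up for the example, and the real service relies on cloud speech recognition and language understanding rather than anything this simple.

```python
# Toy illustration only: real Alexa does not work by keyword matching.

KNOWN_CITIES = {"seattle", "london", "berlin"}  # hypothetical city list

# Hypothetical canned data standing in for a real weather service.
FAKE_WEATHER = {"seattle": "rainy, 12 degrees Celsius"}


def extract_features(utterance: str) -> dict:
    """Pick out an intent keyword and a city from the recognized text."""
    words = [w.strip("?.,!").lower() for w in utterance.split()]
    intent = "weather" if "weather" in words else None
    city = next((w for w in words if w in KNOWN_CITIES), None)
    return {"intent": intent, "city": city}


def handle(utterance: str) -> str:
    """Return the sentence the device would speak back."""
    features = extract_features(utterance)
    if features["intent"] != "weather" or features["city"] is None:
        return "Sorry, I did not understand that."
    report = FAKE_WEATHER.get(features["city"])
    if report is None:
        return f"I have no weather data for {features['city'].title()}."
    return f"The weather in {features['city'].title()} is {report}."


if __name__ == "__main__":
    print(handle("What's the weather in Seattle"))
```

In this sketch Echo's role is just to capture the utterance and speak whatever `handle` returns; everything else happens on the service side.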

Alexa

Alexa, the voice service that powers Echo, provides capabilities, or skills, that enable customers to interact with devices in a more intuitive way using voice. Examples of these skills include the ability to play music, answer general questions, set an alarm or timer and more. Alexa is built in the cloud, so it is always getting smarter. The more customers use Alexa, the more she adapts to speech patterns, vocabulary, and personal preferences.

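In short, Alexa is the voice service behind Echo. It gives users voice interaction such as playing music, answering questions, and setting alarms.

Supported devices

  • Amazon Echo and similar devices
  • Third-party devices that integrate AVS (Alexa Voice Service)
  • Mobile apps (which integrate AVS by calling the AVS API over HTTP/2)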

AVS(Alexa Voice Service)

Use the Alexa Voice Service (AVS) to add intelligent voice control to any connected product that has a microphone and speaker. Your customers will be able to ask Alexa to play music, answer questions, get news and local information, control smart home products, and more on their voice-enabled products.

AVS is now available for the UK and Germany. It currently supports English and German.

AVS provides hardware and software development tools to help you easily build products with Alexa.

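Demo link

The demo takes a rather odd approach. Instead of integrating AVS directly in Android, you first run a Java program (with its own GUI) on a Mac, then start an Android emulator and run the demo's Android app, which goes through authorization and obtains a token. Only then do you test with the Java program on the Mac. (Personally I find this flow strange... why not just call the AVS API over HTTP/2?)

To stress it again: AVS does not support Chinese, so test the demo in English.

A colleague tried the demo on a Mac. Once it is running, you can speak to the Mac. For example, if you say “What's the weather in Seattle”, the demo replies with “The weather is xxxxxx”.

Demo pitfalls

For the Android authorization step, use the stock Android emulator rather than Genymotion; otherwise the connection times out.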

Building with AVS

Integrate Alexa into your next voice-enabled connected product. Alexa is always getting smarter with new capabilities and services through machine learning, regular feature updates, and custom skills. The AVS speech engines and features live in the cloud and are easily accessible via APIs, minimizing the design complexity of your product. We also provide software and hardware development tools to help you quickly and easily build Alexa-enabled products.

AVS can be integrated in three ways: the AVS Device SDK, Development Kits for AVS, or the AVS API.

An example of integrating AVS on Android

[Three screenshots of the Android integration example, taken 2017-09-08]

AVS Device SDK

The AVS Device SDK provides C++-based libraries that enable your device to process audio inputs and triggers, establish persistent connections with AVS, and handle all Alexa-driven interactions. The SDK leverages the AVS API that handles core Alexa functionality, including speech recognition and synthesis, and other capabilities such as streaming media, timers and alarms, notifications, weather reports, and thousands of custom skills.


Key SDK Components

Development Kits for AVS

Development Kits for AVS are reference solutions for integrating Alexa into your products. They include chipsets, voice processing technologies, and client software that leverages the AVS APIs to help you easily build voice-enabled commercial products while reducing development costs and accelerating the integration process. These solutions range in features, cost, and in use case, allowing you to select the best performing audio front end solution for your product needs.

AVS API

AVS provides programming interfaces to access the cloud-based service from your product, regardless of its operating system. Each interface contains logically grouped messages called directives and events that enable your device and the AVS cloud to communicate directly. The APIs handle core Alexa functionality, including speech recognition and synthesis, and other capabilities such as streaming music, timers and alarms, notifications, weather reports, and skills, enabling your device to capture an utterance, process it in the cloud, and play back an intelligent speech response.
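To make "events" and "directives" concrete, here is a minimal sketch that builds the JSON metadata for a speech-recognition event and reads the header of a directive sent back by the cloud. The field names and values (`SpeechRecognizer`, `Recognize`, `CLOSE_TALK`, `AUDIO_L16_RATE_16000_CHANNELS_1`) are written from my recollection of the AVS documentation and should be treated as assumptions to verify against the official API reference. The helper names are mine, and the actual transport (an HTTP/2 multipart request carrying the captured audio plus an access token) is deliberately left out.

```python
import json
import uuid


def build_recognize_event() -> dict:
    """Build the JSON metadata for a SpeechRecognizer.Recognize event.

    Assumption: the field names mirror the public AVS documentation as of
    2017; check them against the current API reference before relying on them.
    """
    return {
        "context": [],  # device state reports are omitted in this sketch
        "event": {
            "header": {
                "namespace": "SpeechRecognizer",
                "name": "Recognize",
                "messageId": str(uuid.uuid4()),
                "dialogRequestId": str(uuid.uuid4()),
            },
            "payload": {
                "profile": "CLOSE_TALK",
                "format": "AUDIO_L16_RATE_16000_CHANNELS_1",
            },
        },
    }


def describe_directive(raw: str) -> str:
    """Return 'Namespace.Name' for a directive received from the cloud."""
    header = json.loads(raw)["directive"]["header"]
    return f"{header['namespace']}.{header['name']}"


if __name__ == "__main__":
    print(json.dumps(build_recognize_event(), indent=2))
    # A hand-written sample directive, not captured from a real session.
    sample = '{"directive": {"header": {"namespace": "SpeechSynthesizer", "name": "Speak"}, "payload": {}}}'
    print(describe_directive(sample))
```

The device sends events (here, "the user started speaking") and the cloud answers with directives (here, "speak this audio"), which is the two-way message grouping the paragraph above describes.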

What Is an Alexa Skill?

Alexa is Amazon’s voice service and the brain behind millions of devices like the Amazon Echo, Echo Dot, and Echo Show. Alexa provides capabilities, or skills, that enable customers to create a more personalized experience. There are now more than 15,000 skills from companies like Starbucks, Uber, and Capital One as well as other innovative designers and developers.

What Is the Alexa Skills Kit?

With the Alexa Skills Kit (ASK), designers, developers, and brands can build engaging skills and reach millions of customers. ASK is a collection of self-service APIs, tools, documentation, and code samples that makes it fast and easy for you to add skills to Alexa. With ASK, you can leverage Amazon’s knowledge and pioneering work in the field of voice design.

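The Alexa Skills Kit extends what Alexa can do. Third parties (developers, service providers, and so on) use ASK to build skills that look up information, order food, book a ride, control smart home devices, fetch news, and more.

Skill types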
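As a rough idea of what a third-party skill looks like on the developer's side, the sketch below is a handler in the style of an AWS Lambda function that receives a skill request and returns a spoken response. The intent name `GetWeatherIntent` and the slot name `City` are hypothetical, and the request and response shapes follow the ASK JSON format as I understand it, so check the official reference before depending on the exact fields.

```python
def build_speech_response(text: str, end_session: bool = True) -> dict:
    """Wrap plain text in the response envelope a custom skill returns."""
    return {
        "version": "1.0",
        "response": {
            "outputSpeech": {"type": "PlainText", "text": text},
            "shouldEndSession": end_session,
        },
    }


def lambda_handler(event: dict, context=None) -> dict:
    """Route an incoming skill request to a spoken reply."""
    request = event["request"]
    if request["type"] == "LaunchRequest":
        # The user opened the skill without asking for anything specific.
        return build_speech_response(
            "Welcome. Ask me about the weather.", end_session=False
        )
    if request["type"] == "IntentRequest":
        intent = request["intent"]
        if intent["name"] == "GetWeatherIntent":  # hypothetical intent name
            city = intent.get("slots", {}).get("City", {}).get("value")
            if city:
                return build_speech_response(f"Looking up the weather in {city}.")
            # Slot missing: keep the session open and ask a follow-up.
            return build_speech_response("Which city?", end_session=False)
    return build_speech_response("Sorry, I cannot help with that.")
```

Alexa handles the speech recognition and passes the skill a structured request; the skill only decides what to say back.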

  • Custom Skills

    • Look up tide information
    • Order a pizza
    • Request a taxi
    • Engage the user in a game, such as word puzzles or trivia
    • Just about any other action you can imagine!
  • Smart Home Skills

    • turn on / turn off
    • increase / decrease the temperature
    • change the dimness or brightness for a light

    You (as the developer) define how your skill responds to a particular directive. For instance, you write the code that makes a light turn on when your skill receives a “turn on the light” directive. This code is called a skill adapter.

  • Flash Briefing Skills

    For this type of skill, the Flash Briefing Skill API defines:

    • The words users say to make (or invoke) those requests. For example:
      • “give me my flash briefing”
      • “tell me the news”

    You (as the creator) define:

    • The name, description and images for a flash briefing skill. This helps a customer choose your skill in the skill store.
    • One or more content feeds for a flash briefing skill. These feeds can contain audio content that is played to the customer or text content that Alexa reads to the customer.
  • Video Skills

    A video skill enables you to provide video content such as TV shows and movies for customers.

    • The words users say to make (or invoke) those requests. For example:
      • “play Manchester by the Sea”
      • “change to channel 4”

    You (as the creator) define:

    • The name, description and images for a video skill. This helps a customer choose your skill in the skill store.

    • The requests the skill can handle such as playing and searching for video content and how video content search results display.
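The skill adapter mentioned under Smart Home Skills can be sketched as a small dispatcher that maps a directive to a device action. The directive shape here (`name`, `deviceId`, `value`) and the in-memory device table are simplified and made up for illustration; they are not the real Smart Home Skill API message format, which carries more structure.

```python
# Hypothetical in-memory device table standing in for a vendor's device cloud.
DEVICES = {"living-room-light": {"power": "OFF", "brightness": 50}}


def skill_adapter(directive: dict) -> dict:
    """Apply one directive to a device and report the outcome.

    Assumption: `directive` uses a simplified, made-up shape
    ({"name", "deviceId", "value"}), not the real Smart Home Skill API format.
    """
    device = DEVICES.get(directive.get("deviceId"))
    if device is None:
        return {"ok": False, "error": "NO_SUCH_DEVICE"}

    name = directive.get("name")
    if name == "TurnOn":
        device["power"] = "ON"
    elif name == "TurnOff":
        device["power"] = "OFF"
    elif name == "SetBrightness":
        # Clamp to the 0-100 range so a bad value cannot corrupt the state.
        device["brightness"] = max(0, min(100, int(directive["value"])))
    else:
        return {"ok": False, "error": "UNSUPPORTED_DIRECTIVE"}
    return {"ok": True, "state": dict(device)}


if __name__ == "__main__":
    print(skill_adapter({"name": "TurnOn", "deviceId": "living-room-light"}))
```

The point of the split is that Alexa interprets the spoken phrase and decides which directive to send, while the adapter only translates that directive into a call on the vendor's own devices.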

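Case studies

  1. Tutorial: connecting to the Amazon Echo speaker (接入亚马逊Echo音箱教程)
  2. Echo and Alexa explained in one article: how is Amazon playing the voice game with Apple's playbook? (一文看懂 Echo 和 Alexa,亚马逊如何用苹果的玩法在玩语音?)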