How to train your open source HAL 9000
The current state of opensource speech to text processing.
Presented by:
Aaron Chantrill
My earliest memory is of lying in my crib after being passed around by a group of adults and thinking to myself, "When I can talk, I'm going to ask them not to stick their thumbs under babies armpits when they pick them up". I have always been fascinated with cognition and communication. As a kid, I was more interested in writing video games and figuring out how they worked than with playing them. At Johns Hopkins in the late 80's, I got to work with professor Sejnowski on some early neural networks classifying phonemes from recorded speech. Over the last twenty years I have worked in machine learning, database administration, and web design with a focus on free and opensource software. I currently work as a medical informaticist. I am deeply concerned about the renormalization of expectations around privacy.
In this tutorial, we will get a copy of Jasper Voice Assistant up and running on Debian Stretch.
Jasper is an open source virtual assistant written in Python. It runs well on a Raspberry Pi 2 or 3. The particular version of Jasper we will be installing is one I have been working on for several months and includes a plugin for playing z-machine text adventure games.
Jasper can use local speech to text and text to speech systems, making it a good platform for the privacy concious tinkerer who likes the idea of a computer you can talk to, but prefers not to have random snippets of household conversation shared with Alphabet, Amazon or Apple.
This tutorial includes setting up and configuring the PocketSphinx and DeepSpeech Speech To Text and Festival Text To Speech systems, and writing a simple Jasper SpeechHandler plugin in Python.
Through this tutorial, we will talk about privacy concerns with large closed-source speech to text processing systems, the current state of open source speech to text processing, the importance of public voice datasets and how to contribute to them, and how speech recognition intersects with artificial intelligence.
Please bring a laptop with you with vagrant and virtualbox installed. Please download the aaronchantrill/JasperVB vagrant file (vagrant box add aaronchantrill/JasperVB). This box has an 18GB dynamically allocated hard drive and uses the standard vagrant insecure SSH key and passwords. This file is about 11GB. I will publish a video of the complete process (still taking 3 hours to compile) and a video of how to setup and test the vagrant box.
I plan to spend the first hour introducing the common strategies used by HMM and ANN speech to text systems, exploring the files used, building some custom dictionaries and doing some live demonstrations. The second hour will be spent creating a Jasper configuration file (naming the assistant), writing a simple SpeechHandler plugin, and talking about future development.
This tutorial was designed at The Foundry, Bellingham’s makerspace.
- Date:
- 2018 April 29 - 05:00
- Duration:
- 2 h
- Room:
- CC-201 TUT1
- Conference:
- LinuxFest Northwest 2018
- Language:
- Track:
- Code
- Difficulty:
- Medium
- Building and using your own inexpensive, Docker-based pico-cluster
- Start Time:
- 2018 April 29 05:00
- Room:
- CC-202 TUT2
- How to train your open source HAL 9000
- Start Time:
- 2018 April 29 05:00
- Room:
- CC-201 TUT1
- State of Video Copyright & Free Culture
- Start Time:
- 2018 April 29 05:30
- Room:
- G-103
- Linux 101: Just the basics
- Start Time:
- 2018 April 29 05:30
- Room:
- HC-108
- Securing Docker on the Cheap - Part 1 - Fundamentals
- Start Time:
- 2018 April 29 05:30
- Room:
- CC-208
- Applying Object Character Recognition and Kubernetes to Twitch
- Start Time:
- 2018 April 29 05:30
- Room:
- CC-200
- Switching to the BSDs: A crash course in FreeBSD, FreeNAS, TrueOS and OpenBSD
- Start Time:
- 2018 April 29 05:30
- Room:
- CC-236
- Presentation Skills for Open Source Folks
- Start Time:
- 2018 April 29 05:30
- Room:
- CC-115
- Linux in a Microsoft world
- Start Time:
- 2018 April 29 05:30
- Room:
- CC-114
- Maintaining local forks
- Start Time:
- 2018 April 29 05:30
- Room:
- CC-235
- Tables and columns and rows! Oh my!
- Start Time:
- 2018 April 29 05:30
- Room:
- HC-103 Postgres
- Linux 102: Choosing the right distro for you
- Start Time:
- 2018 April 29 06:45
- Room:
- HC-108
- Securing Docker on the Cheap - Part 2 - Vulnerabilities
- Start Time:
- 2018 April 29 06:45
- Room:
- CC-208
- Introduction to the Clojure Programming Language
- Start Time:
- 2018 April 29 06:45
- Room:
- CC-236
- Endless Summer of Code: Building the Foundation of a Community Through GSOC
- Start Time:
- 2018 April 29 06:45
- Room:
- CC-115
- Linux email clients give me a migrane
- Start Time:
- 2018 April 29 06:45
- Room:
- CC-114
- Let's uninstall GRUB
- Start Time:
- 2018 April 29 06:45
- Room:
- CC-235
- Brave New World
- Start Time:
- 2018 April 29 06:45
- Room:
- G-103
- End-to-end Encryption in Nextcloud
- Start Time:
- 2018 April 29 06:45
- Room:
- CC-200
- Linux Professional Institute: Exam Lab - Session 2
- Start Time:
- 2018 April 29 07:00
- Room:
- HC-112 LPI