#Voicebook

Learn how to build voice applications in Python. The comprehensive guide to voice computing with 20+ Python libraries, 200+ starter scripts, and practical examples.

About the Book

It's an exciting time to be a developer in the voice computing space: 1 in 4 searches on Google are now voice-enabled, Amazon Alexa just passed 10,000 skills, and 100 million calls are completed on WhatsApp daily. But where do you go to start learning how to code in this field?

Whether you are a veteran developer or just starting out, this book guides you through the process of building voice-based applications in Python.

Buy Now

Why Buy This Book?

You will be able to build voice applications faster.

20+ Python Libraries

Applies >20 Python libraries to help you solve voice-related problems faster.

10+ Training Datasets

Get access to training voice datasets like the Common Voice Project or AudioSet.

200+ Starter Scripts

Over 200 scripts are provided on GitHub to get you up-and-running quickly.

Lasting Community

Become involved in the larger open source voice community, 45,000+ people and counting.

Choose Your Format

Available in digital and print formats

Kindle eBook

$9.99
  • Instant download
  • Searchable text
  • Adjustable font size
  • Sync across devices
Buy on Amazon

Testimonials

We're already seeing the #Voicebook used to train the next-generation of voice computing experts.

Larry Zhang

"I like the fact that there's a lot of code samples right after new concepts in the book are introduced. With this structure, I was able to quickly featurize and build machine learning models for voice datasets. The book is also filled with rich tables and visualizations; it's hard to find information like this in one place."

Larry Zhang
Software Engineer, NVIDIA
Dr. Reza Hosseini Ghomi

"I've been looking for a textbook for voice computing for several years now. In academic research I have struggled at times putting together material to efficiently orient new undergraduates, graduates, post-docs, and even other faculty to the field. A book like this is valuable to anyone doing work related to voice computing."

Reza Hosseini Ghomi, MD/MSE
Director of the DigiPsych Lab, University of Washington
James Fairey

"Collecting voice has many layers and challenges, Jim has successfully navigated the 'do's and dont's' into an easy guide for starters and seasoned voice collectors. I highly recommend this book if you need to collect great quality of voice, but need those voice files to produce great data at all levels."

James Fairey
Audio Director of Production, Entercom Communications
Dr. Marsal Gavalda

"This book provides an extraordinarily thorough overview of the entire field of speech technologies. Jim's practical ingenuity shines as he guides us through all the technical components and design choices involved in building a voice assistant. Use it to get your hands dirty in this exciting field!"

Marsal Gavalda, PhD
Head of Machine Learning Commerce Platform, Square

Frequently Asked Questions

Voice computing is the discipline that aims to develop hardware or software to process voice inputs. You can learn more about this field in the introductory video here. The #Voicebook focuses on building voice computing software applications in Python.

You can buy the book at these links:

Kindle eBook ($9.99): Amazon Kindle Store
Paperback ($29.99): Amazon

This book is geared for beginners or veteran programmers alike. Many of the chapters start off simple then go into more advanced topics. By the end of the book you'll be able to:

Understand how to read/write, record, clean, encrypt, playback, transcode, transcribe, compress, publish, featurize, model, and visualize voice files
Build your own voice computer and voice assistant from scratch
Design cutting-edge microservice server architectures on top of Docker and Kubernetes
Get access to 200+ starter scripts in a GitHub repository
Become involved in the larger open source voice community

Please use this information:

Schwoebel, J. (2018). An Introduction to Voice Computing in Python. Boston; Seattle, Atlanta: NeuroLex Laboratories. https://github.com/jim-schwoebel/voicebook

Newsletter

Subscribe to get monthly updates on voice computing.