Recently I have used IBM Watson for speech-to-text, how I used it is as follows:


pip install --upgrade ibm-watsonpip install --upgrade watson-developer-cloudsudo -H pip install --ignore-installed six ibm-watson

Getting credentials

  1. Go to https://www.ibm.com/cloud/watson-speech-to-text
  2. Choose to Get started free
  3. Choose the Lite plan
  4. Login
  5. Choose Create
  6. Go to the Manage option on the left-hand side and copy the API key.

The Code

I have used the following file by replacing my API key and changing the input audio file.

It provides the output including the start and end of each word along with confidence. It also provides the complete script along with the overall confidence.





MS Thesis Student, CVGL, LUMS http://pk.linkedin.com/in/talhahanifbutt

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Core EOS: Building it One More Time

QC — The strength and the weakness of Qubits in Quantum Computing

Basic authentication for Springboot REST API application with HandlerInterceptor

Scrum for beginners, a short introduction

Remote spark-submit to YARN running on EMR

How to build a high performance and well managed logging system architecture in your project

What is Chaos Engineering?

Flutter : A Beginners guide to Flutter

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Talha Hanif Butt

Talha Hanif Butt

MS Thesis Student, CVGL, LUMS http://pk.linkedin.com/in/talhahanifbutt

More from Medium

8 Ways on applying Deep Learning into various business use-cases.

Status Quo: Getting into AI, ML, DNNs, …

Where to get started with Machine Learning

Using Deep Stats for Performance-Based Soccer Player Valuations

Segna Newsletter — 9 December 2021