Containerizing Kaldi - the speech recognition toolkit

Kaldi is a very popular Automatic Speech Recognition (ASR) toolkit. In my experience, setting up Kaldi on a system can take a huge amount of time, so it is best to use a containerized approach. The following is the Dockerfile that I wrote. We will be using a CUDA-enabled Ubuntu 18.04 image from Nvidia NGC:

    FROM nvidia/cuda:11.1.1-cudnn8-devel-ubuntu18.04
    LABEL Author="umerjamil16@github.io" Email="umer1694@gmail.com"
    LABEL Description="Kaldi ASR Image" Vendor="x" Version="1.0"
    RUN sh -c 'echo "APT { Get { AllowUnauthenticated \"1\"; }; };" > /etc/apt/apt....

November 29, 2022 · 2 min · 318 words · Umer Jamil