CentOS7搭建whisper语音识别系统

xingyun86 2023-12-28 1045

CentOS7搭建whisper语音识别系统

1.安装ffmpeg

sudo rpm --import http://li.nux.ro/download/nux/RPM-GPG-KEY-nux.ro
sudo rpm -Uvh http://li.nux.ro/download/nux/dextop/el7/x86_64/nux-dextop-release-0-5.el7.nux.noarch.rpm
sudo yum install ffmpeg

2.python3.9安装

yum install -y git gcc make openssl-devel bzip2-devel libffi-devel zlib-devel readline-devel sqlite-devel
# 下载openssl-1.1.1t源代码包：
wget --no-check-certificate   https://www.openssl.org/source/openssl-1.1.1t.tar.gz
# 解压
tar -zxvf openssl-1.1.1t.tar.gz
cd openssl-1.1.1t/
# 指定openssl安装的目标路径
./config --prefix=/usr/local/my_openssl
# 在CPU占用不多的情况下，可以适当使用4个线程加速编译，可以根据需要调整线程数，
make # make -j4
make install
========================================================================================================
# 下载python3.9.7源代码包
wget https://www.python.org/ftp/python/3.9.7/Python-3.9.7.tgz
# 解压源代码包
tar -xf Python-3.9.7.tgz
# 进入源代码目录
cd Python-3.9.7
# 配置编译参数
./configure --enable-optimizations --with-openssl=/usr/local/my_openssl #把openssl安装路径配置到编译参数中
# 如果出现Could not import runpy module的报错，那么说明gcc版本太低，不支持--enable-optimizations参数，把它去掉就好
# 编译并安装Python
make
make altinstall
# 此时python安装完毕，但是由于附带了2.7.5版本的Python，所以此时查看Python的版本仍是2.7.5
# 查看python3的版本
python3.9 --version
=========================================================================================================
pip3.9 install torch torchvision torchaudio tqdm tiktoken numba

3.安装whisper

pip3.9 install git+https://github.com/openai/whisper.git
pip3.9 install --upgrade --no-deps --force-reinstall git+https://github.com/openai/whisper.gi

4.验证效果：

whisper audio.mp3（见附件）

Detected language: Chinese
[00:00.000 --> 00:03.000] 各位觀眾 晚上好
[00:03.000 --> 00:07.000] 今天是12月29號 星期四 農曆12月初期
[00:07.000 --> 00:09.000] 歡迎收看新聞聯播節目
[00:09.000 --> 00:12.000] 首先為您介紹今天節目的主要內容

上传的附件：

audio.mp3

最新回复 (0)

只看楼主

全部楼主

CentOS7搭建whisper语音识别系统

xingyun86

作者最近主题：