0% found this document useful (0 votes)

4 views

Week-8 Nlp Lab Program

This document provides a Python program for converting audio files to text and text files to audio using the NLTK package and other libraries. It includes functions for converting MP3 to WAV, performing speech recognition, and generating audio from text, along with installation instructions for required packages and FFmpeg. The program allows users to choose between converting audio to text or text to audio and provides options for saving the results.

Uploaded by

227r1a7349

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

4 views

Week-8 Nlp Lab Program

Uploaded by

227r1a7349

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

EXPERIMENT-8

Natural Language Processing Lab

Write a python program to convert audio file to text and text file to audio

files using NLTK Package.

Requirements

To run this program, you'll need to install the following packages:

pip install nltk SpeechRecognition gtts pydub

import nltk

from nltk.tokenize import word_tokenize, sent_tokenize

import speech_recognition as sr

from gtts import gTTS

import os

# Download NLTK data (only needed once)

nltk.download('punkt')

from pydub import AudioSegment

import os

def mp3_to_wav(mp3_file_path, wav_file_path=None):

"""

Convert MP3 file to WAV format using pydub.

Args:

mp3_file_path (str): Path to the input MP3 file

wav_file_path (str): Path to save the output WAV file (optional)

If not provided, replaces .mp3 with .wav

Returns:

str: Path to the created WAV file

"""
try:

# If output path not specified, create one by replacing extension

if wav_file_path is None:

wav_file_path = os.path.splitext(mp3_file_path)[0] + '.wav'

# Load MP3 file

audio = AudioSegment.from_mp3(mp3_file_path)

# Export as WAV

audio.export(wav_file_path, format="wav")

print(f"Successfully converted {mp3_file_path} to {wav_file_path}")

return wav_file_path

except Exception as e:

print(f"Error converting MP3 to WAV: {e}")

return None

# Example usage

if __name__ == "__main__":

input_mp3 = "input.mp3" # Change to your MP3 file path

output_wav = "output.wav" # Change to desired WAV file path

mp3_to_wav(input_mp3, output_wav)

def audio_to_text(mp3_file_path):

"""

Convert MP3 to WAV, then perform speech recognition and NLP processing

"""

try:

# First convert MP3 to WAV

wav_file = mp3_to_wav(mp3_file_path)
# Then do speech recognition

recognizer = sr.Recognizer()

with sr.AudioFile(wav_file) as source:

audio_data = recognizer.record(source)

text = recognizer.recognize_google(audio_data)

# NLP processing with NLTK

tokens = word_tokenize(text)

print("Recognized text tokens:", tokens)

return text

except Exception as e:

print(f"Error in MP3 to text conversion: {e}")

return None

def text_to_audio(text, output_file="output.mp3", language='en'):

"""

Convert text to speech and save as an audio file using gTTS.

"""

try:

# Tokenize text into sentences for better processing

sentences = sent_tokenize(text)

processed_text = ' '.join(sentences)

tts = gTTS(text=processed_text, lang=language, slow=False)

tts.save(output_file)

print(f"Audio file saved as {output_file}")

return output_file
except Exception as e:

print(f"Error in text-to-speech conversion: {e}")

return None

def text_file_to_audio(text_file_path, output_file="output.mp3", language='en'):

"""

Read text from a file and convert it to speech.

"""

try:

with open(text_file_path, 'r', encoding='utf-8') as file:

text = file.read()

return text_to_audio(text, output_file, language)

except Exception as e:

print(f"Error reading text file: {e}")

return None

def main():

print("Audio and Text Conversion Tool")

print("1. Audio file to Text")

print("2. Text file to Audio")

choice = input("Enter your choice (1 or 2): ")

if choice == '1':

audio_file = input("Enter audio file path (WAV, AIFF, FLAC): ")

text = audio_to_text(audio_file)

if text:

print("\nConverted Text:")

print(text)

# Save to file

save_choice = input("Save to text file? (y/n): ").lower()

if save_choice == 'y':

output_file = input("Enter output text file name (e.g., output.txt): ")

with open(output_file, 'w', encoding='utf-8') as f:

f.write(text)

print(f"Text saved to {output_file}")

elif choice == '2':

text_file = input("Enter text file path: ")

output_audio = input("Enter output audio file name (e.g., output.mp3): ")

result = text_file_to_audio(text_file, output_audio)

if result:

print(f"Successfully created audio file: {result}")

# Option to play the audio

play_choice = input("Play the audio file? (y/n): ").lower()

if play_choice == 'y':

os.system(f"start {result}" if os.name == 'nt' else f"xdg-open {result}")

else:

print("Invalid choice")

if __name__ == "__main__":

main()
Additionally, you'll need FFmpeg installed on your system:

FFmpeg Installation Guide for Windows

1. Download FFmpeg:

o Direct download link: https://www.gyan.dev/ffmpeg/builds/

o Choose: ffmpeg-release-essentials.zip (latest version)

o Alternative official source: https://ffmpeg.org/download.html

2. Install FFmpeg:

o Extract the ZIP file to a permanent location (e.g., C:\ffmpeg)

o Copy the path to the bin folder (e.g., C:\ffmpeg\bin)

3. Add FFmpeg to System PATH:

o Press Win + R, type sysdm.cpl, and press Enter

o Go to "Advanced" tab → "Environment Variables"

o Under "System variables", find and select "Path" → Click "Edit"

o Click "New" and paste your FFmpeg bin path (e.g., C:\ffmpeg\bin)

o Click "OK" on all windows to save

4. Verify Installation:

o Open Command Prompt (Win + R, type cmd)

o Run: ffmpeg -version

o You should see version information if installed correctly

Notes:

1. Audio to Text:

o Uses Google Speech Recognition API (free but requires internet)

o Works best with uncompressed WAV, AIFF, or FLAC files

o For other formats, you might need to convert them first

2. Text to Audio:

o Uses Google Text-to-Speech (gTTS) which requires internet

o Outputs as MP3 by default

o Includes NLTK sentence tokenization for better speech flow

NLP EXP 8
No ratings yet
NLP EXP 8
2 pages
Training Project.pptyx
No ratings yet
Training Project.pptyx
11 pages
Pdf2mp3 Py
No ratings yet
Pdf2mp3 Py
4 pages
TSA Lab 2
No ratings yet
TSA Lab 2
3 pages
SpeechRecognition
No ratings yet
SpeechRecognition
5 pages
Artificial Intelligence Project Report-Ads18a00095y
No ratings yet
Artificial Intelligence Project Report-Ads18a00095y
3 pages
dhara_NLP_Practical
No ratings yet
dhara_NLP_Practical
67 pages
Speech To Text Conversion
No ratings yet
Speech To Text Conversion
7 pages
Speech Recog
No ratings yet
Speech Recog
5 pages
Voice_Assistant_Report
No ratings yet
Voice_Assistant_Report
4 pages
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
No ratings yet
Voice Assistant - Doge: Bachelor of Engineering IN Computer Science & Engineering
48 pages
Voice Assistant Suggetion
No ratings yet
Voice Assistant Suggetion
3 pages
2.5 Automatic Speech Recognition
No ratings yet
2.5 Automatic Speech Recognition
8 pages
Voice_Identification_GLM4_Guide
No ratings yet
Voice_Identification_GLM4_Guide
2 pages
202100123__Priyank__Dewashish
No ratings yet
202100123__Priyank__Dewashish
15 pages
Spoken Language Processing in Python Chapter3
No ratings yet
Spoken Language Processing in Python Chapter3
26 pages
Pydub
No ratings yet
Pydub
26 pages
Labs_9
No ratings yet
Labs_9
4 pages
Ai
No ratings yet
Ai
2 pages
Import Datetime
No ratings yet
Import Datetime
6 pages
application_code_exp2 (2)
No ratings yet
application_code_exp2 (2)
4 pages
Exno8 Lab
No ratings yet
Exno8 Lab
4 pages
speech_recog[1]
No ratings yet
speech_recog[1]
2 pages
Assistant
No ratings yet
Assistant
2 pages
Text to Speech Presentation
No ratings yet
Text to Speech Presentation
7 pages
CHAPTER 1 INTRODUCTION
No ratings yet
CHAPTER 1 INTRODUCTION
12 pages
Lecture
No ratings yet
Lecture
7 pages
Audio Processing Packages
No ratings yet
Audio Processing Packages
4 pages
Speech Recognition System
No ratings yet
Speech Recognition System
16 pages
Chat Bot 1
No ratings yet
Chat Bot 1
7 pages
2. Sphinx speech recognition
No ratings yet
2. Sphinx speech recognition
5 pages
Create Audio Effects App in Python
No ratings yet
Create Audio Effects App in Python
5 pages
TEXT - TO - SPEECH - CONVERSION - 22215a1211
No ratings yet
TEXT - TO - SPEECH - CONVERSION - 22215a1211
8 pages
Python Ai
No ratings yet
Python Ai
3 pages
Data Sorting Guideline
No ratings yet
Data Sorting Guideline
2 pages
Jarvis For Windows
No ratings yet
Jarvis For Windows
1 page
Speech To Text - No Need To Write - 03
No ratings yet
Speech To Text - No Need To Write - 03
1 page
aa Alexa
No ratings yet
aa Alexa
3 pages
explanation
No ratings yet
explanation
4 pages
Spoken Language Processing in Python Chapter2
No ratings yet
Spoken Language Processing in Python Chapter2
23 pages
Presentation On - Ohh Toodle, An Assistant: Presented by Presented To
No ratings yet
Presentation On - Ohh Toodle, An Assistant: Presented by Presented To
10 pages
Text Into Speech Python Report
No ratings yet
Text Into Speech Python Report
18 pages
How Speech Recognition Works: Hidden Markov Model
No ratings yet
How Speech Recognition Works: Hidden Markov Model
25 pages
Sujal Kumar Sinha - IOT - MATLAB Mini
No ratings yet
Sujal Kumar Sinha - IOT - MATLAB Mini
13 pages
Speech to Text
No ratings yet
Speech to Text
17 pages
Python GuiaUser
No ratings yet
Python GuiaUser
23 pages
suryanarayan 3
No ratings yet
suryanarayan 3
2 pages
Voice Assistant Using Python 2
No ratings yet
Voice Assistant Using Python 2
20 pages
Voice_Assistant_Report_40_Pages
No ratings yet
Voice_Assistant_Report_40_Pages
44 pages
Virtual Assistance Project Brief
No ratings yet
Virtual Assistance Project Brief
8 pages
Voice M
No ratings yet
Voice M
19 pages
Basic Operations
No ratings yet
Basic Operations
15 pages
Jarvis Voice Assistant
No ratings yet
Jarvis Voice Assistant
2 pages
My Jarvis
No ratings yet
My Jarvis
2 pages
py report
No ratings yet
py report
8 pages
Voice Assistant
No ratings yet
Voice Assistant
3 pages
code
No ratings yet
code
4 pages
Vioce Assistant by Python
No ratings yet
Vioce Assistant by Python
38 pages
Python Based Voice Assistant Presentation
No ratings yet
Python Based Voice Assistant Presentation
8 pages
Mastering Go A Practical Guide to Developers: A Practical Guide to Developers
From Everand
Mastering Go A Practical Guide to Developers: A Practical Guide to Developers
Miguel Miranda de Mattos
No ratings yet
bfs 569
No ratings yet
bfs 569
10 pages
Naveen Resume
No ratings yet
Naveen Resume
1 page
Issues in Machine Learning With Conclution (2)
No ratings yet
Issues in Machine Learning With Conclution (2)
8 pages
lab 16
No ratings yet
lab 16
3 pages
iomp ppt (1)
No ratings yet
iomp ppt (1)
11 pages
16th Program
No ratings yet
16th Program
7 pages
Lab 13 for Manual
No ratings yet
Lab 13 for Manual
4 pages
Program 12
No ratings yet
Program 12
7 pages
lab manual 15 (1)
No ratings yet
lab manual 15 (1)
7 pages
Agriculture Geography
No ratings yet
Agriculture Geography
61 pages
Pitot Tube
No ratings yet
Pitot Tube
11 pages
8475PZ7
No ratings yet
8475PZ7
8 pages
PAN Change Application
No ratings yet
PAN Change Application
1 page
RA-9729 Group 9
No ratings yet
RA-9729 Group 9
13 pages
Who Are We?: Reference Number: MAN - MYI - 2020001
No ratings yet
Who Are We?: Reference Number: MAN - MYI - 2020001
2 pages
How To Practice Chorale 6 Feb 2019 at 1:17 PM PDF
100% (1)
How To Practice Chorale 6 Feb 2019 at 1:17 PM PDF
1 page
EPS
No ratings yet
EPS
31 pages
Duracoat AR: Elastomeric, Flexible Cementitious Waterproofing Coating
No ratings yet
Duracoat AR: Elastomeric, Flexible Cementitious Waterproofing Coating
3 pages
Eoi Imported Sand English 2
No ratings yet
Eoi Imported Sand English 2
7 pages
HJK Ce Atex-Nec-Cen-Iec - 2021
No ratings yet
HJK Ce Atex-Nec-Cen-Iec - 2021
25 pages
Amendment No. 3 March 2017 TO Is 1786: 2008 High Strength Deformed Bars and Wires For Concrete Reinforcement - Specification
100% (2)
Amendment No. 3 March 2017 TO Is 1786: 2008 High Strength Deformed Bars and Wires For Concrete Reinforcement - Specification
3 pages
TLS06F006-C Covidien PB540 PB560 Spec 2982400 Rev 2 - 7
No ratings yet
TLS06F006-C Covidien PB540 PB560 Spec 2982400 Rev 2 - 7
24 pages
Lecture 1
No ratings yet
Lecture 1
19 pages
History NCERT Class 9
100% (1)
History NCERT Class 9
27 pages
Your Guide To Planets Stars and Galaxies PDF
No ratings yet
Your Guide To Planets Stars and Galaxies PDF
14 pages
3rd Grade
No ratings yet
3rd Grade
2 pages
Dsa Q
No ratings yet
Dsa Q
45 pages
Download Full Patent Management: Protecting Intellectual Property and Innovation 1st Edition Oliver Gassmann PDF All Chapters
No ratings yet
Download Full Patent Management: Protecting Intellectual Property and Innovation 1st Edition Oliver Gassmann PDF All Chapters
55 pages
Master Thesis Civil Engineering PDF
100% (3)
Master Thesis Civil Engineering PDF
4 pages
Xii Ip Records
No ratings yet
Xii Ip Records
10 pages
Grade 7 English Curriculum Map
No ratings yet
Grade 7 English Curriculum Map
16 pages
Stuffing Box
67% (3)
Stuffing Box
2 pages
Cancer Pathophysiology Final
100% (1)
Cancer Pathophysiology Final
3 pages
Copyofluckertbenchmarklesson-Makinginferencessol7 5g
No ratings yet
Copyofluckertbenchmarklesson-Makinginferencessol7 5g
2 pages
Events and Issues - Script
No ratings yet
Events and Issues - Script
2 pages
Chapter 3
No ratings yet
Chapter 3
51 pages
Willa B. Brown
No ratings yet
Willa B. Brown
1 page
01 80 13 Project Site Design Criteria PDF
No ratings yet
01 80 13 Project Site Design Criteria PDF
2 pages
1 Cloud Computing
No ratings yet
1 Cloud Computing
5 pages