Devide

The document contains a Python script that merges subtitles from a file by processing lines to combine consecutive subtitles without punctuation. It reads the input subtitle file, filters lines based on punctuation, and then merges them if they meet certain criteria. Finally, it writes the merged subtitles to an output file in the correct format.

Uploaded by

zrr20031119

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

7 views2 pages

Devide

Uploaded by

zrr20031119

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as TXT, PDF, TXT or read online on Scribd

You are on page 1/ 2

def merge_subtitles(lines):

merged_lines = []
i = 0
while i < len(lines):
if lines[i].strip().isdigit():
if i + 2 < len(lines):
current_num = lines[i].strip()
current_time = lines[i + 1].strip()
current_text = lines[i + 2].strip()

if i + 5 < len(lines) and lines[i + 3].strip().isdigit():

next_num = lines[i + 3].strip()
next_time = lines[i + 4].strip()
next_text = lines[i + 5].strip()

ends_with_punct = any(current_text.endswith(p) for p in ['.',

'...', '?', '!'])

if not ends_with_punct:
start_time = current_time.split(' --> ')[0]
end_time = next_time.split(' --> ')[1]
merged_time = f"{start_time} --> {end_time}"

merged_lines.append([current_num, merged_time, next_text])

i += 6
continue

merged_lines.append([current_num, current_time, current_text])

i += 3
else:
merged_lines.append([lines[i].strip()])
i += 1
else:
merged_lines.append([lines[i].strip()])
i += 1

return merged_lines

def process_subtitle_file(input_path, output_path):

with open(input_path, 'r', encoding='utf-8') as file:
lines = file.readlines()

keep_lines = set()

for i in range(len(lines)):
line = lines[i]
if any(sep in line for sep in ['.', '...', '?', '!']):
for j in range(max(0, i-2), min(len(lines), i+5)):
keep_lines.add(j)

filtered_lines = [lines[i] for i in range(len(lines)) if i in keep_lines]

merged_subtitles = merge_subtitles(filtered_lines)
with open(output_path, 'w', encoding='utf-8') as file:
new_index = 1
for i, subtitle in enumerate(merged_subtitles):
if len(subtitle) == 3: # 完整的字幕块
file.write(f"{new_index}\n")
file.write(f"{subtitle[1]}\n")
file.write(f"{subtitle[2]}")

if i < len(merged_subtitles) - 1:
file.write("\n\n")
else:
file.write("\n")
new_index += 1

input_file = r"(the path of file)"

output_file = r"(the path of target)"
process_subtitle_file(input_file, output_file)

A Short Progam: Pig Latin: Text '/n'.join (Lines)
No ratings yet
A Short Progam: Pig Latin: Text '/n'.join (Lines)
1 page
Lisp Interpreter in Rust
From Everand
Lisp Interpreter in Rust
Vishal Patil
1/5 (1)
Programming with MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
From Everand
Programming with MATLAB: Taken From the Book "MATLAB for Beginners: A Gentle Approach"
Peter Kattan
4.5/5 (3)
Hebing
No ratings yet
Hebing
1 page
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Python3 #
No ratings yet
Python3 #
2 pages
The Essential R Reference
From Everand
The Essential R Reference
Mark Gardener
No ratings yet
Criar Legenda
No ratings yet
Criar Legenda
2 pages
Python For Beginners
From Everand
Python For Beginners
Célio Azevedo
No ratings yet
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
From Everand
Python: Advanced Guide to Programming Code with Python: Python Computer Programming, #4
Charlie Masterson
No ratings yet
New Rich Text Document
No ratings yet
New Rich Text Document
4 pages
Introduction to PHP, Part 2, Second Edition
From Everand
Introduction to PHP, Part 2, Second Edition
Adam Majczak
No ratings yet
Python: Advanced Guide to Programming Code with Python
From Everand
Python: Advanced Guide to Programming Code with Python
Charlie Masterson
No ratings yet
New Rich Document
No ratings yet
New Rich Document
4 pages
Video Processing Guii
No ratings yet
Video Processing Guii
5 pages
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Movie Ticket Booking System
No ratings yet
Movie Ticket Booking System
41 pages
Subtitle Py
No ratings yet
Subtitle Py
1 page
C++ Functions and tutorial
From Everand
C++ Functions and tutorial
Nino Paiotta
No ratings yet
AIM1
No ratings yet
AIM1
7 pages
Olawson Programming Assignment 3
No ratings yet
Olawson Programming Assignment 3
2 pages
Claude Comparet DB
No ratings yet
Claude Comparet DB
8 pages
Morse
No ratings yet
Morse
4 pages
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Language Translator
No ratings yet
Language Translator
2 pages
Project Machine Translation
No ratings yet
Project Machine Translation
45 pages
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
From Everand
Oracle Certified Professional Java Programmer OCPJP 1Z0 809
Manish Soni
No ratings yet
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
From Everand
UNIX Shell Programming Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
A Short Course in Discrete Mathematics
From Everand
A Short Course in Discrete Mathematics
Edward A. Bender
3/5 (1)
Simplifying Data Science With Python
From Everand
Simplifying Data Science With Python
Billy David millican
No ratings yet
Message
No ratings yet
Message
4 pages
Python - Black Badger
No ratings yet
Python - Black Badger
3 pages
Markdown Renderer - Py
No ratings yet
Markdown Renderer - Py
7 pages
Python Strings1
No ratings yet
Python Strings1
1 page
Codigocompleto
No ratings yet
Codigocompleto
7 pages
Formatting
No ratings yet
Formatting
5 pages
Lab File Complete
No ratings yet
Lab File Complete
10 pages
FILE HANDLING 8th PROGRAM
No ratings yet
FILE HANDLING 8th PROGRAM
3 pages
A Beginner's guide to Python
From Everand
A Beginner's guide to Python
Steven Mcananey
No ratings yet
Code
No ratings yet
Code
7 pages
Cs Project
No ratings yet
Cs Project
23 pages
Project Adding Bullets To Wiki Markup
No ratings yet
Project Adding Bullets To Wiki Markup
3 pages
Pyton Strings
No ratings yet
Pyton Strings
1 page
Learn C++
From Everand
Learn C++
Durgesh
4.5/5 (9)
Introduction to Algorithms
From Everand
Introduction to Algorithms
S VASIST
No ratings yet
AI Lab Manual
No ratings yet
AI Lab Manual
24 pages
Assignment 6
No ratings yet
Assignment 6
8 pages
FDFinal Draft Strings
No ratings yet
FDFinal Draft Strings
16 pages
Lisp Programming Language
From Everand
Lisp Programming Language
Faiz ul haque Zeya
No ratings yet
Import Subprocess
No ratings yet
Import Subprocess
11 pages
PHP programming
From Everand
PHP programming
Nino Paiotta
No ratings yet
ChatGPT Queries Plus Codes Into Sections
No ratings yet
ChatGPT Queries Plus Codes Into Sections
12 pages
Python Code For NLP
No ratings yet
Python Code For NLP
6 pages
English Practice2
No ratings yet
English Practice2
1 page
Hangman
No ratings yet
Hangman
3 pages
Python for Absolute Beginners: Learn to Code Fast!
From Everand
Python for Absolute Beginners: Learn to Code Fast!
Ibnul Jaif Farabi
No ratings yet
Adding Bullets To Wiki Markup
No ratings yet
Adding Bullets To Wiki Markup
9 pages
Word Extraction-Best
No ratings yet
Word Extraction-Best
1 page
Pinaki Day2 Roll 11
No ratings yet
Pinaki Day2 Roll 11
7 pages
Text Processor
No ratings yet
Text Processor
3 pages

Devide

Uploaded by

Devide

Uploaded by

def merge_subtitles(lines):

if i + 5 < len(lines) and lines[i + 3].strip().isdigit():

ends_with_punct = any(current_text.endswith(p) for p in ['.',

merged_lines.append([current_num, merged_time, next_text])

merged_lines.append([current_num, current_time, current_text])

def process_subtitle_file(input_path, output_path):

filtered_lines = [lines[i] for i in range(len(lines)) if i in keep_lines]

input_file = r"(the path of file)"

You might also like