
A thesis on Automatic extraction of relevant metadata from educational videos for efficient indexing

Project-I (ME47603) report

by

Shalika Kumbham
(18ME10030)

Under the supervision of
Professor Krothapalli Sreenivasa Rao

Department of Computer Science and Engineering
Indian Institute of Technology Kharagpur

Acknowledgment
I sincerely thank my supervisor, Professor Krothapalli Sreenivasa Rao, and my mentor, Mr. Abhijit Debnath (Ph.D. Scholar), for their constant guidance. I thank IIT Kharagpur for providing the opportunity to take up my project in a department outside my own, as per my interests.
Abstract:
Due to the COVID situation, more classes are being held online, so more video lectures are being recorded, maintained, and uploaded. We generally observe that these educational videos have metadata containing five to six attributes: Institute Name, Publisher Name, Department Name, Professor Name, Subject Name, and Topic Name. It would be much easier to maintain these videos if we could organize them by these categories. In this project, we try to extract the above metadata from video lectures.

Background:
Students have had to make many adjustments to the COVID situation, and one of the most affected aspects is the mode of classes. Creating online videos and maintaining them is a tedious task. It would be much easier if we could organize them automatically, with little human interaction, saving a lot of time and effort. We are therefore trying to develop an interface that extracts the information needed to organize videos into their respective categories. Given a video, it will provide the information belonging to those categories. For example, for an NPTEL video on Image Processing taught by Prof. K. S. Rao from the Computer Science Department, IIT Kharagpur, we have to map the information to its respective field: Publisher Name - NPTEL, Professor Name - K. S. Rao, Department Name - Computer Science, etc.

Literature Review:
The TIB AV-Portal is a portal for scientific videos with a focus on technology and engineering as well as architecture, chemistry, computer science, mathematics, and physics. Other scientific disciplines are also included. The videos include computer visualizations, learning material, simulations, experiments, interviews, video abstracts, lecture and conference recordings, and (open) audiovisual learning and teaching materials. The portal applies various automatic video analyses, whose added value is that they enable pinpoint searches within the video content.
NDLI is an integration platform that sources metadata from a massive number of sources to provide single-window access to a wide variety of digital learning resources. Metadata summarizes the basic information of a resource, which can make finding and working with particular resources easier. Different sources follow different metadata standards across a variety of resources. The NDLI metadata standard has been developed to draw these into a uniform schema; it is an envelope of several well-established global standards.
The Dublin Core Metadata Initiative (DCMI), which formulates the Dublin Core, is a project of the Association for Information Science and Technology (ASIS&T). "Dublin Core" is also used as an adjective for Dublin Core metadata, a style of metadata that draws on multiple Resource Description Framework (RDF) vocabularies, packaged and constrained in Dublin Core application profiles.
The Dublin Core Metadata Initiative (DCMI)[9] provides an open forum for developing
interoperable online metadata standards for a broad range of purposes and business models.
DCMI’s activities include
● consensus-driven working groups,
● global conferences and workshops,
● standards liaison, and
● educational efforts to promote widespread acceptance of metadata standards and
practices.

Method:
We divided the problem into subproblems and tackled them individually.
Subproblems:
1. Creation of a dataset of lecture videos containing the introduction part of each video, and preparation of the ground truth for each video.
2. Identification of keyframes and application of a text localization method to locate text in the videos.
3. Creation of an indexing system to map the above-mentioned attributes from the extracted metadata.
4. Evaluation of the efficiency of the indexing system, using the Levenshtein distance algorithm and accuracy.
Dataset Creation - To create the dataset, we first collected the URLs of all the videos, along with their start and end times, in a text file and prepared the ground truth. We then used the youtube-dl library to download the videos from the URL links and save them in the desired format from the given options. The FFmpeg tool was then used to cut each video at its respective start and end times (as listed in the text file).
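A minimal sketch of this step in Python, assuming the text file holds one `<url> <start> <end>` entry per line (the report does not specify the exact file layout or download options):

```python
import subprocess
import youtube_dl  # pip install youtube-dl

def download_and_trim(url, start, end, out_path):
    """Download a lecture video, then keep only its introduction segment."""
    opts = {"format": "mp4", "outtmpl": "full_video.%(ext)s"}
    with youtube_dl.YoutubeDL(opts) as ydl:
        ydl.download([url])
    # Cut the clip with FFmpeg using stream copy (no re-encoding).
    subprocess.run(
        ["ffmpeg", "-y", "-i", "full_video.mp4",
         "-ss", start, "-to", end, "-c", "copy", out_path],
        check=True,
    )

with open("videos.txt") as f:  # hypothetical ground-truth file name
    for i, line in enumerate(f):
        url, start, end = line.split()
        download_and_trim(url, start, end, f"clip_{i:03d}.mp4")
```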

Keyframes Identification - There are many papers on keyframe extraction. We tried two or three methods and finalized the ffprobe tool, which gave the best results. This tool categorizes the frames by picture type, such as 'I' and 'P', where 'I' corresponds to keyframes. So we gather all the frames marked as 'I' and save them as the keyframes.
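One way to implement this step (a sketch; the exact commands used in the project are not given) is to query frame picture types with ffprobe, or to let FFmpeg's select filter save the I-frames directly:

```python
import subprocess

# List the picture type (I/P/B) of every frame in the clip.
probe = subprocess.run(
    ["ffprobe", "-v", "error", "-select_streams", "v:0",
     "-show_entries", "frame=pict_type", "-of", "csv=p=0", "clip.mp4"],
    capture_output=True, text=True, check=True,
)
print(probe.stdout.split()[:10])  # e.g. ['I', 'P', 'P', ...]

# Save the 'I' frames as images in a single pass.
subprocess.run(
    ["ffmpeg", "-i", "clip.mp4",
     "-vf", "select=eq(pict_type\\,I)", "-vsync", "vfr",
     "keyframe_%03d.png"],
    check=True,
)
```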

Text localization - EasyOCR is used to identify the text in a given frame. It provides the text, its probability, and the coordinates of the bounding box for each text box. The text can be recognized either as lines or as paragraphs; for our convenience, we decided to use the paragraph option.
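A small EasyOCR sketch of this step (in paragraph mode the reader merges nearby lines into blocks; to our knowledge, it then reports only the bounding box and text for each block):

```python
import easyocr  # pip install easyocr

# Build the reader once; the English model is downloaded on first use.
reader = easyocr.Reader(["en"])

# paragraph=True groups nearby text lines into paragraph blocks.
results = reader.readtext("keyframe_001.png", paragraph=True)

for item in results:
    bbox, text = item[0], item[1]  # paragraph mode omits the confidence score
    print(bbox, "->", text)
```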

Indexing System - We used dictionaries of Publisher Names, Institute Names, and Department Names, together with the fuzzywuzzy library, to locate these categories in the extracted text and map them accordingly. To identify professor names, we used predefined rules and an NER model to pick out person names from a given string.
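A sketch of the dictionary lookup with fuzzywuzzy, plus a person-name pass with spaCy standing in for the unnamed NER model (the dictionary entries, threshold, and model choice here are illustrative, not the project's actual ones):

```python
from fuzzywuzzy import fuzz, process  # pip install fuzzywuzzy
import spacy                          # pip install spacy

PUBLISHERS = ["NPTEL", "MIT OpenCourseWare", "Coursera"]       # illustrative
INSTITUTES = ["IIT Kharagpur", "IIT Madras", "IISc Bangalore"]

def map_attribute(ocr_text, dictionary, threshold=80):
    """Return the dictionary entry that best matches the OCR text, or None."""
    match, score = process.extractOne(
        ocr_text, dictionary, scorer=fuzz.partial_ratio
    )
    return match if score >= threshold else None

text = "NPTEL - Image Processing, IIT Kharagpur, Prof. K S Rao"
print(map_attribute(text, PUBLISHERS))  # NPTEL
print(map_attribute(text, INSTITUTES))  # IIT Kharagpur

# NER pass for professor names (spaCy model assumed for illustration).
nlp = spacy.load("en_core_web_sm")
doc = nlp(text)
print([ent.text for ent in doc.ents if ent.label_ == "PERSON"])
```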
Evaluation - We evaluated the system's efficiency using the accuracy formula, i.e., the number of videos mapped correctly for a particular category divided by the total number of videos. Since EasyOCR might not read every letter correctly, there can be errors in the professors' names, so we combined the Levenshtein distance algorithm with accuracy to evaluate the Professor Name category.
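A sketch of the combined metric, assuming a professor-name prediction counts as correct when its normalized Levenshtein similarity to the ground truth clears a threshold (the exact matching rule is not stated in the report):

```python
import Levenshtein  # pip install python-Levenshtein

def is_correct(predicted, truth, threshold=0.75):
    """Accept near matches so that small OCR character errors still count."""
    if not predicted or not truth:
        return False
    distance = Levenshtein.distance(predicted.lower(), truth.lower())
    return 1 - distance / max(len(predicted), len(truth)) >= threshold

def accuracy(predictions, ground_truths):
    """Share of videos whose predicted name matches the ground truth."""
    correct = sum(is_correct(p, t) for p, t in zip(predictions, ground_truths))
    return 100.0 * correct / len(ground_truths)

# 'K S Rao' vs 'K. S. Rao' differs only by OCR-scale edits, so both pass.
print(accuracy(["K S Rao", "P K Biswas"], ["K. S. Rao", "P. K. Biswas"]))
```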

Results:
The accuracies for the four categories are:
1. Publisher Names - 88.03%
2. Institute Names - 88.88%
3. Department Names - 82.47%
4. Professor Names - 85.89%

Conclusion:
We developed an indexing system that maps the four categories efficiently. Given the time constraints, we could not pursue the other two categories, so future work can focus on mapping those two categories and on developing an interface, such as a web application, that provides the individual results when given a video as input. Although our keyframe extraction was more accurate than the other methods we tried, it could still be improved to obtain clearer frames, which would help reduce OCR errors and avoid repeated frames.
