
Manual Action Estimation

The document describes the pipeline and code for a system that estimates actions in videos using computer vision and deep learning techniques. It involves: 1. Creating custom datasets from videos, using ball/player detection and VGG features to identify frames, actions, and ball/player positions. 2. Training an LSTM model on the custom datasets to predict actions by feeding it sequences of frames. 3. The code files described implement dataset creation using the different approaches, and train and use the LSTM model to estimate actions in test videos.

Action Estimation: User Manual

1. Environment and files:

● To activate the project environment: conda activate test
● To enter the project directory: cd Action_Estimation

The directory contains the necessary Python files, video files and datasets described in
the table below:

File Name                            File Type       Short Description

lfc_not_main_2.py                    Python file     The main part of the project; takes a
                                                     video file and outputs datasets
lstm2.py                             Python file     Takes datasets and trains an LSTM model
vgg_test.py                          Python file     Takes a video file and generates
                                                     datasets using the VGG approach
dataset_merger.py                    Python file     Modifies the VGG dataset by adding the
                                                     corresponding action features from our
                                                     custom dataset
dataset_30_actions_fcb_rma_full.csv  CSV dataset     Dataset gathered by our approach
dataset_vgg_2020.csv                 CSV dataset     Dataset gathered by the VGG approach
dataset_vgg_2020_merged.csv          CSV dataset     Modified VGG dataset with action
                                                     features
matchForLSTM.mp4                     MP4 video file  RMA vs FCB, 45-minute match
ast_aty_forLSTM.mp4                  MP4 video file  Astana vs Atyrau, 90-minute match
model_ex-100_acc-0.829508.h5         H5 model        Scene recognition model (82.9%
                                                     accuracy)
2. Pipeline of the system:

To predict with OUR approach (custom ball and player detection), run the following
sequence:
    lfc_not_main_2.py
    lstm2.py

To predict with the VGG approach, run the following sequence:
    lfc_not_main_2.py
    vgg_test.py
    dataset_merger.py
    lstm2.py
3. “lfc_not_main_2.py” file
Creates a custom dataset containing the frame number, action label, ball coordinates,
team owning the ball, etc., by performing player and ball detection.

Lines of code   Description

1-22            Importing necessary libraries
23-51           Class DominantColors with its function dominantColors: takes an
                image, performs K-Means clustering and returns a list of the
                dominant colors in the image (as integers)
53-58           Loading the scene recognition model
60-73           Mapping our actions to integers
74-99           Initializing variables
106-116         Function "process_countour", which determines to which team the
                players' contours belong
118-134         Function "find_closest_player", which returns the number of the
                team in possession of the ball
136-140         Loading the Custom_Ball_Detector model
143             Start of the while loop that analyzes each frame of the video
153-159         Saving the current frame to a file, then using this file as the
                input for scene recognition
154             Scene recognition is run only on every 15th frame (to keep the
                running time low)
163-182         Applying morphological transformations to the image in order to
                find the borders of the field
186             Condition that checks the current scene
187-195         Finding contours on the field
198-210         Checking the sizes of the contours to ensure they are player
                contours
214-237         Determining to which team each player belongs
239-258         Finding the 2 dominant colors within the field borders, ignoring
                the field color (executed only once, when 16 or more contours
                are detected)
262-279         Ball detection
297-328         Entering the player coordinates into an array
335-342         Condition that checks whether the ball has been detected for
                more than 30 frames (i.e. 1 second)
343-350         Performing DBSCAN clustering on the ball coordinates to remove
                noise and false detections
353-459         Categorizing the 30 actions according to the ball movement
462-493         Saving all data to the dataset
495-498         Clearing the arrays for the next frame's data
501-506         Visual representation of the detection via video output
508-512         Finishing the program
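The DominantColors step (lines 23-51) can be sketched as follows. This is a minimal illustration of the idea, assuming scikit-learn's KMeans rather than the project's exact code:

```python
import numpy as np
from sklearn.cluster import KMeans

def dominant_colors(image, k=2):
    """Cluster an RGB image's pixels with K-Means and return the k
    cluster centers as integer colors."""
    pixels = image.reshape(-1, 3).astype(float)   # one row per pixel
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(pixels)
    return km.cluster_centers_.astype(int)
```

In the project this runs once, when enough player contours are visible, and the field's own color is discarded so that the remaining dominant colors identify the teams' kits.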

4. “lstm2.py” file
Builds the model, predicts action labels from the dataset, and prints the accuracy.

Lines of code   Description

24-35, 50-72    Load and helper functions that became outdated after the
                problem of multiple ball detections was solved
38-48           Finds the maximum number of frames of one action. This is
                needed to give the split data a fixed dimension, which is why
                we must know the maximum action length in the dataset
74-98           load_data function: to feed a dataset to the LSTM network it
                must be 3-dimensional; the LSTM input shape is (number of
                samples, number_of_frames_in_max_sequence, features). The
                remaining lines are matrix manipulation that brings the input
                to these dimensions
100-115         If the dataset should be prepared with a fixed number of frames
                (rather than a fixed action sequence), use load_data_window. It
                takes the fixed frame count as an argument, manipulates the
                matrix accordingly, and builds train and test sets with the
                dimensions above
117-127         Function that returns the model. The lines are
                self-explanatory; the add function appends layers. When
                stacking LSTM layers, every LSTM layer except the last one must
                be created with return_sequences=True
129-137         Function that compares predicted and test values by checking
                whether the most probable action label equals the test label
139-156         Similar to the function above, but counts a success if either
                of the top-2 predicted actions matches the label
158-170         Plots a graph of the action distribution
172-189         main function, where the program begins. Each line is
                self-explanatory and refers to the functions described above.
                Also measures and outputs the running time
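The 3-dimensional input preparation described for load_data can be sketched as follows. This is a hedged illustration, not the file's exact code; max_len corresponds to the maximum action length found in lines 38-48:

```python
import numpy as np

def stack_actions(actions, max_len):
    """Zero-pad variable-length actions (each a (frames, features) array)
    into one array of shape (samples, max_len, features), the fixed
    3-D input shape an LSTM layer expects."""
    n_features = actions[0].shape[1]
    out = np.zeros((len(actions), max_len, n_features))
    for i, seq in enumerate(actions):
        out[i, :len(seq)] = seq    # shorter actions stay zero-padded
    return out
```

load_data_window differs only in that every sample is cut to a fixed window of frames instead of being padded to the longest action.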

5. “vgg_test.py” file
Creates a separate file in the directory containing the 4096 features predicted by the
VGG model's output.

Lines of code   Description

17              Builds the VGG16 model so that it outputs the layer before the
                last fully-connected layer, i.e. 4096 features
20-23           Function that appends an array to the end of a file
25-40           Loops through each frame of the video sequence and reshapes the
                image matrix so it is accepted as input by the model built
                above; at the end, appends the 4096 predicted features to the
                file
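The matrix manipulation in lines 25-40 amounts to shaping each frame into the input VGG16 expects. A minimal sketch, where nearest-neighbour indexing stands in for the real pipeline's resize call (and a real run would also apply keras' preprocess_input before prediction):

```python
import numpy as np

def frame_to_vgg_input(frame, size=224):
    """Resize a frame to size x size (nearest neighbour) and add a batch
    axis, producing the (1, 224, 224, 3) array VGG16 accepts."""
    h, w = frame.shape[:2]
    rows = np.arange(size) * h // size   # source row for each output row
    cols = np.arange(size) * w // size   # source column for each output column
    resized = frame[rows][:, cols].astype("float32")
    return resized[np.newaxis, ...]      # add batch dimension
```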

6. “dataset_merger.py” file
To feed the dataset into lstm2.py and make it complete, we must attach the frame number
and action label to the dataset that contains only the 4096 features from vgg_test.py.

Lines of code   Description

6-10            Function that appends an array to the end of a file
13              Loads the custom dataset with custom features (player numbers,
                team owning the ball, coordinates, etc.)
18              Loads the dataset that contains the VGG features
22-30           Iterates through the first dataset, matches the frame numbers,
                creates a new array containing (frame number, action label,
                4096 features) and appends it to the end of the file
32              Prints the time taken to execute the procedure
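The matching loop (lines 22-30) can be sketched as below. This sketch assumes the VGG feature rows are indexed by frame number, which is an illustrative assumption rather than the file's exact layout:

```python
def merge_by_frame(custom_rows, vgg_rows):
    """For each (frame, action, ...) row of the custom dataset, emit
    [frame, action, *vgg_features] when a VGG row exists for that frame."""
    merged = []
    for frame, action, *_ in custom_rows:
        if frame < len(vgg_rows):                      # VGG row for this frame?
            merged.append([frame, action, *list(vgg_rows[frame])])
    return merged
```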

Additional Neural Network models:

7.1 Custom ball detection model:

To train another model for ball detection, enter the following folder:
/home/lag/ball_detection
The dataset for ball detection is located at:
/home/lag/ball_detection/ball_data
To train a new model, execute the following Python file:
moses_object_detection.py

Lines of code   Description

1               Importing libraries
3-4             Choosing the training approach: YOLOv3
5               Selecting the dataset directory
6               Setting the training configuration
7               Executing the training
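The five steps above map roughly onto ImageAI's custom detection trainer. This is a hedged sketch, not the actual contents of moses_object_detection.py: the class name array and hyperparameter values are illustrative, and the exact method names can vary between ImageAI releases.

```python
# Illustrative training sketch based on ImageAI's DetectionModelTrainer;
# object names and hyperparameters below are assumptions, not the project's.
from imageai.Detection.Custom import DetectionModelTrainer

trainer = DetectionModelTrainer()
trainer.setModelTypeAsYOLOv3()            # lines 3-4: training approach YOLOv3
trainer.setDataDirectory(data_directory="/home/lag/ball_detection/ball_data")
trainer.setTrainConfig(object_names_array=["ball"],   # class labeled in LabelImg
                       batch_size=4,
                       num_experiments=100)
trainer.trainModel()                      # line 7: execute training
```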

The dataset was acquired using the LabelImg software. To run it, type:

python3 labelimg

This software is used to label objects in images and save the corresponding labels in
XML files.

7.2 Scene recognition model:

To train another scene recognition model on new data, enter the following folder:
/home/lag/imageAI2

The ImageAI library was used to train the scene recognition models.

The dataset and models for scene recognition are located at:
/home/lag/imageAI2/idenprof

To train a new model, execute the following Python file:
scene_recognition_training.py

Lines of code   Description

1               Importing libraries
3-4             Setting the model type: DenseNet
5               Selecting the dataset directory
6               Setting the training configuration and starting the training
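The steps above map roughly onto ImageAI's custom model trainer. A hedged sketch, not the actual contents of scene_recognition_training.py: the class count and hyperparameters are illustrative, and the trainer's module path and method names differ between ImageAI releases (the ImageAI 2.x naming is assumed here).

```python
# Illustrative training sketch based on ImageAI 2.x's custom ModelTraining;
# num_objects and the other hyperparameters below are assumptions.
from imageai.Prediction.Custom import ModelTraining

trainer = ModelTraining()
trainer.setModelTypeAsDenseNet()          # lines 3-4: model type DenseNet
trainer.setDataDirectory("/home/lag/imageAI2/idenprof")
trainer.trainModel(num_objects=10,        # number of scene classes (assumed)
                   num_experiments=100,   # training epochs
                   batch_size=32)
```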
