Traffic Video Event Retrieval via Text Query using Vehicle Appearance and Motion Attributes

CVPR AI City Challenge 2021 (HCMUS Team)

This repository contains the code for our work at the AI City Challenge, CVPR 2021.

Project organization

Our system contains 4 main modules:

Textual attribute extraction: applies an SRL (semantic role labeling) toolkit to the input text query to extract the color, vehicle type, and action of the target object.
Visual attribute extraction: given the target object's bounding boxes, the Classifier predicts the color and vehicle type and extracts a feature vector; the Detector uses the object tracklets to identify whether the vehicle turns or stops.
Retrieval model: a representation-learning-based model that handles the retrieval task.
Refinement process: refines and produces the final results.
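
To make the textual attribute extraction step more concrete, here is a minimal sketch of how color, vehicle type, and action might be picked out of a single SRL-style frame. It does not call the SRL toolkit; it assumes the query has already been parsed into a subject span and a verb. The vocabularies, the frame layout, and the function name `extract_attributes` are all hypothetical and are not taken from this repository.

```python
# Hypothetical keyword vocabularies (assumptions for illustration only).
COLORS = {"red", "blue", "black", "white", "gray", "silver", "green"}
VEHICLE_TYPES = {"sedan", "suv", "truck", "pickup", "van", "bus", "hatchback"}
ACTIONS = {
    "turn": {"turn", "turns", "turning", "turned"},
    "stop": {"stop", "stops", "stopping", "stopped"},
    "straight": {"go", "goes", "going", "drive", "drives", "driving"},
}


def extract_attributes(subject_tokens, verb):
    """Pick color, vehicle type and action from one SRL-style frame.

    subject_tokens: words of the agent span, e.g. ["A", "red", "sedan"]
    verb: the predicate word of the frame, e.g. "turns"
    Attributes that cannot be found are returned as None.
    """
    words = [t.lower() for t in subject_tokens]
    color = next((w for w in words if w in COLORS), None)
    vehicle = next((w for w in words if w in VEHICLE_TYPES), None)
    action = next(
        (name for name, forms in ACTIONS.items() if verb.lower() in forms),
        None,
    )
    return {"color": color, "vehicle_type": vehicle, "action": action}


# Assumed frame for the query "A red sedan turns left at the intersection."
frame = {"subject": ["A", "red", "sedan"], "verb": "turns"}
attrs = extract_attributes(frame["subject"], frame["verb"])
print(attrs)
```

A fixed keyword lookup like this is only a stand-in; it shows the kind of structured output (color, vehicle type, action) that later stages can compare against the visual attributes.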

Data preparation

Download the challenge dataset and place it in the dataset folder.
Run each module to produce the input data for the next steps, or download the data directly from our Google Drive (Uploading).

Train

To train the whole system from scratch, run each module in the following order:

Textual attribute extraction: srl_extraction, srl_handler
Visual attribute extraction: classifier, detector
Retrieval model: retrieval_model
Refinement process: refinement

In each folder, we also provide a notebook for easy setup and usage.
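
As an illustration of what the detector stage in the order above works with, the following sketch labels a tracklet (a sequence of bounding boxes for one vehicle) as stopping, turning, or going straight. It is not the repository's detector: the box format `(x1, y1, x2, y2)`, the thresholds `stop_dist` and `turn_deg`, and the function names are assumptions chosen for the example.

```python
import math


def box_center(box):
    """Center (x, y) of a box given as (x1, y1, x2, y2)."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)


def classify_motion(boxes, stop_dist=2.0, turn_deg=30.0):
    """Label a tracklet as 'stop', 'turn' or 'straight'.

    stop_dist (pixels) and turn_deg (degrees) are illustrative thresholds.
    """
    centers = [box_center(b) for b in boxes]
    if len(centers) < 3:
        return "straight"  # too short to judge; assumed default

    # Stop: the last three centers barely move.
    tail = centers[-3:]
    tail_travel = sum(math.dist(a, b) for a, b in zip(tail, tail[1:]))
    if tail_travel < stop_dist:
        return "stop"

    # Turn: heading over the first half differs from heading over the second.
    mid = len(centers) // 2
    h_start = math.atan2(centers[mid][1] - centers[0][1],
                         centers[mid][0] - centers[0][0])
    h_end = math.atan2(centers[-1][1] - centers[mid][1],
                       centers[-1][0] - centers[mid][0])
    diff = abs(math.degrees(h_end - h_start))
    diff = min(diff, 360.0 - diff)  # wrap the difference into [0, 180]
    return "turn" if diff > turn_deg else "straight"


# Synthetic tracklets: 4x4 boxes placed at the given top-left corners.
def make_boxes(corners):
    return [(x, y, x + 4, y + 4) for x, y in corners]


straight = make_boxes([(i * 10, 0) for i in range(6)])
turning = make_boxes([(0, 0), (10, 0), (20, 0), (20, 10), (20, 20), (20, 30)])
stopping = make_boxes([(0, 0), (10, 0), (20, 0), (20, 0), (20, 0)])

labels = [classify_motion(t) for t in (straight, turning, stopping)]
print(labels)
```

Comparing the heading of the first and second halves of the track is a deliberately coarse choice; real tracklets are noisy, so smoothing the centers or tuning the thresholds per camera would likely be needed.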

Acknowledgement

The implementation of the Retrieval Model is customized from the great work COOT: Cooperative Hierarchical Transformer for Video-Text Representation Learning.

The Classifier is modified from the well-organized repo EfficientNet PyTorch.

The toolkit used for the SRL extraction step is taken from the AllenNLP library.

Citations

Please consider citing this project in your publications if it helps your research: Uploading

This code is for academic purposes only.

Repository: selab-hcmus/AI_City_2021
