High-level event recognition in unconstrained videos

140Citations
Citations of this article
162Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

The goal of high-level event recognition is to automatically detect complex high-level events in a given video sequence. This is a difficult task especially when videos are captured under unconstrained conditions by non-professionals. Such videos depicting complex events have limited quality control, and therefore, may include severe camera motion, poor lighting, heavy background clutter, and occlusion. However, due to the fast growing popularity of such videos, especially on the Web, solutions to this problem are in high demands and have attracted great interest from researchers. In this paper, we review current technologies for complex event recognition in unconstrained videos. While the existing solutions vary, we identify common key modules and provide detailed descriptions along with some insights for each of them, including extraction and representation of low-level features across different modalities, classification strategies, fusion techniques, etc. Publicly available benchmark datasets, performance metrics, and related research forums are also described. Finally, we discuss promising directions for future research.

References Powered by Scopus

Distinctive image features from scale-invariant keypoints

49958Citations
N/AReaders
Get full text

Histograms of oriented gradients for human detection

30460Citations
N/AReaders
Get full text

Speeded-Up Robust Features (SURF)

12705Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Modeling spatial-Temporal clues in a hybrid deep learning framework for video classification

363Citations
N/AReaders
Get full text

A review on video-based human activity recognition

356Citations
N/AReaders
Get full text

Modeling temporal dynamics and spatial configurations of actions using two-stream Recurrent Neural Networks

342Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Jiang, Y. G., Bhattacharya, S., Chang, S. F., & Shah, M. (2013). High-level event recognition in unconstrained videos. International Journal of Multimedia Information Retrieval, 2(2), 73–101. https://doi.org/10.1007/s13735-012-0024-2

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 93

68%

Researcher 22

16%

Professor / Associate Prof. 14

10%

Lecturer / Post doc 7

5%

Readers' Discipline

Tooltip

Computer Science 107

80%

Engineering 20

15%

Social Sciences 4

3%

Psychology 3

2%

Article Metrics

Tooltip
Social Media
Shares, Likes & Comments: 7

Save time finding and organizing research with Mendeley

Sign up for free