Important Dates

10th Jan 2015: Website online
10th Jan 2015: Registration open
06th Feb 2015: Sample dataset available
~~28th Feb~~ 10th March 2015: Registration closes
1st March 2015: Training dataset available
~~10th~~ 15th March 2015: Validation dataset available
~~20th~~ ~~25th~~ 31st March 2015: Submission of Systems
~~25th~~ 30th April 2015: Test dataset available

Latest News

06th Feb 2015: Sample Dataset available!
10th Jan 2015: Website online
10th Jan 2015: Registration open!

Overview

In multilingual, multi-script countries, news and advertisement videos broadcast on television channels commonly carry text in two or more scripts. The text present in videos plays an important role in automatic video indexing and retrieval. Hence, OCR of multilingual video text is crucial.
The main objective of the competition is to identify scripts from words extracted from video. Different combinations of ten scripts used in India will be considered. The competition provides a platform for researchers around the globe to address the problem.

The competition aims to find generic algorithms and systems for identifying video scripts, irrespective of the scripts being considered. Its general objective is to evaluate recently proposed methods for script identification. The following scripts will be considered:

English (Eng),
Hindi (Hin),
Bengali (Ben),
Oriya (Ori),
Gujarati (Guj),
Punjabi (Pun),
Kannada (Kan),
Tamil (Tam),
Telugu (Tel), and
Arabic (Arb).

The competition will be organized into four different tasks:

Task 1:
Identifying scripts within eight different script triplets (combinations of three scripts, each including English and Hindi), chosen based on their use in the Indian subcontinent.
Task 2:
Identifying the combination of scripts used in north India. This task involves identification of seven scripts, namely English, Hindi, Bengali, Oriya, Gujarati, Punjabi and Arabic.
Task 3:
Identifying the combination of scripts used in south India. This task involves identification of five scripts, namely English, Hindi, Kannada, Tamil and Telugu.
Task 4:
Identifying the combination of all the ten scripts.
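
For participants setting up experiments, the four tasks can be written down as label sets. The following Python sketch is only illustrative: the names `ALL_SCRIPTS`, `TASK1_TRIPLETS`, `TASK_SCRIPTS` and `candidate_labels` are hypothetical, and the Task 1 triplets are an assumption (English and Hindi plus each one of the remaining eight scripts), since the page does not enumerate them.

```python
# Script codes as abbreviated in the script list above.
ALL_SCRIPTS = ["Eng", "Hin", "Ben", "Ori", "Guj", "Pun", "Kan", "Tam", "Tel", "Arb"]

# ASSUMPTION: each Task 1 triplet is English + Hindi + one other script.
# The page only states that there are eight triplets containing both.
TASK1_TRIPLETS = [("Eng", "Hin", s) for s in ALL_SCRIPTS if s not in ("Eng", "Hin")]

# Label sets for Tasks 2 to 4, as listed in the task descriptions.
TASK_SCRIPTS = {
    2: ["Eng", "Hin", "Ben", "Ori", "Guj", "Pun", "Arb"],  # north India
    3: ["Eng", "Hin", "Kan", "Tam", "Tel"],                # south India
    4: list(ALL_SCRIPTS),                                  # all ten scripts
}


def candidate_labels(task, triplet_index=0):
    """Return the script labels a classifier must choose among for a task.

    For Task 1, `triplet_index` selects one of the (assumed) eight triplets.
    """
    if task == 1:
        return list(TASK1_TRIPLETS[triplet_index])
    if task in TASK_SCRIPTS:
        return list(TASK_SCRIPTS[task])
    raise ValueError(f"unknown task: {task}")


if __name__ == "__main__":
    for task in (1, 2, 3, 4):
        print(task, candidate_labels(task))
```

Under this assumption, Task 1 yields exactly eight three-way problems, while Tasks 2, 3 and 4 are single 7-way, 5-way and 10-way classification problems respectively.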

ICDAR 2015 Competition on Video Script Identification (CVSI-2015)