The objective of this project is to design a visual search algorithm (VSA) that looks for an object in a video clip. The VSA takes in an image and a video as input, where the image contains an object of interest (OOF) against a plain background. The VSA then has to search for the OOF in the video clip, and output the frames that contain the object as well as its location in those frames.
This is a recognition challenge, not an identification one. If the input image is a face, for example, the algorithm has to find all faces in the video. If the input image is a bicycle, the algorithm has to find all bicycles.
The VSA’s score will be based on its performance, speed, and visualization capabilities. The performance of the VSA will be assessed through a text file that the VSA will have to output. The speed of the VSA does not include the time it takes to visualize its output.


•    The input image’s resolution will be , and will be in BMP format
•    The object in the input image will be of reasonable complexity (not a simple shape like a beach ball)
•    The object in the input image will be against a white background
•    The input video’s resolution will be , and will be in AVI format
•    The input video will be 10 seconds long, and may be composed of more than one scene

Occurrences of the OOF in the video:

•    Will be the same size or smaller than that of the input image, but not larger
•    Will have the same view as that of the input image, with a maximum angle of affine transformation of
•    Will not have an occlusion of greater than
•    Only one instance of the OOF will occur in any given frame of the video


The VSA should output a text file, output.txt, which contains every occurrence of the OOF in the video as follows:
FN R1 C1 R2 C2


FN is the frame number
R1 is the starting row of the upright rectangle bounding the OOF
C1 is the starting column of the upright rectangle bounding the OOF
R2 is the ending row of the upright rectangle bounding the OOF
C2 is the ending column of the upright rectangle bounding the OOF

For example, assume the figure below on the left is frame number 23 in the video, and the VSA detects the OOF and highlights it as shown. For the purposes of this competition, the VSA should always return the location of an upright rectangle bounding the object, regardless of the affine transformation of the object, as shown on the right. The VSA would add the following line to output.txt:

23 24 30 150 200
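
As a minimal sketch of how one output line might be produced, the Python snippet below takes the four corners of a detected (possibly rotated) object and reduces them to the upright bounding rectangle. The function names and the corner coordinates are hypothetical, chosen only so that the result matches the example line above; integer pixel coordinates given as (row, column) pairs are assumed.

```python
def upright_bounding_box(corners):
    """Return (r1, c1, r2, c2) of the upright rectangle enclosing the corners.

    `corners` is a list of (row, col) integer pairs; this input format is an
    assumption made for illustration, not part of the competition rules.
    """
    rows = [r for r, _ in corners]
    cols = [c for _, c in corners]
    return min(rows), min(cols), max(rows), max(cols)


def format_detection(frame_number, corners):
    """Format one output.txt line as 'FN R1 C1 R2 C2'."""
    r1, c1, r2, c2 = upright_bounding_box(corners)
    return f"{frame_number} {r1} {c1} {r2} {c2}"


# Hypothetical corners of a rotated object detected in frame 23.
corners = [(24, 60), (50, 200), (150, 170), (124, 30)]
print(format_detection(23, corners))  # 23 24 30 150 200
```

Taking the minimum and maximum of the corner coordinates gives the upright rectangle regardless of how the object itself is rotated or skewed.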



Each team must package their VSA as an executable that runs on a Windows machine without needing any other software. If any .dll files are required, the team should supply them as well. It is advisable to compile the visualization as a separate executable because, by the competition's definition, the VSA speed does not include visualization. The input image and input video will be placed in the same directory as the VSA. The input image will be “InImage.bmp”, and the input video will be “InVideo.avi”.

The VSA will be tested using a script that runs as follows:
• Start clock
• Run VSA executable
• Stop clock and record time
• Read output.txt and record for grading
• Run visualization executable
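
The steps above could look roughly like the following Python sketch. It is not the organizers' actual script: the helper names are hypothetical, and the Python interpreter itself stands in for the VSA executable so that the snippet runs anywhere. Elapsed time is rounded to whole milliseconds.

```python
import subprocess
import sys
import time


def run_timed(command):
    """Run a command and return its wall-clock duration in whole milliseconds."""
    start = time.perf_counter()          # start clock
    subprocess.run(command, check=True)  # run the executable
    elapsed = time.perf_counter() - start  # stop clock
    return round(elapsed * 1000)


def parse_detection(line):
    """Parse one 'FN R1 C1 R2 C2' line from output.txt into a tuple of ints."""
    frame_number, r1, c1, r2, c2 = (int(field) for field in line.split())
    return frame_number, r1, c1, r2, c2


# Stand-in for the VSA executable (assumption): a Python process that exits at once.
elapsed_ms = run_timed([sys.executable, "-c", "pass"])
print(f"elapsed: {elapsed_ms} ms")
print(parse_detection("23 24 30 150 200"))
```

Keeping visualization in a separate executable means it never falls inside the timed call.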

Each correctly identified frame earns a true-positive score (TP) that depends on the overlap between the rectangle the VSA specifies and the ground-truth rectangle. The score for a correctly identified frame is:

                  TP = 1 - cos(90° × OL)
                  where OL is:

                                OL = (overlapping area / area of ground-truth rectangle) + (overlapping area / area of VSA-supplied rectangle)

An example is shown in the figure below. Assume the overlapping area is 180 pixels, the ground-truth rectangle covers 220 pixels, and the supplied rectangle covers 260 pixels. The frame's score would be:

                               OL = (180/220) + (180/260) = 1.51

                               TP = 1 - cos(90° × 1.51) = 1.72

[Figure: overlapping pixels]
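
The overlap scoring above can be checked with a short Python sketch. The function names are hypothetical. Rectangles are assumed to be (R1, C1, R2, C2) tuples with inclusive end coordinates, which the original text does not specify. The angle is taken in degrees, consistent with the worked example.

```python
import math


def rect_area(rect):
    """Area in pixels of an (r1, c1, r2, c2) rectangle, ends inclusive (assumption)."""
    r1, c1, r2, c2 = rect
    return (r2 - r1 + 1) * (c2 - c1 + 1)


def overlap_area(a, b):
    """Number of pixels shared by two upright rectangles (0 if they are disjoint)."""
    rows = min(a[2], b[2]) - max(a[0], b[0]) + 1
    cols = min(a[3], b[3]) - max(a[1], b[1]) + 1
    return max(rows, 0) * max(cols, 0)


def tp_score(overlap, truth_area, supplied_area):
    """TP = 1 - cos(90 deg * OL), with OL = overlap/truth + overlap/supplied."""
    ol = overlap / truth_area + overlap / supplied_area
    return 1 - math.cos(math.radians(90 * ol))


# Worked example from the text: 180 overlapping pixels, areas of 220 and 260.
print(round(tp_score(180, 220, 260), 2))  # 1.72

# Scoring directly from two rectangles (hypothetical coordinates).
truth = (0, 0, 9, 9)
supplied = (5, 5, 14, 14)
shared = overlap_area(truth, supplied)
print(round(tp_score(shared, rect_area(truth), rect_area(supplied)), 4))
```

Because OL ranges from 0 (no overlap) to 2 (identical rectangles), TP ranges from 0 to 2 under this formula.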

Therefore, a VSA’s performance score is:

                            P = [50 × (ΣTP - ΣFP - ΣFN / Pmax)]

The timing resolution of the script will be 1ms.

  • Category:- Electrical & Electronics
  • Reference No.:- M9510
