PROJECT SUMMARY:

The objective of this project is to design a visual search algorithm (VSA) that looks for an object in a video clip. The VSA takes an image and a video as input, where the image contains an object of interest (OOF) against a plain background. The VSA then searches for the OOF in the video clip and outputs the frames that contain the object, along with its location in those frames.
This is a recognition challenge, not an identification one. If the input image is a face, for example, the algorithm has to find all faces in the video. If the input image is a bicycle, the algorithm has to find all bicycles.
The VSA’s score will be based on its performance, speed, and visualization capabilities. Performance will be assessed through a text file that the VSA has to output. The measured speed does not include the time the VSA takes to visualize its output.

TECHNICAL PARAMETERS

•    The input image’s resolution will be , and will be in BMP format
•    The object in the input image will be of reasonable complexity (not a simple shape like a beach ball)
•    The object in the input image will be against a white background
•    The input video’s resolution will be , and will be in AVI format
•    The input video will be 10 seconds long, and may be composed of more than one scene

Occurrences of the OOF in the video:

•    Will be the same size or smaller than that of the input image, but not larger
•    Will have the same view as that of the input image, with a maximum angle of affine transformation of
•    Will not have an occlusion of greater than
•    Only one instance of the OOF will occur in any given frame of the video

VSA OUTPUT:

The VSA should output a text file, output.txt, which contains every occurrence of the OOF in the video as follows:
FN R1 C1 R2 C2

Where:

FN is the frame number
R1 is the starting row of the upright rectangle bounding the OOF
C1 is the starting column of the upright rectangle bounding the OOF
R2 is the ending row of the upright rectangle bounding the OOF
C2 is the ending column of the upright rectangle bounding the OOF

For example, assume the figure below on the left is frame number 23 in the video, and the VSA detects the OOF and highlights it as shown. For the purposes of this competition, the VSA should always return the location of an upright rectangle bounding the object, regardless of the affine transformation of the object, as shown on the right. The VSA would then add the following line to output.txt:

23 24 30 150 200
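The output format can be sketched in a few lines of Python. This is only an illustration, not part of the competition rules: the helper names (`upright_bounding_rect`, `format_detection`, `write_output`) and the corner coordinates are hypothetical, and the sketch assumes the detector reports the corners of the (possibly rotated) object as (row, column) pairs.

```python
def upright_bounding_rect(corners):
    """Return (r1, c1, r2, c2) of the upright rectangle enclosing the (row, col) points."""
    rows = [r for r, _ in corners]
    cols = [c for _, c in corners]
    return min(rows), min(cols), max(rows), max(cols)


def format_detection(frame_number, rect):
    """Format one detection as an 'FN R1 C1 R2 C2' line (without the newline)."""
    r1, c1, r2, c2 = rect
    return f"{frame_number} {r1} {c1} {r2} {c2}"


def write_output(detections, path="output.txt"):
    """Write one line per detection; `detections` is a list of (frame_number, rect) pairs."""
    with open(path, "w") as f:
        for frame_number, rect in detections:
            f.write(format_detection(frame_number, rect) + "\n")


# Hypothetical corners of a rotated detection in frame 23, as (row, col) pairs.
corners = [(24, 90), (70, 200), (150, 140), (104, 30)]
rect = upright_bounding_rect(corners)
line = format_detection(23, rect)
print(line)  # 23 24 30 150 200
```

Taking the minimum and maximum row and column over the corners gives the upright rectangle regardless of how the object itself is rotated or sheared.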

[Figure: 152_VSA.jpg]

TESTING PROCEDURE:

Each team must package its VSA as an executable that runs on a Windows machine without needing any other software. If any .dll files are required, the team should supply them as well. It is advisable to compile the visualization as a separate executable because, by the competition's definition, the VSA speed does not include visualization. The input image and input video will be placed in the same directory as the VSA. The input image will be named “InImage.bmp”, and the input video will be named “InVideo.avi”.

The VSA will be tested using a script that runs as follows:
• Start clock
• Run VSA executable
• Stop clock and record time
• Read output.txt and record for grading
• Run visualization executable
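A test script along these lines might look like the following sketch. The actual competition script is not given, so everything here is an assumption: the function names are made up for illustration, the executables are passed in as command lists, and timing is rounded to whole milliseconds.

```python
import subprocess
import time


def time_command(cmd):
    """Run `cmd` to completion and return the elapsed wall-clock time in whole milliseconds."""
    start = time.perf_counter()
    subprocess.run(cmd, check=True)
    elapsed = time.perf_counter() - start
    return round(elapsed * 1000)


def parse_output(text):
    """Parse 'FN R1 C1 R2 C2' lines into a list of five-integer tuples."""
    detections = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) == 5:
            detections.append(tuple(int(x) for x in fields))
    return detections


def run_test(vsa_cmd, vis_cmd, output_path="output.txt"):
    """Time the VSA, read its output for grading, then run the visualization untimed."""
    elapsed_ms = time_command(vsa_cmd)
    with open(output_path) as f:
        detections = parse_output(f.read())
    subprocess.run(vis_cmd, check=True)
    return elapsed_ms, detections


# Parsing the example line from the output specification.
parsed = parse_output("23 24 30 150 200\n")
print(parsed)  # [(23, 24, 30, 150, 200)]
```

With hypothetical executable names, a run could be started as `run_test(["VSA.exe"], ["Visualize.exe"])`. The visualization runs only after the clock has stopped, which matches the rule that speed excludes visualization.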

Performance
Each correctly identified frame earns a true-positive score (TP) that depends on the overlap between the rectangle the VSA specifies and the ground truth rectangle. The score is computed as follows:

                  TP = 1 - cos(90° × OL)
                  where OL is:

                  OL = (overlapping area / area of ground truth rectangle) + (overlapping area / area of VSA-supplied rectangle)

An example is shown in the figure below. Assume the overlapping area is 180 pixels, the ground truth rectangle is 220 pixels, and the supplied rectangle is 260 pixels. The score for the frame would be:

                               OL = (180 / 220) + (180 / 260) = 1.51

                               TP = 1 - cos(90° × 1.51) ≈ 1.72

[Figure: 1195_Overlapping pixels.jpg]
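The worked example can be checked with a short Python sketch. It rests on two assumptions: the cosine argument is in degrees (the only reading that reproduces the 1.72 above), and rectangles are given as (R1, C1, R2, C2) with inclusive end rows and columns. The function names are illustrative only.

```python
import math


def rect_area(rect):
    """Area in pixels of an (r1, c1, r2, c2) rectangle with inclusive ends (assumed)."""
    r1, c1, r2, c2 = rect
    return max(0, r2 - r1 + 1) * max(0, c2 - c1 + 1)


def overlap_area(a, b):
    """Area of the intersection of two upright rectangles, 0 if they do not overlap."""
    r1, c1 = max(a[0], b[0]), max(a[1], b[1])
    r2, c2 = min(a[2], b[2]), min(a[3], b[3])
    return rect_area((r1, c1, r2, c2))


def overlap_ratio(overlap, truth_area, supplied_area):
    """OL = overlap / ground truth area + overlap / supplied area."""
    return overlap / truth_area + overlap / supplied_area


def tp_score(ol):
    """TP = 1 - cos(90 degrees * OL)."""
    return 1 - math.cos(math.radians(90 * ol))


# Numbers from the worked example: 180 overlapping, 220 ground truth, 260 supplied.
ol = overlap_ratio(180, 220, 260)
tp = tp_score(ol)
print(round(ol, 2), round(tp, 2))  # 1.51 1.72
```

A perfect match gives OL = 2 and TP = 2, while no overlap gives OL = 0 and TP = 0, so the score rewards rectangles that are both tight and well placed.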

Therefore, a VSA’s performance score is:

                            P = [50 × (ΣTP - ΣFP - ΣFN / Pmax)]

The timing resolution of the script will be 1 ms.

  • Category:- Electrical & Electronics
  • Reference No.:- M9510