PROJECT SUMMARY:

The objective of this project is to design a visual search algorithm (VSA) that looks for an object in a video clip. The VSA takes in an image and a video as input, where the image contains an object of interest (OOF) against a plain background. The VSA then has to search for the OOF in the video clip, and output the frames that contain the object as well as its location in those frames.
This is a recognition challenge, not an identification one. If the input image is a face, for example, the algorithm has to find all faces in the video. If the input image is a bicycle, the algorithm has to find all bicycles.
The VSA’s score will be based on its performance, speed, and visualization capabilities. The performance of the VSA will be assessed through a text file that the VSA will have to output. The speed of the VSA does not include the time it takes to visualize its output.

TECHNICAL PARAMETERS:

•    The input image’s resolution will be , and will be in BMP format
•    The object in the input image will be of reasonable complexity (not a simple shape like a beach ball)
•    The object in the input image will be against a white background
•    The input video’s resolution will be , and will be in AVI format
•    The input video will be 10 seconds long, and may be composed of more than one scene

Occurrences of the OOF in the video:

•    Will be the same size or smaller than that of the input image, but not larger
•    Will have the same view as that of the input image, with a maximum angle of affine transformation of
•    Will not have an occlusion of greater than
•    Only one instance of the OOF will occur in any given frame of the video

VSA OUTPUT:

The VSA should output a text file, output.txt, which contains every occurrence of the OOF in the video as follows:
FN R1 C1 R2 C2

Where:

FN is the frame number
R1 is the starting row of the upright rectangle bounding the OOF
C1 is the starting column of the upright rectangle bounding the OOF
R2 is the ending row of the upright rectangle bounding the OOF
C2 is the ending column of the upright rectangle bounding the OOF
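The line format above can be sketched in Python as follows. This is only an illustration, not part of the competition brief: the function names `format_detection` and `write_output` are hypothetical, and the sketch assumes one line per detection, written in the order the detections are supplied.

```python
from typing import Iterable, Tuple

# One detection: (FN, R1, C1, R2, C2), all integers.
Detection = Tuple[int, int, int, int, int]


def format_detection(fn: int, r1: int, c1: int, r2: int, c2: int) -> str:
    """Return one output.txt line in the order FN R1 C1 R2 C2."""
    return f"{fn} {r1} {c1} {r2} {c2}"


def write_output(detections: Iterable[Detection], path: str = "output.txt") -> None:
    """Write one line per detection (hypothetical helper)."""
    with open(path, "w") as fh:
        for det in detections:
            fh.write(format_detection(*det) + "\n")
```

For instance, `format_detection(23, 24, 30, 150, 200)` returns the string `23 24 30 150 200`.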

For example, assume the figure below on the left is frame number 23 in the video, and the VSA detects the OOF and highlights it as shown. For the purposes of this competition, the VSA should always return the location of an upright rectangle bounding the object, regardless of the affine transformation of the object, as shown on the right. The VSA would add the following line to output.txt:

23 24 30 150 200
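One way to obtain the upright rectangle from a transformed object is to take the minimum and maximum row and column over the object's corner points. The sketch below assumes the detector yields the four corners as (row, column) pairs. The helper name `upright_bounding_rect`, the corner values, and the choice to round outward are all illustrative assumptions.

```python
import math
from typing import Iterable, Tuple


def upright_bounding_rect(
    corners: Iterable[Tuple[float, float]]
) -> Tuple[int, int, int, int]:
    """Return (R1, C1, R2, C2) of the axis-aligned box enclosing the corners.

    Rounds outward (floor the start, ceil the end) so the box always
    contains every corner; this rounding rule is an assumption.
    """
    pts = list(corners)
    rows = [r for r, _ in pts]
    cols = [c for _, c in pts]
    return (
        math.floor(min(rows)),
        math.floor(min(cols)),
        math.ceil(max(rows)),
        math.ceil(max(cols)),
    )


# Hypothetical corners of a rotated object in frame 23, as (row, col).
corners = [(24.0, 110.0), (60.0, 200.0), (150.0, 120.0), (114.0, 30.0)]
r1, c1, r2, c2 = upright_bounding_rect(corners)
line = f"23 {r1} {c1} {r2} {c2}"  # "23 24 30 150 200"
```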

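[Figure: 152_VSA.jpg. Left: frame 23 with the detected OOF highlighted. Right: the upright rectangle bounding the OOF.]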

TESTING PROCEDURE:

Each team must package its VSA as an executable that runs on a Windows machine without requiring any other software. If any .dll files are required, the team must supply them as well. Because the competition defines VSA speed as excluding visualization, it is advisable to compile the visualization as a separate executable. The input image and input video will be placed in the same directory as the VSA. The input image will be named “InImage.bmp” and the input video will be named “InVideo.avi”.

The VSA will be tested using a script that runs as follows:
• Start clock
• Run VSA executable
• Stop clock and record time
• Read output.txt and record for grading
• Run visualization executable
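Teams may want to rehearse these steps locally before submitting. The following is a rough sketch of such a harness, not the organizers' actual script: the function names are hypothetical, and the commands passed to `run_test_script` stand in for whatever the team's executables are called.

```python
import subprocess
import time
from typing import List, Sequence, Tuple


def time_command(cmd: Sequence[str]) -> int:
    """Run a command and return its wall-clock time in whole milliseconds."""
    start = time.perf_counter()
    subprocess.run(list(cmd), check=True)
    return round((time.perf_counter() - start) * 1000)


def parse_output(text: str) -> List[Tuple[int, ...]]:
    """Parse output.txt content into (FN, R1, C1, R2, C2) tuples."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        if not fields:
            continue  # tolerate blank lines
        if len(fields) != 5:
            raise ValueError(f"expected 5 fields, got: {line!r}")
        rows.append(tuple(int(f) for f in fields))
    return rows


def run_test_script(vsa_cmd: Sequence[str], vis_cmd: Sequence[str]):
    """Time the VSA, read output.txt, then run the visualization untimed."""
    elapsed_ms = time_command(vsa_cmd)
    with open("output.txt") as fh:
        detections = parse_output(fh.read())
    subprocess.run(list(vis_cmd), check=True)
    return elapsed_ms, detections
```

Only the first command is timed, which mirrors the rule that visualization does not count toward the VSA's speed.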

PERFORMANCE:
Each correctly identified frame earns a true positive score (TP) that depends on the overlap between the rectangle the VSA specifies and the ground truth rectangle. The score for a correctly identified frame is computed as follows:

                  TP = 1 - cos(90° × OL)
                  Where OL is:

                                OL = (overlapping area / area of ground truth rectangle) + (overlapping area / area of VSA-supplied rectangle)

An example is shown in the figure below. Assume the overlapping area is 180 pixels, the ground truth rectangle covers 220 pixels, and the supplied rectangle covers 260 pixels. The score for the frame would be:

                               OL = (180/220) + (180/260) = 1.51

                               TP = 1 - cos(90° × 1.51) = 1.72
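The worked example can be checked with a few lines of Python. This sketch assumes the angle is in degrees (which is what reproduces the 1.72 figure) and that rectangles use inclusive pixel coordinates in the order (R1, C1, R2, C2). The function names are illustrative and do not come from the brief.

```python
import math
from typing import Tuple

Rect = Tuple[int, int, int, int]  # (R1, C1, R2, C2), inclusive (assumption)


def rect_area(rect: Rect) -> int:
    """Pixel count of an upright rectangle with inclusive bounds."""
    r1, c1, r2, c2 = rect
    return (r2 - r1 + 1) * (c2 - c1 + 1)


def intersection_area(a: Rect, b: Rect) -> int:
    """Pixel count shared by two upright rectangles (0 if disjoint)."""
    rows = min(a[2], b[2]) - max(a[0], b[0]) + 1
    cols = min(a[3], b[3]) - max(a[1], b[1]) + 1
    return max(rows, 0) * max(cols, 0)


def overlap_score(overlap: float, truth: float, supplied: float) -> Tuple[float, float]:
    """Return (OL, TP) with TP = 1 - cos(90 degrees * OL)."""
    ol = overlap / truth + overlap / supplied
    tp = 1.0 - math.cos(math.radians(90.0 * ol))
    return ol, tp


ol, tp = overlap_score(180, 220, 260)
print(f"OL = {ol:.2f}, TP = {tp:.2f}")  # OL = 1.51, TP = 1.72
```

With this formula, a perfect match gives OL = 2 and therefore TP = 2, the largest score a single frame can earn.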

[Figure: 1195_Overlapping pixels.jpg. Overlap between the ground truth rectangle and the VSA-supplied rectangle.]

Therefore, a VSA’s performance score is:

                            P = [50 * (ΣTP - ΣFP - ΣFN / P max)]

The timing resolution of the script will be 1 ms.
