Ask Question, Ask an Expert

+1-415-315-9853

info@mywordsolution.com

Ask Electrical & Electronics Expert

PROJECT SUMMARY:

The objective of this project is to design a visual search algorithm (VSA) that looks for an object in a video clip. The VSA takes in an image and a video as input, where the image contains an object of interest (OOF) against a plain background. The VSA then has to search for the OOF in the video clip, and output the frames that contain the object as well as its location in those frames.
This is a recognition challenge, not an identification one. So if the input image is a face for ex, the algorithm has to find all faces in the video. If the input image is a bicycle, the algorithm has to find all bicycles.
The VSA’s score will be based on its performance, speed, and visualization capabilities. The performance of the VSA will be assessed through a text file that the VSA will have to output. The speed of the VSA does not include the time it takes to visualize its output.

TECHNICAL PARAMETERS

•    The input image’s resolution will be , and will be in BMP format
•    The object in the input image will be of reasonable complexity (not a simple shape like a beach ball)
•    The object in the input image will be against a white background
•    The input video’s resolution will be , and will be in AVI format
•    The input video will be 10 seconds long, and may be composed of more than one scene

Occurrences of the OOF in the video:

•    Will be the same size or smaller than that of the input image, but not larger
•    Will have the same view as that of the input image, with a maximum angle of affine transformation of
•    Will not have an occlusion of greater than
•    Only one instance of the OOF will occur in any given frame of the video

VSA OUTPUT:

The VSA should output a text file, output.txt, which contains every occurrence of the OOF in the video as follows:
CN R1 C1 R2 C2

Where:

FN is the frame number
R1 is the starting row of the upright rectangle bounding the OOF
C1 is the starting column of the upright rectangle bounding the OOF
R2 is the ending row of the upright rectangle bounding the OOF
C2 is the ending column of the upright rectangle bounding the OOF

For ex, assume the below figure on the left is frame number 23 in the video, and the VSA detects the OOF and highlights it as below. For the purposes of this competition, the VSA should always return the location of an upright rectangle bounding the object, regardless of the affine transformation of the object, as shown on the right. The VSA would add to output.txt the following line:

23 24 30 150 200

152_VSA.jpg

TESTING PROCEDURE:

Each team must package their VSA as an executable that runs on a windows machine without needing any other software to run. If any .dll files are required, then these files should be supplied by the team as well. It is advisable to compile the visualization as a separate executable as by competition definition the VSA speed does not include visualization. The input image and input video will be placed in the same directory as the VSA. The input image will be “InImage.bmp”, and the input video will be “InVideo.avi”

The VSA will be tested using a script that runs as follows:
• Start clock
• Run VSA executable
• Stop clock and record time
• Read output.txt and record for grading
• Run visualization executable

Performance
For each correctly identified frame, depending on the % overlap between the rectangle they specify and the ground truth rectangle. The correctly identified frame is as follows:

                  TP = 1- cos(90* OL)
                  Where OL is:

                                OL= (overlapping area/ area of ground truth table)+ (overlapping area/area of VSA suppliedarea)

An ex is shown below in figure. Assume total number of overlapping pixels is 180, the ground truth rectangle is 220 and the supplied rectangle is 260 pixels. The frame would be:

                               OL= (180/220) +(180/260) =1.51

                               TP=1- cos(90*1.51 ) 1.72

1195_Overlapping pixels.jpg

Therefore, a VSA’s performance score is:

                            P= [50* (ΣTP- ΣFP- ΣFN/P max)]

The timing resolution of the script will be 1ms.

Electrical & Electronics, Engineering

  • Category:- Electrical & Electronics
  • Reference No.:- M9510

Have any Question? 


Related Questions in Electrical & Electronics

1 what are the values of the radiation resistance and the

1. What are the values of the radiation resistance and the directivity for a half-wave dipole? 2. What is an antenna array? 3. Justify the approximations involved in the determination of the resultant field of an array o ...

1 state and briefly discuss the basic definition of the

1. State and briefly discuss the basic definition of the curl of a vector. 2. What is a curl meter? How does it help visualize the behavior of the curl of a vector field? 3. Provide two examples of physical phenomena in ...

1 derive the energy form and its linearization of a

1. Derive the energy form and its linearization of a Mooney-Rivlin hyperelastic material using the perturbed Lagrangian method. Use a mixed variable r = [u T , p] T . 2. Derive the 3 X 3 [D] matrix in Eq. (3.147) for a t ...

In a switch using the token bucket algorithm tokens are

In a switch using the token bucket algorithm, tokens are added to the bucket at a rate of r = 5 tokens/second. The capacity of the token bucket is c = 10. The switch has a buffer that can hold only eight packets (for the ...

1 if a communication is unicast how can we use rsvp which

1. If a communication is unicast, how can we use RSVP, which is designed for multicast in IntServ? 2. How can multicasting be achieved using DiffServ? 3. Why do we need Path and Resv messages in RSVP? 4. How many per-hop ...

1 a 15 v aa battery that costs 1 is rated at 18 ah what is

1. A 1.5 V AA battery that costs $1 is rated at 1.8 Ah. What is its cost per kWh? 2. Suppose a 12-V battery bank rated at 200 Ah under standard conditions needs to deliver 600 Wh over a 12-h period each day. If they oper ...

A sketch x omega the amplitude spectrum of a signal x t 3

a. Sketch |X (ω)|, the amplitude spectrum of a signal x (t) = 3 cos 6πt + sin 18πt + 2 cos(28 - ∉)πt, where ∉ is a very small number → 0. Determine the minimum sampling rate required to be able to reconstruct x (t) from ...

1 if the plate shown is inclined at an angle as shown what

1. If the plate shown is inclined at an angle as shown, what are the forces F x  and F y  necessary to maintain its position? The flow is frictionless. 2. A steady, incompressible, frictionless, two-dimensional jet of fl ...

A sinusoid cosomega0t is a bandpass signal with zero

A sinusoid cos(ω0t) is a bandpass signal with zero bandwidth. This implies that the sampling rate that will allow reconstruction of this signal from its samples can be arbitrarily small. Show that this is indeed the case ...

1 what is a poynting vector what is its physical

1. What is a Poynting vector? What is its physical significance? 2. What is the physical interpretation of the surface integral of the Poynting vector over a closed surface? 3. Discuss how the fields far from a physical ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

Section onea in an atwood machine suppose two objects of

SECTION ONE (a) In an Atwood Machine, suppose two objects of unequal mass are hung vertically over a frictionless

Part 1you work in hr for a company that operates a factory

Part 1: You work in HR for a company that operates a factory manufacturing fiberglass. There are several hundred empl

Details on advanced accounting paperthis paper is intended

DETAILS ON ADVANCED ACCOUNTING PAPER This paper is intended for students to apply the theoretical knowledge around ac

Create a provider database and related reports and queries

Create a provider database and related reports and queries to capture contact information for potential PC component pro

Describe what you learned about the impact of economic

Describe what you learned about the impact of economic, social, and demographic trends affecting the US labor environmen