Ask Question, Ask an Expert

+1-415-315-9853

info@mywordsolution.com

Ask Electrical & Electronics Expert

PROJECT SUMMARY:

The objective of this project is to design a visual search algorithm (VSA) that looks for an object in a video clip. The VSA takes in an image and a video as input, where the image contains an object of interest (OOF) against a plain background. The VSA then has to search for the OOF in the video clip, and output the frames that contain the object as well as its location in those frames.
This is a recognition challenge, not an identification one. So if the input image is a face for ex, the algorithm has to find all faces in the video. If the input image is a bicycle, the algorithm has to find all bicycles.
The VSA’s score will be based on its performance, speed, and visualization capabilities. The performance of the VSA will be assessed through a text file that the VSA will have to output. The speed of the VSA does not include the time it takes to visualize its output.

TECHNICAL PARAMETERS

•    The input image’s resolution will be , and will be in BMP format
•    The object in the input image will be of reasonable complexity (not a simple shape like a beach ball)
•    The object in the input image will be against a white background
•    The input video’s resolution will be , and will be in AVI format
•    The input video will be 10 seconds long, and may be composed of more than one scene

Occurrences of the OOF in the video:

•    Will be the same size or smaller than that of the input image, but not larger
•    Will have the same view as that of the input image, with a maximum angle of affine transformation of
•    Will not have an occlusion of greater than
•    Only one instance of the OOF will occur in any given frame of the video

VSA OUTPUT:

The VSA should output a text file, output.txt, which contains every occurrence of the OOF in the video as follows:
CN R1 C1 R2 C2

Where:

FN is the frame number
R1 is the starting row of the upright rectangle bounding the OOF
C1 is the starting column of the upright rectangle bounding the OOF
R2 is the ending row of the upright rectangle bounding the OOF
C2 is the ending column of the upright rectangle bounding the OOF

For ex, assume the below figure on the left is frame number 23 in the video, and the VSA detects the OOF and highlights it as below. For the purposes of this competition, the VSA should always return the location of an upright rectangle bounding the object, regardless of the affine transformation of the object, as shown on the right. The VSA would add to output.txt the following line:

23 24 30 150 200

152_VSA.jpg

TESTING PROCEDURE:

Each team must package their VSA as an executable that runs on a windows machine without needing any other software to run. If any .dll files are required, then these files should be supplied by the team as well. It is advisable to compile the visualization as a separate executable as by competition definition the VSA speed does not include visualization. The input image and input video will be placed in the same directory as the VSA. The input image will be “InImage.bmp”, and the input video will be “InVideo.avi”

The VSA will be tested using a script that runs as follows:
• Start clock
• Run VSA executable
• Stop clock and record time
• Read output.txt and record for grading
• Run visualization executable

Performance
For each correctly identified frame, depending on the % overlap between the rectangle they specify and the ground truth rectangle. The correctly identified frame is as follows:

                  TP = 1- cos(90* OL)
                  Where OL is:

                                OL= (overlapping area/ area of ground truth table)+ (overlapping area/area of VSA suppliedarea)

An ex is shown below in figure. Assume total number of overlapping pixels is 180, the ground truth rectangle is 220 and the supplied rectangle is 260 pixels. The frame would be:

                               OL= (180/220) +(180/260) =1.51

                               TP=1- cos(90*1.51 ) 1.72

1195_Overlapping pixels.jpg

Therefore, a VSA’s performance score is:

                            P= [50* (ΣTP- ΣFP- ΣFN/P max)]

The timing resolution of the script will be 1ms.

Electrical & Electronics, Engineering

  • Category:- Electrical & Electronics
  • Reference No.:- M9510

Have any Question? 


Related Questions in Electrical & Electronics

1 in pastry assume the address space is 16 and that b 2

1. In Pastry, assume the address space is 16 and that b = 2. How many digits are in an address space? List some of the identifiers. 2. In a Pastry network with m = 32 and b = 4, what is the size of the routing table and ...

The key in des is 56 bits assume eve the intruder tries to

The key in DES is 56 bits. Assume Eve, the intruder, tries to find the key using a brute-force attack (tries all of the keys one by one). If she can try one million keys (almost 220) in each second (using a powerful comp ...

1 what is the criterion for a material to be a good

1. What is the criterion for a material to be a good conductor? 2. Give two examples of materials that behave as good conductors for frequencies of up to several gigahertz. 3. What is skin effect? Discuss skin depth, giv ...

Determine the nyquist sampling rate and the nyquist

Determine the Nyquist sampling rate and the Nyquist sampling interval for the signals a. sinc 2 (100πt) b. 0.01 sinc 2 (100πt) c. sinc(100πt) + 3 sinc 2 (60πt) d. sinc(50πt) sinc (100πt)

Consider a bar element as shown in the figure the

Consider a bar element as shown in the figure. The cross-sectional areas are A 1  and A 2  at nodes 1 and 2, respectively, and vary linearly. In addition, the gravitational acceleration is applied along the axial directi ...

Given an rtp packet with the first 8 hexadecimal digits as

Given an RTP packet with the first 8 hexadecimal digits as (86032132)16, answer the following questions: a. What is the version of the RTP protocol? b. Is there any padding for security? c. Is there any extension header? ...

1 how would you use amperes circuital law in differential

1. How would you use Ampere's circuital law in differential form to find the magnetic field adjacent to the current sheet? 2. If the current density on the infinite plane current sheet of Figure 4.2 were directed in the ...

Referring to the figure assume the flow to be frictionless

Referring to the figure, assume the flow to be frictionless in the siphon. Find the rate of discharge in cubic feet per second, and the pressure head at B if the pipe has a uniform diameter of 1 in. How long will it take ...

1 what are the two types of firewalls2 what is a vpn and

1. What are the two types of firewalls? 2. What is a VPN and why is it needed? 3. How do LANs on a fully private internet communicate? 4. Host A and host B use IPSec in the transport mode. Can we say that the two hosts n ...

Does the velocity distribution in example satisfy

Does the velocity distribution in Example satisfy continuity? Example A rotating shaft, as illustrated in Figure, causes the fluid to move in circular streamlines with a velocity that is inversely proportional to the dis ...

  • 4,153,160 Questions Asked
  • 13,132 Experts
  • 2,558,936 Questions Answered

Ask Experts for help!!

Looking for Assignment Help?

Start excelling in your Courses, Get help with Assignment

Write us your full requirement for evaluation and you will receive response within 20 minutes turnaround time.

Ask Now Help with Problems, Get a Best Answer

WalMart Identification of theory and critical discussion

Drawing on the prescribed text and/or relevant academic literature, produce a paper which discusses the nature of group

Section onea in an atwood machine suppose two objects of

SECTION ONE (a) In an Atwood Machine, suppose two objects of unequal mass are hung vertically over a frictionless

Part 1you work in hr for a company that operates a factory

Part 1: You work in HR for a company that operates a factory manufacturing fiberglass. There are several hundred empl

Details on advanced accounting paperthis paper is intended

DETAILS ON ADVANCED ACCOUNTING PAPER This paper is intended for students to apply the theoretical knowledge around ac

Create a provider database and related reports and queries

Create a provider database and related reports and queries to capture contact information for potential PC component pro