TU Wien: Computer Vision VU (Sablatnig) / Exam 2021-03-12
Image Classification and Machine Learning
True/False Questions
- On average, images of man-made environments have more vertical and horizontal edges than images of natural environments.
- The bias and variance of a classifier are independent of each other.
- One way of adding spatial information to the bag of words model is spatial pyramid matching.
- Regularization in machine learning means to reduce the number of training samples.
- Training of neural networks is only possible with linear activation functions.
- The main advantage of the bag of words model is that it does not need training images.
SIFT
(A figure illustrating the SIFT algorithm, taken from the lecture slides, is given.)
The following picture illustrates the process of computing SIFT keypoint descriptors.
- What is shown on the left? What do the green grid, black arrows and blue circle mean?
- The descriptor is illustrated on the right. What do the green grid and black arrows mean? What is the dimensionality of this descriptor?
3D Reconstruction
- What is Structure from Motion?
- Given a fundamental matrix relating images 1 and 2 and a point x in image 1, what can be said about the corresponding point in image 2?
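As a study aid for the fundamental matrix question, here is a minimal sketch of the epipolar constraint. It assumes the common convention x'^T F x = 0, with x in image 1 and x' in image 2, so that the corresponding point must lie on the epipolar line l' = F x. The function names and the example matrix (an idealised rectified stereo pair with pure horizontal translation) are illustrative assumptions, not taken from the exam.

```python
def epipolar_line(F, x):
    """Return the epipolar line l' = F x in image 2 as (a, b, c),
    meaning a*u' + b*v' + c = 0, for a homogeneous point x = (u, v, 1)."""
    return tuple(sum(F[r][c] * x[c] for c in range(3)) for r in range(3))


def lies_on_line(line, point, tol=1e-9):
    """Check the epipolar constraint x'^T l' = 0 for a homogeneous point."""
    a, b, c = line
    u, v, w = point
    return abs(a * u + b * v + c * w) <= tol


# Assumed example: F of an idealised rectified stereo pair
# (cross-product matrix of the translation direction (1, 0, 0)).
F_rectified = [
    [0.0, 0.0, 0.0],
    [0.0, 0.0, -1.0],
    [0.0, 1.0, 0.0],
]

x1 = (120.0, 45.0, 1.0)            # point in image 1
line2 = epipolar_line(F_rectified, x1)

# Any candidate in image 2 on the same row satisfies the constraint,
# a candidate on a different row does not.
same_row = lies_on_line(line2, (300.0, 45.0, 1.0))
other_row = lies_on_line(line2, (300.0, 46.0, 1.0))
```

In this assumed setup the line is the image row v' = 45, so F alone restricts the match to a line but does not fix its position along that line.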
Image Acquisition
- Describe the principle of lenses (with a sketch) and state the "thin lens" law.
- Continuing from the question above, explain depth of field.
- What are the internal camera parameters (sketch) and what influence do they have?
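For revising the thin lens and depth of field questions, the following sketch evaluates the thin lens law 1/f = 1/d_o + 1/d_i and the blur circle that appears when an object is not at the focused distance. The blur formula follows from similar triangles on the converging light cone. All names and the numeric values (a 50 mm lens, distances in mm) are assumptions chosen for illustration.

```python
def image_distance(focal_length, object_distance):
    """Thin lens law 1/f = 1/d_o + 1/d_i, solved for the image distance d_i."""
    return 1.0 / (1.0 / focal_length - 1.0 / object_distance)


def blur_circle_diameter(focal_length, aperture_diameter,
                         focus_distance, object_distance):
    """Diameter of the blur circle on the sensor for an object at
    object_distance when the lens is focused at focus_distance."""
    sensor_distance = image_distance(focal_length, focus_distance)
    d_i = image_distance(focal_length, object_distance)
    # Similar triangles: the cone of width aperture_diameter converges at d_i.
    return aperture_diameter * abs(sensor_distance - d_i) / d_i


# Assumed example: 50 mm lens focused at 1 m, object at 2 m.
d_i_focus = image_distance(50.0, 1000.0)
blur_wide = blur_circle_diameter(50.0, 25.0, 1000.0, 2000.0)    # f/2
blur_narrow = blur_circle_diameter(50.0, 6.25, 1000.0, 2000.0)  # f/8
blur_in_focus = blur_circle_diameter(50.0, 25.0, 1000.0, 1000.0)
```

A smaller aperture gives a smaller blur circle for the same defocus, which is why stopping down increases the depth of field.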
Plenoptic function
- What is described by the plenoptic function?
- Why does a sheet of paper in a scene not reflect an image of the scene? What simple construction can be used to let an image of a scene appear on a sheet of paper?
Image Pyramids
- How are Gaussian Pyramids generated and what is their frequency composition on the different pyramid levels?
- Following from the question above, what is the frequency composition of a Laplacian pyramid?
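A possible way to explore the pyramid questions is the 1D sketch below. It assumes the usual construction: each Gaussian level is the previous one blurred with a binomial kernel and subsampled by 2, so it keeps only the lower frequencies; each Laplacian level is the difference between a Gaussian level and the upsampled next coarser level, so it holds one frequency band, while the last level is the low-pass residual. The kernel, the replicated borders and the nearest-neighbour upsampling are simplifying assumptions.

```python
KERNEL = (1 / 16, 4 / 16, 6 / 16, 4 / 16, 1 / 16)  # binomial approximation of a Gaussian


def blur(signal):
    """Low-pass filter with replicated borders."""
    n = len(signal)
    out = []
    for i in range(n):
        acc = 0.0
        for offset, weight in zip(range(-2, 3), KERNEL):
            j = min(max(i + offset, 0), n - 1)
            acc += weight * signal[j]
        out.append(acc)
    return out


def gaussian_pyramid(signal, levels):
    """Each level: blur the previous level, then keep every second sample."""
    pyramid = [list(signal)]
    for _ in range(levels - 1):
        pyramid.append(blur(pyramid[-1])[::2])
    return pyramid


def upsample(signal, length):
    """Nearest-neighbour upsampling by 2 (a deliberately simple choice)."""
    return [signal[min(i // 2, len(signal) - 1)] for i in range(length)]


def laplacian_pyramid(gaussian):
    """Band-pass levels plus the coarsest Gaussian level as residual."""
    bands = [[a - b for a, b in zip(fine, upsample(coarse, len(fine)))]
             for fine, coarse in zip(gaussian, gaussian[1:])]
    return bands + [gaussian[-1]]


def reconstruct(laplacian):
    """Invert the Laplacian pyramid by adding the bands back, coarse to fine."""
    current = laplacian[-1]
    for band in reversed(laplacian[:-1]):
        current = [d + u for d, u in zip(band, upsample(current, len(band)))]
    return current


signal = [float(i % 4) for i in range(16)]
gauss = gaussian_pyramid(signal, 3)
lap = laplacian_pyramid(gauss)
restored = reconstruct(lap)

# A constant signal contains only the zero frequency, so its bands vanish.
flat_lap = laplacian_pyramid(gaussian_pyramid([5.0] * 16, 3))
```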
RANSAC/Image Stitching
- Describe how RANSAC can be used to determine the correct matches and the transformation between two views for image stitching.
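The RANSAC loop asked about here can be sketched as follows. To keep the example short it assumes a pure 2D translation between the views, whose minimal sample is a single match; real stitching typically estimates a homography from four matches, but the loop (sample, fit, count inliers, keep the best, refit on the inliers) has the same shape. Function names, the threshold and the synthetic matches are illustrative assumptions.

```python
import random


def ransac_translation(matches, threshold=2.0, iterations=100, seed=0):
    """Estimate a translation (tx, ty) from point matches that contain outliers.

    matches: list of ((x1, y1), (x2, y2)) pairs between image 1 and image 2.
    Returns the refined translation and the list of inlier matches.
    """
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        # 1. Draw a minimal sample and fit the model to it.
        (x1, y1), (x2, y2) = rng.choice(matches)
        tx, ty = x2 - x1, y2 - y1
        # 2. Count the matches that agree with this hypothesis.
        inliers = [
            (a, b) for a, b in matches
            if ((a[0] + tx - b[0]) ** 2 + (a[1] + ty - b[1]) ** 2) ** 0.5 <= threshold
        ]
        # 3. Keep the hypothesis with the largest consensus set.
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    # 4. Refit on all inliers (here: the mean displacement).
    n = len(best_inliers)
    tx = sum(b[0] - a[0] for a, b in best_inliers) / n
    ty = sum(b[1] - a[1] for a, b in best_inliers) / n
    return (tx, ty), best_inliers


# Synthetic data: 8 correct matches shifted by (10, -5) and 3 wrong matches.
points = [(0, 0), (3, 7), (12, 4), (20, 15), (8, 30), (25, 2), (14, 22), (30, 30)]
true_matches = [((float(x), float(y)), (x + 10.0, y - 5.0)) for x, y in points]
false_matches = [((5.0, 5.0), (40.0, 60.0)),
                 ((18.0, 9.0), (2.0, 70.0)),
                 ((27.0, 21.0), (90.0, 3.0))]
matches = true_matches + false_matches

translation, inliers = ransac_translation(matches)
```

The inlier set answers the "correct matches" part of the question, and the refitted model answers the "transformation" part.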
Scene Understanding
- What are Scene Emergent Features?
- What information is contained in the magnitudes of a Fourier spectrum and how can this information be used to differentiate between man-made and natural scenes?
Harris Corner Detection
- How is the "cornerness" of an image pixel computed by the Harris corner detection algorithm?
- How are the final corner points detected from the cornerness?
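Both Harris questions can be tried out with the sketch below. It assumes the standard response R = det(M) - k * trace(M)^2, where M sums the gradient products Ix^2, Iy^2 and Ix*Iy over a window, followed by thresholding and non-maximum suppression. The unweighted 3x3 window (instead of a Gaussian), the central-difference gradients, k = 0.04 and the test image are simplifying assumptions.

```python
def harris_response(img, k=0.04):
    """Cornerness R = det(M) - k * trace(M)^2 for every interior pixel."""
    h, w = len(img), len(img[0])
    # Central-difference gradients; border pixels are left at zero.
    Ix = [[0.0] * w for _ in range(h)]
    Iy = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            Ix[y][x] = (img[y][x + 1] - img[y][x - 1]) / 2.0
            Iy[y][x] = (img[y + 1][x] - img[y - 1][x]) / 2.0
    R = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            # Entries of the structure tensor M, summed over a 3x3 window.
            sxx = syy = sxy = 0.0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    gx, gy = Ix[y + dy][x + dx], Iy[y + dy][x + dx]
                    sxx += gx * gx
                    syy += gy * gy
                    sxy += gx * gy
            R[y][x] = (sxx * syy - sxy * sxy) - k * (sxx + syy) ** 2
    return R


def find_corners(R, threshold):
    """Keep pixels whose response exceeds the threshold and is the
    maximum of its 3x3 neighbourhood (non-maximum suppression)."""
    h, w = len(R), len(R[0])
    corners = []
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            neighbourhood = [R[y + dy][x + dx]
                             for dy in (-1, 0, 1) for dx in (-1, 0, 1)]
            if R[y][x] > threshold and R[y][x] == max(neighbourhood):
                corners.append((y, x))
    return corners


# Test image: dark background with a bright block whose corner is at (4, 4).
image = [[1.0 if (y >= 4 and x >= 4) else 0.0 for x in range(9)]
         for y in range(9)]
R = harris_response(image)
corners = find_corners(R, threshold=0.1)
```

In this example the response is positive at the corner, negative along the straight edge (for instance at row 6, column 4) and zero in flat regions, which is the behaviour the thresholding step relies on.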
Deep Learning
- What is the difference between the "classical" machine learning pipeline for image recognition and the new deep learning methods?
- Explain the concept and usage of auto-encoders.