[CS5670] Lecture 0: Intro to Computer Vision

티스토리 뷰

AI/Computer Vision

[CS5670] Lecture 0: Intro to Computer Vision

Suyeon Cha 2022. 5. 19. 00:36

728x90

What is computer vision?
Why study computer vision?
Course overview
Images & Image filtering [time permitting]

1. What is computer vision?

Goal of computer vision:

percieve the story behind the picture

Compute properties of the world

3D shape
Names of people or objects
What happened?

Can computers match human perception?

Yes and No (maninly no)
- computers can be better at 'easy' things
- humans are better at 'hard' things
But huge progress
- Accelerating in the last five years due to deep learning
- What is considered 'hard' keeps chaning

Human perception has its shortcomings

But humans can tell a lot about from a little information...

The goal of computer vision

Compute the 3D shape of the world
Recognize objects and people
'Enhance' images
Forensics
Improve photos ('Computational Photography')
- Super-resolution
- Low-light photography
- Depth of field on cell phone camera
- Inpainting / image completion

2. Why study computer vision?

Bilions of images/videos captured per day
Huge number of potential applications
The next slide show the current state of the art(SOTA)

Optical character recognition (OCR)
Face detection (Nearly all cameras detect faces in real time (Why?)
Face analysis and recognition
Vision-based biometrics
Login without a password
- Fingerprint scanners on many new smartphones and other devices
- Face unlock on Apple iPhone X
Bird identification
Special effects: shape capture
Special effects: motion capture
3D face tracking w/ consumer cameras
Image synthesis
- Which face is real?
Sports
Smart cars
Self-driving cars
Robotics
Medical imaging
Virtual & Auamented Reality

Current state of the art

You just saw many examples of current systems.
- Many of these are loss than 5 years old
Computer vision is an active research area, and rapidly changing
- Many new apps in the next 5 years
- Deep learning powering many modern applications
Many startups acrosss a dizzying array of areas
- Deep learning, robotics, autonomous vehicles, medical imaging, construction, inspection, VR/AR, ...

Why is computer vision difficult?

Viewpoint variation
Illumination
Scale
Intra-class variation
Background clutter
Motion
Occlusion

Challenges: local ambiguity

But there are lots of visual cues we can use...

Bottom line

Perception is an inherently ambiguous problem
- Many different 3D scenes could have given rise to a given 2D image
- We often must use prior knowledge about the world's structure

Project based course whose goal is to teach you the basics of computer vision image processing, geometry,
recognition in a hands on way.

Course requirements

Prerequisites
- Data structures
- Good working knowledge of Python programming
- Linear algebra
- Vector calculus
Course does not assume prior imaging experience
- computer vision, image processing, graphics, etc.

3. Course overview (tentative)

Low-level vision
- Image processing, edg detection, feature detection, cameras, image formation
Geometry and algorithms
- projective geometry, stereo, structure from motion, optimization
Recognition
- face detection/recognition, category recognition, segmentation

4. Images & Image filtering [time permitting]

1. Low-level vision

Basic image processing and image formation

Filtering, edge detection
Feature extraction
Image formation

2. Geometry

Projective geometry
Stereo vision
Multi-view stereo
Structure from motion

3. Recognition

Image classification
Object detection
Convolutional Neural Networks

Reference: CS5670: Intro to Computer Vision (Cornell Tech)

728x90

저작자표시 비영리 변경금지 (새창열림)

'AI > Computer Vision' 카테고리의 다른 글

Aliasing(엘리어싱) - 발생 이유, 결과, 방지 방법 (2)	2022.06.01
[CS5670] Lecture 4: Local features & Harris corner detection (0)	2022.05.23
[CS5670] Lecture 3: Image Resampling & Interpolation (0)	2022.05.23
[CS5670] Lecture 2: Edge detection (0)	2022.05.19
[CS5670] Lecture 1: Images and image filtering (0)	2022.05.19

공지사항

최근에 올라온 글

최근에 달린 댓글

Total

Today

Yesterday

링크

TAG more

« 2025/07 »
일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

글 보관함

250x250

Deep-Dive AI

티스토리 뷰

[CS5670] Lecture 0: Intro to Computer Vision

1. What is computer vision?

2. Why study computer vision?

3. Course overview (tentative)

4. Images & Image filtering [time permitting]

1. Low-level vision

2. Geometry

3. Recognition

'AI > Computer Vision' 카테고리의 다른 글

티스토리툴바