티스토리 뷰
728x90
- What is computer vision?
- Why study computer vision?
- Course overview
- Images & Image filtering [time permitting]
1. What is computer vision?
Goal of computer vision:
percieve the story behind the picture
Compute properties of the world
- 3D shape
- Names of people or objects
- What happened?
Can computers match human perception?
- Yes and No (maninly no)
- computers can be better at 'easy' things
- humans are better at 'hard' things
- But huge progress
- Accelerating in the last five years due to deep learning
- What is considered 'hard' keeps chaning
Human perception has its shortcomings
But humans can tell a lot about from a little information...
The goal of computer vision
- Compute the 3D shape of the world
- Recognize objects and people
- 'Enhance' images
- Forensics
- Improve photos ('Computational Photography')
- Super-resolution
- Low-light photography
- Depth of field on cell phone camera
- Inpainting / image completion
2. Why study computer vision?
- Bilions of images/videos captured per day
- Huge number of potential applications
- The next slide show the current state of the art(SOTA)
- Optical character recognition (OCR)
- Face detection (Nearly all cameras detect faces in real time (Why?)
- Face analysis and recognition
- Vision-based biometrics
- Login without a password
- Fingerprint scanners on many new smartphones and other devices
- Face unlock on Apple iPhone X
- Bird identification
- Special effects: shape capture
- Special effects: motion capture
- 3D face tracking w/ consumer cameras
- Image synthesis
- Which face is real?
- Sports
- Smart cars
- Self-driving cars
- Robotics
- Medical imaging
- Virtual & Auamented Reality
Current state of the art
- You just saw many examples of current systems.
- Many of these are loss than 5 years old
- Computer vision is an active research area, and rapidly changing
- Many new apps in the next 5 years
- Deep learning powering many modern applications
- Many startups acrosss a dizzying array of areas
- Deep learning, robotics, autonomous vehicles, medical imaging, construction, inspection, VR/AR, ...
Why is computer vision difficult?
- Viewpoint variation
- Illumination
- Scale
- Intra-class variation
- Background clutter
- Motion
- Occlusion
Challenges: local ambiguity
But there are lots of visual cues we can use...
Bottom line
- Perception is an inherently ambiguous problem
- Many different 3D scenes could have given rise to a given 2D image
- We often must use prior knowledge about the world's structure
Project based course whose goal is to teach you the basics of computer vision image processing, geometry,
recognition in a hands on way.
Course requirements
- Prerequisites
- Data structures
- Good working knowledge of Python programming
- Linear algebra
- Vector calculus
- Course does not assume prior imaging experience
- computer vision, image processing, graphics, etc.
3. Course overview (tentative)
- Low-level vision
- Image processing, edg detection, feature detection, cameras, image formation
- Geometry and algorithms
- projective geometry, stereo, structure from motion, optimization
- Recognition
- face detection/recognition, category recognition, segmentation
4. Images & Image filtering [time permitting]
1. Low-level vision
Basic image processing and image formation
- Filtering, edge detection
- Feature extraction
- Image formation
2. Geometry
- Projective geometry
- Stereo vision
- Multi-view stereo
- Structure from motion
3. Recognition
- Image classification
- Object detection
- Convolutional Neural Networks
728x90
'AI > Computer Vision' 카테고리의 다른 글
Aliasing(엘리어싱) - 발생 이유, 결과, 방지 방법 (0) | 2022.06.01 |
---|---|
[CS5670] Lecture 4: Local features & Harris corner detection (0) | 2022.05.23 |
[CS5670] Lecture 3: Image Resampling & Interpolation (0) | 2022.05.23 |
[CS5670] Lecture 2: Edge detection (0) | 2022.05.19 |
[CS5670] Lecture 1: Images and image filtering (0) | 2022.05.19 |
댓글
공지사항
최근에 올라온 글
최근에 달린 댓글
- Total
- Today
- Yesterday
링크
TAG
- 파이썬 클래스 계층 구조
- vscode 자동 저장
- NLP
- 구글드라이브다운
- 프롬프트
- CNN
- 서버구글드라이브연동
- clip
- 파이썬 클래스 다형성
- 파이썬
- support set
- Unsupervised learning
- 도커
- stylegan
- 구글드라이브연동
- 도커 컨테이너
- 퓨샷러닝
- 딥러닝
- 서버에다운
- python
- 파이썬 딕셔너리
- prompt learning
- Prompt
- 구글드라이브서버다운
- docker
- 데이터셋다운로드
- style transfer
- cs231n
- 구글드라이브서버연동
- few-shot learning
일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | |||
5 | 6 | 7 | 8 | 9 | 10 | 11 |
12 | 13 | 14 | 15 | 16 | 17 | 18 |
19 | 20 | 21 | 22 | 23 | 24 | 25 |
26 | 27 | 28 | 29 | 30 | 31 |
글 보관함
250x250