[CV] Image segmentation / Morphological Filter / Object Recognition

CS/Computer Vision

[CV] Image segmentation / Morphological Filter / Object Recognition

공영재 2024. 4. 16. 18:22

Image Segmentation

Image segmentation : 이미지에서 ROI(Regions Of Interest)를 분리해내는 것.

Segmentation Methods

- Intensity based : Edge based와 Region based로 나뉘며, 픽셀 값의 불연속과 유사성을 사용한다.

- Information based : 기존에 ROI에 관해 알고 있던 정보들 사용

- Data and learning-based : 위 두가지를 포함하며, 유저가 개입하지 않는다.

위 methods를 조합하여서도 사용한다.

region-based intensity segmentation의 경우 Morphological filtering, Region labeling, Region selection의 전처리를 요구한다.

region selection에서 region growing이란 방법은 미리 정의된 growth 기준에 따라 region을 키워가며 grouping한다.

difference가 threshold보다 작으면 병합하고, 크면 region을 분리한다,

Information based의 경우 Prior knowledge를 활용한다.

이를 사용한 Active Contour method가 있다.

이는 sobel을 통해 gradient를 활용해 edge를 얻은 externel energy( intensity-based ) 와

prior knowledge로 curvature(곡률)같은 요소를 넣은 Internal energy (info-based) 의 합이 최소가 되는 값을 찾도록 반복한다. 이같은 방법은 끊어진 부분 등, edge가 명확하지 않을 때 segmentation을 하기에 효과적이다.

Region selection에서, Largest object만 seperation이 될 경우, threshold를 낮추면 더 적은 object를 추출하게 된다.

Summary Question

1. Segmentation using Intensity Thresholds is done with ( ) with lower and upper intensity thresholds

2. Criteria such as intensity values, texture, and color are local in nature, and do not take into account the ( ), while it is considered in the region growing method

3. Region growing utilizes the concepts of likeness between a candidate pixel and the pixels ( ).

4. In region growing, Compare a pixel or a region with ( ). Also, Selection of ( ) often depends on the type of image data

5. In information-based segmentaiton, ( ) about target ROIs is used

1. binarization

2. history of region growth

3. grown so far
4. adjacent regions / similarity criteria

5. Prior knowledge

Morphological Filter

: Morphology란 동식물의 형태와 구조 표현에 사용되는 단어로, 목적 외 object를 잘라내는 등의 전-후 처리에 사용된다.

- Dilation : 팽창 - output object가 커진다.

- Erosion : 침식 - output object가 작아진다.

- Opening : Erosion -> Dilation : 돌출된 부분을 원만하게 하며, 작은 객체를 제거한다. (=끊김이 생길 수 있다)

- Closing : Dilation -> Erosion : 들어간 부분을 원만하게 하며, 끊어진걸 연결할 수 있다.

- Thinning : Skeleton을 얻는 과정. 최소한의 리소스로 정보를 보존한다.

아래처럼 생긴 8개의 structuring elements로 hit or miss transform을 진행한다.

※ Structuring element = kernel

이 말은 즉, origin image를 zero padding 후 kernel과 완벽하게 일치(아래 그림의 x는 상관없음)할 때에만 중앙을 0으로 바꾼다는 의미이다.

Summary Question

1. The opening of an image is defined as the ( ) using the same structuring element

2. The closing of an image is defined as the ( ) using the same structuring element

3. Thinning can be used for several applications but is particularly useful for ( ).

4. Behavior of the thinning operation is determined by a structuring element. The binary structuring elements used for thinning are of the extended type used for the ( ) (i.e., they can contain both ones and zeros)

1. Erosion 뒤 Dilation

2. Dilation 뒤 Erosion

3. skeletonization

4. hit-or-miss transform

Object Recognition

Pattern은 arrangement of descriptors다. Feature는 보통 descriptor를 정의하기 위해 사용된다.

세가지 common pattern arrangements는 vectors, strings, trees다.

Decision Theoretic methods : 어떤 패턴이 어느 클래스에 속하는지 결정할 때 사용한다.

입력 패턴 x에 대해 클래스 별로 값을 계산하고, 그 중에 가장 큰 값이 나온 클래스에 대해 해당 클래스로 분류한다.

Minimum Distance Classifier : 각 클래스의 prototype vector(=평균)를 미리 계산해두고 새로운 패턴과 distance를 비교한다. 당연하게도, 각 클래스 간의 차이가 클수록 잘 작동한다.

또한 각 클래스의 평균 분포가 n차원 벡터 공간에서 구형의 hypercloud일 때 최적의 성능을 보인다.

Summary Question

1. ( ) is a family of patterns that share some common properties

2. ( ) descriptions adequately generate patterns of objects whose structure is based on simple connectivity of primitives associated with boundary shape

3. Selecting the ( ) has a profound influence on the eventual performance of object recognition

4. Decision-theoretic approaches to recognition are based on the use of ( ).

5. Unknown pattern x is said to belong to the i-th pattern class, if di (x) yields the ( ).

6. MDC computes the distance between the unknown and each of the ( ), and chooses the smallest distance to make a decision

7. It yields optimum performance when the distribution of each class about its mean is in the form of a spherical ( ) in n-dimensional pattern space

1. Pattern class

2. String

3. Descriptors

4. decision functions

5. largest value

6. prototype vectors = mean vector

7. hypercloud

'CS > Computer Vision' 카테고리의 다른 글

Hand4Whole(CVPR 2022) 핵심 정리 (1)	2024.11.08
3DGS(SIGGRAPH 2023) 핵심 정리 (0)	2024.10.31
NeRF(ECCV, 2020) 핵심 정리 (0)	2024.10.29
[CV] Intensity Transformation / Filtering (0)	2024.04.16

현재글[CV] Image segmentation / Morphological Filter / Object Recognition

공영재의 개발일지