WEEK4 : Introduction to TensorFlow for Artificial Intelligence (ImageDataGenerator)

2021. 1. 10. 15:10

ImageDataGenerator

With ImageDataGenerator, images are labeled automatically based on the directory they are stored in.

from tensorflow.keras.preprocessing.image import ImageDataGenerator

train_datagen = ImageDataGenerator(rescale=1./255)
train_generator = train_datagen.flow_from_directory(
    train_dir,
    target_size=(300, 300),
    batch_size=128,
    class_mode='binary'
)

train_dir
- train_dir must be the parent directory whose subdirectories contain the images.
  (the location corresponding to train in the image above)
- The name of each subdirectory of train_dir is used as the label name.
target_size = (300, 300)
- There is no need to specify the channel size here.
- All inputs must have the same size to train a neural network, so the size has to be specified.
- Images are resized as they are loaded, so no separate preprocessing is needed.
batch_size
- The number of images the generator yields per batch.
class_mode
- "binary": 1D numpy array of binary labels,
- "categorical": 2D numpy array of one-hot encoded labels. Supports multi-label output.
- "input": images identical to input images (mainly used to work with autoencoders),
- "multi_output": list with the values of the different columns,
- "raw": numpy array of values in y_col column(s),
- "sparse": 1D numpy array of integer labels,
- None: no targets are returned (the generator will only yield batches of image data, which is useful to use in model.predict()).
- Note: "multi_output" and "raw" belong to flow_from_dataframe; flow_from_directory accepts "binary", "categorical", "input", "sparse", or None.
One thing to watch out for: train_generator and validation_generator must use the same target_size.
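The rule that subdirectory names become labels can be sketched without TensorFlow. The snippet below is only an illustration under assumptions: the class names "horses" and "humans" and both helper functions are hypothetical, and it assumes labels are assigned to subdirectory names in sorted order, which is how I understand the generator's class_indices mapping to be built.

```python
import os
import tempfile


def build_demo_tree(root_dir, class_names=("horses", "humans")):
    """Create one subdirectory per class, each holding an empty placeholder file."""
    for name in class_names:
        class_dir = os.path.join(root_dir, name)
        os.makedirs(class_dir, exist_ok=True)
        # Placeholder only: a real dataset would contain image files here.
        open(os.path.join(class_dir, "sample.png"), "w").close()


def infer_class_indices(root_dir):
    """Map each subdirectory name to an integer label, in sorted name order."""
    class_names = sorted(
        name for name in os.listdir(root_dir)
        if os.path.isdir(os.path.join(root_dir, name))
    )
    return {name: index for index, name in enumerate(class_names)}


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        build_demo_tree(tmp)
        print(infer_class_indices(tmp))
```

On a real generator, the corresponding mapping can be inspected through train_generator.class_indices.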

Defining the ConvNet

import tensorflow as tf

model = tf.keras.models.Sequential([
    # 1st CONV+POOL
    tf.keras.layers.Conv2D(16, (3, 3), activation='relu', input_shape=(300, 300, 3)),
    tf.keras.layers.MaxPooling2D(2, 2),
    # 2nd CONV+POOL
    tf.keras.layers.Conv2D(32, (3, 3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2, 2),
    # 3rd CONV+POOL
    tf.keras.layers.Conv2D(64, (3, 3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2, 2),
    # Flatten
    tf.keras.layers.Flatten(),
    # 1st DENSE
    tf.keras.layers.Dense(512, activation='relu'),
    # 2nd DENSE
    tf.keras.layers.Dense(1, activation='sigmoid') # binary classification
])
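To see what the Flatten layer receives, the feature-map sizes can be traced by hand. This is a rough sketch assuming the Keras defaults used in the model above: 3x3 convolutions with 'valid' padding and stride 1, and 2x2 max pooling with stride 2. The helper names are made up for illustration.

```python
def conv_output_size(size, kernel=3):
    """Spatial size after a 'valid' convolution with stride 1."""
    return size - kernel + 1


def pool_output_size(size, pool=2):
    """Spatial size after max pooling with stride equal to the pool size."""
    return size // pool


def trace_feature_map(input_size=300, filters=(16, 32, 64)):
    """Return (layer, height, width, channels) after each CONV and POOL layer."""
    size = input_size
    shapes = []
    for channels in filters:
        size = conv_output_size(size)
        shapes.append(("conv", size, size, channels))
        size = pool_output_size(size)
        shapes.append(("pool", size, size, channels))
    return shapes


def flattened_length(shapes):
    """Number of values Flatten produces from the last feature map."""
    _, height, width, channels = shapes[-1]
    return height * width * channels


if __name__ == "__main__":
    shapes = trace_feature_map(300)
    for shape in shapes:
        print(shape)
    print("flattened:", flattened_length(shapes))
```

The same numbers should appear in the output of model.summary().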

input_shape
- The channel size must be included here as well.
- (300, 300, 3): in the order row, col, channel
Output node of the last layer
- Since this is a binary classification problem, the number of units is set to 1
- and the activation function is set to sigmoid.
- For multi-class classification, set units to the number of classes and
  use 'softmax' as the activation function.
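The difference between the two output layers can be illustrated with plain Python. This is a minimal sketch, not the Keras implementation: sigmoid turns a single value into one probability, while softmax turns one value per class into probabilities that sum to 1. The input numbers are arbitrary.

```python
import math


def sigmoid(z):
    """Squash one raw output into a probability for the positive class."""
    return 1.0 / (1.0 + math.exp(-z))


def softmax(logits):
    """Turn one raw output per class into probabilities that sum to 1."""
    largest = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - largest) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    # Binary case: a single unit, class 1 is predicted if the value exceeds 0.5.
    print(sigmoid(0.8))
    # Multi-class case: three units, the largest probability wins.
    print(softmax([2.0, 1.0, 0.1]))
```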

from tensorflow.keras.optimizers import RMSprop

model.compile(loss='binary_crossentropy',
              optimizer=RMSprop(learning_rate=0.001),
              metrics=['acc'])

Because this is binary classification, 'binary_crossentropy' is used as the loss.
For a multi-class classification problem, use the 'categorical_crossentropy' or 'sparse_categorical_crossentropy' loss instead.
- If y is one-hot encoded, use categorical_crossentropy;
- if y is not one-hot encoded (plain integer labels), use sparse_categorical_crossentropy.
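The two multi-class losses compute the same value and differ only in how y is encoded. The sketch below shows this for a single sample; it is simplified (the actual Keras losses also clip probabilities and average over the batch), and the probabilities are made-up numbers.

```python
import math


def binary_crossentropy(y_true, prob):
    """Loss for one sample with a 0/1 label and a predicted probability of class 1."""
    return -(y_true * math.log(prob) + (1 - y_true) * math.log(1 - prob))


def categorical_crossentropy(one_hot, probs):
    """Loss for one sample whose label is a one-hot vector."""
    return -sum(t * math.log(p) for t, p in zip(one_hot, probs))


def sparse_categorical_crossentropy(label, probs):
    """Loss for one sample whose label is an integer class index."""
    return -math.log(probs[label])


if __name__ == "__main__":
    probs = [0.1, 0.7, 0.2]  # hypothetical softmax output for 3 classes
    print(categorical_crossentropy([0, 1, 0], probs))
    print(sparse_categorical_crossentropy(1, probs))
    print(binary_crossentropy(1, 0.7))
```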

history = model.fit_generator(
    train_generator,
    steps_per_epoch=8,
    epochs=15,
    validation_data=validation_generator,
    validation_steps=8,
    verbose=2
)

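model.fit_generator is used here instead of model.fit. (fit_generator has been deprecated since TensorFlow 2.1; model.fit accepts a generator directly with the same arguments.)
The training directory contains 1,024 images and batch_size is 128,
so steps_per_epoch is set to 1024/128 = 8.
The validation directory contains 256 images and the validation generator's batch_size is 32,
so validation_steps is set to 256/32 = 8.
verbose controls how much is printed during training: 0 prints nothing, 1 shows a progress bar, and 2 prints one line per epoch.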
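The step counts above are just the number of batches needed to see every image once. A small sketch of that arithmetic follows; steps_for is a hypothetical helper, and rounding up is my own choice so that a final partial batch is still counted when the image count is not a multiple of the batch size.

```python
import math


def steps_for(num_images, batch_size):
    """Number of batches needed to cover every image once (rounding up)."""
    return math.ceil(num_images / batch_size)


if __name__ == "__main__":
    print(steps_for(1024, 128))  # training set from the notes
    print(steps_for(256, 32))    # validation set from the notes
```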

Compressing images

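In this lecture, training was done on horse and human images of size 300*300*3.
But what happens if we train on images resized to 150*150*3?
(Just change the size in the image generator and the model's input_shape.)
The smaller size reduces the computation cost of training, but the accuracy also dropped
(because the number of layers was reduced to handle the smaller images).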
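To get a feel for the savings, the work done by the first Conv2D layer can be compared for the two input sizes. This is only a back-of-the-envelope sketch: it counts multiply-accumulate operations for one image in the first layer alone, assuming 16 filters of size 3x3, 3 input channels, 'valid' padding, and stride 1. first_conv_macs is a hypothetical helper.

```python
def first_conv_macs(input_size, in_channels=3, filters=16, kernel=3):
    """Multiply-accumulates for one image in a 'valid', stride-1 Conv2D layer."""
    out_size = input_size - kernel + 1
    per_output_value = kernel * kernel * in_channels
    return out_size * out_size * filters * per_output_value


if __name__ == "__main__":
    large = first_conv_macs(300)
    small = first_conv_macs(150)
    print(large, small, large / small)
```

Halving the width and height cuts this count to roughly a quarter.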

github.com/lmoroney/dlaicourse


License: Attribution, NonCommercial, NoDerivatives
