[DL Hacks] Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks

October 24, 2018

#deep learning #Deep Networks #Meta-Learning #Domain Adaptation #MAML #Implementation

Slide summary

2018/10/22
Deep Learning JP:
http://deeplearning.jp/hacks/

Deep Learning JP

@DeepLearning2023

Text of each slide

Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
PSI B3 近藤生也

Agenda
● Bibliographic information
● What is meta-learning
● Overview
● MAML
● Meta-learning

Bibliographic information
● Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks
● Chelsea Finn*... (UC Berkeley)
● Key points
  ○ Strong at meta-learning and domain adaptation
  ○ UC Berkeley is the strongest in robotics competitions.
  ○ Previously covered in the DL reading group (DL輪読会) as well:
  ○ [DL輪読会]Learning to Adapt: Meta-Learning for Model-Based Control
  ○ A useful talk on domain adaptation:
  ○ [DL輪読会]Life-Long Disentangled Representation Learning with Cross-Domain Latent Homologies

What is meta-learning
The focus is on how to acquire prior knowledge from the Meta-train-Task.
← The unknown task we ultimately want to solve
https://www.slideshare.net/DeepLearningJP2016/dllearning-to-generalize-metalearning-for-domain-generalization

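Domain adaptation
● Simulation
  ○ A leg suddenly comes off
  ○ A slope suddenly appears
● Segmentation
  ○ CG → real-world footage
● Voice changer
  ○ Person A's vocoder → Person B's vocoder
● Dishwashing and the like
  ○ Plates → pots
It can be used for just about anything!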

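MAML overview
● Idea: find shared initial parameters that can be optimized for any task in just a few steps
● Objective function
  ① Update the initial parameters θ on the train split of a meta-train-task
  ② With the updated parameters, compute the generalization error on the validation split of that meta-train-task (Φ_T denotes the updated parameters)
  ③ Update the initial parameters so that this generalization error becomes small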
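The three steps above can be written as the following objective. This is a sketch that assumes a single inner gradient step; the inner step size α, the meta step size β, and the train/val superscripts are notation chosen here to match the slide's wording, not symbols taken from the slide itself.

```latex
\begin{align}
\phi_T &= \theta - \alpha \, \nabla_\theta \mathcal{L}_T^{\mathrm{train}}(\theta)
  && \text{(1) adapt on the task's train split} \\
\min_\theta \; & \sum_{T} \mathcal{L}_T^{\mathrm{val}}(\phi_T)
  && \text{(2) generalization error after adaptation} \\
\theta &\leftarrow \theta - \beta \, \nabla_\theta \sum_{T} \mathcal{L}_T^{\mathrm{val}}(\phi_T)
  && \text{(3) update the initial parameters}
\end{align}
```

Because φ_T depends on θ, the gradient in step (3) passes through the inner update of step (1).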

Implementation

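Problem setting
● 30 languages for meta-train and 30 languages for meta-test, with 10 characters per language.
● During training, only 1 image per character is shown.
● At test time, 19 images per character are presented.
● Slightly different from the original implementation
  ○ The original builds its own tasks by mixing characters within train_task.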

Data loader

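Task loader
● Picks one language, splits it into train (1 image) and test (19 images), and returns the two splits as data_loaders
● Implemented by writing a simple task_loader class and inheriting from it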
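A minimal sketch of the split described above, in plain Python. It assumes 20 images per character (1 + 19, as on the slide), uses string ids in place of image tensors, and returns plain lists rather than data_loaders. `make_toy_dataset` and `sample_task` are hypothetical names, not the task_loader class from the slides.

```python
import random

N_IMAGES_PER_CHAR = 20  # assumption: 1 train image + 19 test images per character


def make_toy_dataset(n_languages=30, n_chars=10):
    """Hypothetical stand-in for the real data: image ids instead of pixels."""
    return {
        f"lang{l}": {
            c: [f"lang{l}/char{c}/img{i}" for i in range(N_IMAGES_PER_CHAR)]
            for c in range(n_chars)
        }
        for l in range(n_languages)
    }


def sample_task(dataset, rng, n_train=1):
    """Pick one language and split every character into train and test."""
    language = rng.choice(sorted(dataset))
    train, test = [], []
    for label, images in dataset[language].items():
        shuffled = images[:]          # copy so the dataset itself is untouched
        rng.shuffle(shuffled)
        train += [(img, label) for img in shuffled[:n_train]]
        test += [(img, label) for img in shuffled[n_train:]]
    return language, train, test


rng = random.Random(0)
language, train, test = sample_task(make_toy_dataset(), rng)
print(language, len(train), len(test))
```

With 10 characters per language, each sampled task has 10 training pairs and 190 test pairs.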

Meta-test
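The code on this slide is not preserved in the text. As a rough illustration of what meta-testing involves, the sketch below adapts a copy of the initial parameter to each unseen task with a few gradient steps on its train split and then measures the loss on its test split. It assumes a toy one-parameter regression model instead of a character classifier; all function names and hyperparameters are hypothetical.

```python
def loss(w, data):
    """Mean squared error of the one-parameter model y = w * x."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)


def grad(w, data):
    """Analytic derivative of loss() with respect to w."""
    return sum(2 * x * (w * x - y) for x, y in data) / len(data)


def adapt(w_init, train, alpha=0.1, steps=3):
    """Fine-tune a copy of the initial parameter on the task's train split."""
    w = w_init
    for _ in range(steps):
        w = w - alpha * grad(w, train)
    return w


def meta_test(w_init, tasks, alpha=0.1, steps=3):
    """Average test loss after adapting to each unseen task separately."""
    losses = []
    for train, test in tasks:
        w_task = adapt(w_init, train, alpha, steps)
        losses.append(loss(w_task, test))
    return sum(losses) / len(losses)


# Hypothetical unseen task: y = 3x, one training point, three test points.
task = ([(1.0, 3.0)], [(0.5, 1.5), (2.0, 6.0), (-1.0, -3.0)])
before = loss(0.0, task[1])      # test loss without any adaptation
after = meta_test(0.0, [task])   # test loss after three adaptation steps
print(before, after)
```

The initial parameter itself is never modified during meta-test; only the per-task copy is updated.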

Meta-train
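The meta-train code on these slides is not preserved in the text. The following is a minimal sketch of the MAML loop under strong simplifying assumptions: tasks are toy regressions y = a·x with a task-specific slope a, the model has one parameter w, and gradients are written analytically instead of using autograd. Names and hyperparameters are hypothetical and do not come from the maml-pytorch repository.

```python
import random


def loss(w, data):
    """Mean squared error of the one-parameter model y = w * x."""
    return sum((w * x - y) ** 2 for x, y in data) / len(data)


def grad(w, data):
    """First derivative of loss() with respect to w."""
    return sum(2 * x * (w * x - y) for x, y in data) / len(data)


def hess(data):
    """Second derivative of loss() with respect to w (constant in w)."""
    return sum(2 * x * x for x, _ in data) / len(data)


def meta_gradient(w, train, val, alpha):
    """Gradient of the post-adaptation validation loss w.r.t. the initial w."""
    phi = w - alpha * grad(w, train)       # (1) inner update on the train split
    dphi_dw = 1.0 - alpha * hess(train)    # derivative of the inner update
    return grad(phi, val) * dphi_dw, loss(phi, val)  # (2) + chain rule


def sample_task(rng, n_train=1, n_val=5):
    """Hypothetical task: slope drawn from [2, 4], inputs from [-1, 1]."""
    slope = rng.uniform(2.0, 4.0)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n_train + n_val)]
    points = [(x, slope * x) for x in xs]
    return points[:n_train], points[n_train:]


def meta_train(w=0.0, alpha=0.1, beta=0.05, iterations=500, batch=4, seed=0):
    rng = random.Random(seed)
    for _ in range(iterations):
        tasks = [sample_task(rng) for _ in range(batch)]
        g = sum(meta_gradient(w, tr, va, alpha)[0] for tr, va in tasks) / batch
        w -= beta * g                      # (3) update the initial parameter
    return w


w_meta = meta_train()
print(w_meta)
```

In this toy setting the learned initial value settles near the middle of the slope range, since that is the starting point from which one gradient step reaches any task's slope most cheaply on average.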

Can we differentiate with respect to old parameters?
● Tried it out in the notebook linked below
  ○ ↓ Caution: it contains a cell that takes a one-million-th-order derivative
● It seems to work
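The question can also be checked without any library. The sketch below unrolls a few gradient steps, carries the derivative of the current parameter with respect to the original one through each update, and compares the result with a finite-difference estimate. The inner and outer losses are arbitrary toy choices and all names are hypothetical. In PyTorch the same effect is obtained by keeping the graph of the inner update, for example with `torch.autograd.grad(..., create_graph=True)`.

```python
def inner_grad(w):
    """Derivative of the toy inner loss 0.25 * (w**2 - 2)**2."""
    return w ** 3 - 2 * w


def inner_hess(w):
    """Second derivative of the toy inner loss."""
    return 3 * w ** 2 - 2


def unroll(w0, alpha=0.05, steps=3):
    """Run gradient steps and track d(current w) / d(original w0)."""
    w, dw_dw0 = w0, 1.0
    for _ in range(steps):
        # Chain rule through w_new = w - alpha * inner_grad(w),
        # evaluated at the parameter *before* the update.
        dw_dw0 *= 1.0 - alpha * inner_hess(w)
        w = w - alpha * inner_grad(w)
    return w, dw_dw0


def outer_loss(w):
    """Toy loss evaluated on the updated parameter."""
    return (w - 1.0) ** 2


def outer_grad_wrt_w0(w0):
    """Derivative of outer_loss(updated w) with respect to the old w0."""
    w, dw_dw0 = unroll(w0)
    return 2.0 * (w - 1.0) * dw_dw0


w0, eps = 0.7, 1e-6
analytic = outer_grad_wrt_w0(w0)
numeric = (outer_loss(unroll(w0 + eps)[0])
           - outer_loss(unroll(w0 - eps)[0])) / (2 * eps)
print(analytic, numeric)
```

The two values agree up to finite-difference error, which is the plain-Python counterpart of the notebook's "seems to work".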

https://github.com/naruya/maml-pytorch/blob/master/notebooks/pytorch_autograd_kiso.ipynb