[Paper Review] Contrastive Vision-Language Pre-training with Limited Resources

176 views

Seoul National University, Department of Industrial Engineering, DSBA Lab

1 day ago

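Presenter: Siyul Sung (성시열), master's student, DSBA Lab, Department of Industrial Engineering, Seoul National University (siyul_sung@korea.ac.kr)
1. Paper title: Contrastive Vision-Language Pre-training with Limited Resources (ECCV 2022)
2. Paper link: arxiv.org/abs/...
3. Citations: 23 (as of 2024.09.29)
4. Summary
Proposes a CLIP training pipeline that trains efficiently when both data and compute resources are limited.
Uses a publicly accessible academic dataset of 14M samples so that the results can be reproduced.
Proposes Debias Sampling to address the dataset bias that arises when data is collected from multiple sources.
Proposes Coin Flipping Mixup, a data augmentation technique, to compensate for the limited amount of accessible data.
Proposes Decoupled Gradient Accumulation to achieve a large batch size under limited compute resources.
With all of these techniques applied, the method achieved the best performance among approaches using the same resources. After additional data collection, training on 100 million samples gave performance comparable to or better than existing SOTA methods.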
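
As a rough illustration of the Debias Sampling idea in the summary, the sketch below assumes it means drawing every mini-batch from a single data source, so that in-batch negatives in the contrastive loss cannot be told apart by source-specific cues alone. That reading, the function name `single_source_batches`, and the toy source labels are assumptions made for illustration; the summary does not describe the paper's actual implementation.

```python
import random
from collections import defaultdict


def single_source_batches(sources, batch_size, seed=0):
    """Return batches of sample indices where each batch comes from one source.

    `sources[i]` is the (hypothetical) source label of sample i.
    Leftover samples that do not fill a whole batch are dropped for simplicity.
    """
    rng = random.Random(seed)

    # Group sample indices by the dataset they came from.
    by_source = defaultdict(list)
    for idx, src in enumerate(sources):
        by_source[src].append(idx)

    # Shuffle within each source, then cut into full batches.
    batches = []
    for indices in by_source.values():
        rng.shuffle(indices)
        for start in range(0, len(indices) - batch_size + 1, batch_size):
            batches.append(indices[start:start + batch_size])

    # Shuffle the batch order so sources are interleaved during training.
    rng.shuffle(batches)
    return batches


# Toy example: 6 samples from source "A", 4 from source "B".
sources = ["A"] * 6 + ["B"] * 4
batches = single_source_batches(sources, batch_size=2)
```

In a real training loop the returned index lists would typically be fed to a data loader as a custom batch sampler; that wiring is left out here to keep the sketch dependency-free.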
