Posts from 2025/04/22 (2)
Attention please

The paper reviewed in this post is Dueling Network Architectures for Deep Reinforcement Learning. https://arxiv.org/abs/1511.06581 Abstract excerpt: In recent years there have been many successes of using deep representations in reinforcement learning. Still, many of these applications use conventional architectures, such as convolutional networks, LSTMs, or auto-encoders..

The paper reviewed in this post is Deep Reinforcement Learning with Double Q-learning. https://arxiv.org/abs/1509.06461 Abstract excerpt: The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are common, whether they harm performance, and whether they can generally be pr..