Posts tagged '인공지능' (Artificial Intelligence)


Post list: Artificial Intelligence (14)

Attention please

[Reinforcement Learning] On-policy vs Off-policy

Temporal Difference. Before getting into on-policy and off-policy learning, let's briefly revisit TD (Temporal Difference) learning. $$Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \left( R_t + \gamma Q(s_{t+1}, a_{t+1}) - Q(s_t, a_t) \right)$$ The equation above is the TD update, where $\alpha$ is the learning rate. The core of TD is $Q(s_{t+1}, a_{t+1})$, the predicted value of the next state-action pair, which enters the TD target $R_t + \gamma Q(s_{t+1}, a_{t+1})$. $$Q(s_t, a_t) \leftarrow Q(s_t, a_t) + \alpha \left( \underbrace..

Deep Learning/Reinforcement Learning 2025. 4. 15. 02:23
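
The TD update quoted in the excerpt above can be sketched in a few lines of plain Python. This is only an illustration, not code from the post: the function name `td_update`, the dictionary-based Q-table, and the toy states, actions, and reward are all assumptions made for the example. `gamma` is the discount factor from the equation.

```python
def td_update(q, state, action, reward, next_state, next_action,
              alpha=0.1, gamma=0.9):
    """Apply one TD update to q[(state, action)] in place.

    q is a dict mapping (state, action) pairs to value estimates
    (a hypothetical tabular representation chosen for this sketch).
    Returns the TD error so the caller can inspect it.
    """
    current = q.get((state, action), 0.0)
    # TD target: R_t + gamma * Q(s_{t+1}, a_{t+1})
    td_target = reward + gamma * q.get((next_state, next_action), 0.0)
    # TD error: difference between the target and the current estimate
    td_error = td_target - current
    # Move the current estimate a fraction alpha toward the target
    q[(state, action)] = current + alpha * td_error
    return td_error


# Toy example with made-up states, actions, and reward.
q = {("s0", "right"): 0.0, ("s1", "right"): 1.0}
err = td_update(q, "s0", "right", reward=0.5,
                next_state="s1", next_action="right")
print(q[("s0", "right")], err)
```

Here the next action is passed in explicitly, matching the $Q(s_{t+1}, a_{t+1})$ term in the equation; how that action is chosen is exactly where on-policy and off-policy methods differ.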

[Paper Review] Mind with Eyes: from Language Reasoning to Multimodal Reasoning (2025)

This post reviews the paper Mind with Eyes: from Language Reasoning to Multimodal Reasoning. https://arxiv.org/abs/2503.18071 From the abstract: "Language models have recently advanced into the realm of reasoning, yet it is through multimodal reasoning that we can fully unlock the potential to achieve more comprehensive, human-like cognitive capabilities. This surve..

Paper Review/Multi-Modal 2025. 4. 10. 18:33

[Paper Review] VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection (2023)

This post reviews the paper VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly Detection. https://arxiv.org/abs/2308.11681 From the abstract: "The recent contrastive language-image pre-training (CLIP) model has shown great success in a wide range of image-level tasks, revealing remarkable ability for learning powerfu..

Paper Review/Anomaly Detection 2025. 4. 4. 23:43

[Paper Review] Imagine while Reasoning in Space: Multimodal Visualization-of-Thought (2025)

This post reviews the paper Imagine while Reasoning in Space: Multimodal Visualization-of-Thought. https://arxiv.org/abs/2501.07542 From the abstract: "Chain-of-Thought (CoT) prompting has proven highly effective for enhancing complex reasoning in Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs). Yet, it struggles in complex spatial r..

Paper Review/Multi-Modal 2025. 3. 14. 15:46

[Paper Review] SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation (2023)

This post reviews the paper SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation. https://paperswithcode.com/paper/samscore-a-semantic-structural-similarity/review/ Papers with Code - Paper tables with annotated results for SAMScore: A Semantic Structural Similarity Metric for Image Translati..

Paper Review/Image generation 2024. 2. 21. 03:44

Transposed Convolution: Principles and PyTorch Implementation

Various convolution techniques: original convolution, dilated convolution, separable & depthwise & pointwise convolution, depthwise separable convolution. In this post, we look at Transposed Convolution. This technique is often called Deconvolution, but that name is a misnomer. Deconvolution reverses an earlier convolution so that the output is restored to the original input. Transposed Convolution resembles Deconvolution in that it also produces the same spatial dimension, but Transpos..

Deep Learning/Computer Vision 2022. 12. 30. 13:11
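
The distinction drawn in the excerpt above, that a transposed convolution recovers the spatial size but not the original values, can be checked with a small sketch. This is a naive single-channel version in plain Python, not the PyTorch implementation the post covers. The helper names `conv2d` and `conv_transpose2d`, the all-ones kernel, and the 4x4 input are assumptions chosen for illustration. With no padding, the output size is `(in - 1) * stride + kernel`.

```python
def conv2d(x, kernel):
    """Naive 'valid' 2D convolution (cross-correlation), stride 1, one channel."""
    h, w = len(x), len(x[0])
    kh, kw = len(kernel), len(kernel[0])
    return [[sum(x[i + a][j + b] * kernel[a][b]
                 for a in range(kh) for b in range(kw))
             for j in range(w - kw + 1)]
            for i in range(h - kh + 1)]


def conv_transpose2d(x, kernel, stride=1):
    """Naive transposed 2D convolution, no padding, one channel."""
    h, w = len(x), len(x[0])
    kh, kw = len(kernel), len(kernel[0])
    out_h = (h - 1) * stride + kh
    out_w = (w - 1) * stride + kw
    out = [[0.0] * out_w for _ in range(out_h)]
    # Each input element "stamps" a scaled copy of the kernel onto the output.
    for i in range(h):
        for j in range(w):
            for a in range(kh):
                for b in range(kw):
                    out[i * stride + a][j * stride + b] += x[i][j] * kernel[a][b]
    return out


# Hypothetical 4x4 input holding the values 1..16 and a 3x3 all-ones kernel.
original = [[float(4 * r + c + 1) for c in range(4)] for r in range(4)]
kernel = [[1.0, 1.0, 1.0] for _ in range(3)]

reduced = conv2d(original, kernel)             # 4x4 -> 2x2
restored = conv_transpose2d(reduced, kernel)   # 2x2 -> 4x4

print(len(restored), len(restored[0]))   # spatial size matches the original
print(restored == original)              # but the values are not recovered
```

The shapes line up again after the transposed convolution, yet the contents differ from the original input, which is why calling the operation "deconvolution" is misleading.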
