Showing posts with label Deepseek. Show all posts

Sunday, March 9, 2025

Open-Source AI Governance: The Dilemma and Alternative Approaches in U.S. Policy

오픈소스 AI 거버넌스:

미국 정책의 딜레마와 대안적 접근

Open-Source AI Governance:

The Dilemma and Alternative Approaches in U.S. Policy

국문 원문 | Korean Original

영문 번역 | English Translation

DeepSeek가 2024년 12월과 2025년 1월에 공개한 AI 모델은 오픈소스 AI의 확산을 가속화하며, 미국의 오픈소스 거버넌스에 대한 논쟁을 더욱 심화시키고 있다.

The AI models released by DeepSeek in December 2024 and January 2025 have accelerated the proliferation of open-source AI, intensifying debates over U.S. open-source governance.

이 논쟁은 크게 두 가지 관점에서 전개된다.

This debate is primarily framed by two perspectives.

첫째, 지정학적 관점에서는 미국과 중국 간의 경쟁을 고려하여 오픈소스 AI의 확산이 국가 안보 및 기술적 우위를 위협할 수 있다고 우려한다.

First, from a geopolitical perspective, concerns arise that the spread of open-source AI may threaten national security and technological supremacy amid U.S.-China competition.

둘째, 이념적 관점에서는 오픈소스 기술이 혁신, 투명성, 민주주의를 촉진하는 중요한 요소라고 보고, 이에 대한 규제를 반대하는 입장이다.

Second, from an ideological standpoint, open-source technology is viewed as a crucial driver of innovation, transparency, and democracy, leading to opposition to regulation.

현재 미국 정책 입안자들은 이 두 가지 상충하는 관점을 어떻게 조율할 것인지 고민하고 있다.

Currently, U.S. policymakers are grappling with how to reconcile these two conflicting perspectives.

특히 DeepSeek의 최신 AI 모델이 기존 폐쇄형 모델을 기반으로 성능을 향상시킨 사례가 등장하면서, 오픈소스 AI의 규제 필요성에 대한 논의가 더욱 활발해지고 있다.

The recent emergence of DeepSeek’s latest AI models, which have improved performance by leveraging proprietary models, has further intensified discussions on the need for open-source AI regulation.

오픈소스 AI 정책 분석: 네 가지 주요 방안

Analysis of Open-Source AI Policies: Four Key Approaches

최근 보고서에서는 오픈소스 AI 정책을 평가하는 기준을 제시하며, 다음 네 가지 정책적 접근법을 분석했다.

A recent report presents a framework for evaluating open-source AI policies and analyzes four key policy approaches.

전면적 수출 통제 Comprehensive Export Controls

산업 차원의 위험 평가 Industry-Level Risk Assessments

정부 감사 요구 Government Audit Requirements

공공 감사 저장소 구축 Establishment of a Public Audit Repository

1. 전면적 수출 통제: 비효율적이고 역효과 가능성

Comprehensive Export Controls: Inefficiency and Potential Drawbacks

모든 오픈소스 AI 모델에 대한 전면적 수출 통제는 비효율적이며, 오히려 부작용을 초래할 가능성이 크다.

Imposing comprehensive export controls on all open-source AI models is inefficient and may lead to unintended negative consequences.

모든 오픈소스 AI 사용자에게 ‘고객 확인(Know-Your-Customer, KYC)’ 절차를 요구하는 것은 특정 용도의 AI 발전을 저해할 수 있다.

Requiring all open-source AI users to undergo Know-Your-Customer (KYC) verification could hinder the development of specific AI applications.

또한, 중국의 오용을 효과적으로 방지하지 못할 가능성이 높으며, 미국의 글로벌 기술 주도권을 약화시킬 수도 있다.

Moreover, such measures may be ineffective in preventing misuse by China and could undermine U.S. global technological leadership.

수출 규제는 미국 내 오용 문제를 해결하지 못하는 한계를 가진다.

Export controls also fail to address the issue of domestic misuse within the U.S.

2. 개별 모델별 위험 평가: 보다 합리적인 대안

Model-Specific Risk Assessments: A More Practical Alternative

보다 현실적인 대안으로, AI 모델 개발자가 오픈소스 여부를 결정하기 전에 위험 평가를 수행하도록 요구하는 방안이 제안된다.

A more practical alternative is to require AI model developers to conduct risk assessments before deciding whether to release their models as open-source.

예를 들어, AI 기반 화학·생물 모델의 경우, 연구자가 기관 이메일을 제출해야만 전체 모델을 다운로드할 수 있도록 제한할 수 있다.

For instance, AI models related to chemistry or biology could be restricted so that only researchers with institutional email addresses can access the full model.

이 접근법은 기술 발전에 대한 방해를 최소화하면서도 오용 가능성을 줄일 수 있는 균형점이 될 수 있다.

This approach strikes a balance between minimizing disruptions to technological progress and reducing the risk of misuse.

또한, 수출 통제와 달리 중국뿐만 아니라 미국 내 오용 문제까지 해결할 수 있다.

Unlike export controls, this policy also addresses domestic misuse within the U.S., in addition to concerns about China.

3. 오픈소스 AI의 ‘백도어’ 위험과 감사 정책

Addressing Open-Source AI 'Backdoor' Risks through Audits

오픈소스 AI 모델의 보안 취약점을 악용할 가능성을 줄이기 위해, 정부 차원의 감사를 의무화하는 방안이 논의되고 있다.

To mitigate security vulnerabilities in open-source AI models, mandatory government audits are being considered.

그러나,

However,

감사 정보를 효과적으로 수집하는 것이 어려울 수 있으며,

Collecting relevant audit information may prove challenging.

감사 결과가 일반에 공개되지 않을 가능성이 크다.

Audit results are unlikely to be made publicly available.

4. 공공 감사 저장소 구축: 투명성과 신뢰 확보

Public Audit Repository: Enhancing Transparency and Trust

이에 대한 대안으로, 공공 저장소를 구축하여 주요 오픈소스 AI 모델의 감사를 공유하는 방식이 제안된다.

As an alternative, establishing a public repository for auditing key open-source AI models has been proposed.

이를 통해 오픈소스 AI에 대한 신뢰도를 높이고, 투명성을 강화할 수 있다.

This approach would increase trust in open-source AI and improve transparency.

또한, 특정 AI 모델이 보안상 안전한지 검증하는 데 유용한 공공재 역할을 할 수 있다.

Additionally, it could serve as a valuable public resource for verifying the security of AI models.

미래 정책 방향: 지속적인 연구와 비교 평가 필요

Future Policy Directions: Ongoing Research and Comparative Evaluation

오픈소스 AI 정책은 여전히 초기 단계에 있으며, 향후 지속적인 연구가 필요하다.

Open-source AI policy remains in its early stages, necessitating ongoing research.

미국과 중국의 AI 성능을 비교하고, 각국의 기술 발전 속도를 모니터링하는 것이 중요하다.

Comparing the AI performance of the U.S. and China while monitoring their respective technological advancements is essential.

비국가 행위자(Non-state actors)에 의한 보안 위협을 보다 심층적으로 연구할 필요가 있다.

Further in-depth research is needed on security risks posed by non-state actors.

자세한 내용은 보고서를 참고할 수 있다:

For more details, refer to the report:

👉 AI 정책 연구소 보고서

Sunday, February 2, 2025

DeepSeek model V3와 R1의 모든 것

DeepSeek: V2에서 R1까지, 실용적 AI 혁신의 여정

DeepSeek은 V2에서 V3, R1-0을 거쳐 R1 개발에 성공하기까지 수많은 도전과 변화를 겪어왔습니다. 지난해 말 V3가 발표될 당시만 해도 회의적인 시각이 많았고, 아직 갈 길이 멀다는 평가가 있었습니다. 그러나 올해 1월 R1이 발표되면서 AI 업계뿐만 아니라 다양한 산업군에서 DeepSeek에 대한 관심이 급격히 높아졌습니다.

DeepSeek의 기술 발전 과정을 살펴보면, 단순한 성능 개선을 넘어 실용적이고 현실적인 문제 해결에 집중한 접근 방식이 돋보입니다. V3 기술 보고서와 R1 발표 논문을 읽다 보면, 기존 연구들을 기반으로 효과적인 최적화를 수행하며, 실제 AI 시스템에 적용할 수 있는 방법론을 정교하게 발전시켜 왔음을 알 수 있습니다.

DeepSeek: The Journey from V2 to R1, A Practical AI Innovation

DeepSeek has undergone numerous challenges and transformations, from V2 to V3, R1-0, and ultimately, the successful development of R1. Many were skeptical when V3 was announced late last year, believing there was still a long way to go. However, with the release of R1 in January this year, interest in DeepSeek surged across the AI industry and various other sectors.

Examining DeepSeek’s technological advancements from V2 to V3, and then from R1-0 to R1, provides insight into their research trajectory and development philosophy. Reading the V3 technical report and the R1 announcement paper, it becomes evident that their focus is not merely on improving raw performance but on developing a highly practical and effective approach to real-world AI applications.

DeepSeek을 둘러싼 평가와 현실

DeepSeek의 기술에 대해 "새로운 패러다임을 제시한 것은 아니다", "시장 기술 대비 월등한 성능을 보이는 것도 아니다"라는 평가도 존재합니다.

하지만 DeepSeek이 거인의 어깨 위에 올라선 방식은 매우 실용적이며, AI 연구에서 중요한 발전을 이루었다는 점을 간과해서는 안 됩니다.

논문과 소스코드를 보면, DeepSeek의 접근 방식은 단순해 보일 수도 있습니다.

· "별것 아닌 것 같은데?"라는 안도감

· "이 아이디어를 이렇게 적용하면 효과가 있나?"라는 의구심

하지만, 실제로 오픈소스를 분석해 보면 그러한 안도감과 의구심은 곧 경외감으로 바뀝니다.

Evaluations and the Reality of DeepSeek

There are still claims that "DeepSeek hasn’t introduced a new paradigm of AI" or that "its performance is only comparable to existing technologies, not significantly superior."

However, it’s crucial to recognize that DeepSeek has climbed onto the shoulders of giants in an exceptionally practical way, achieving remarkable progress in AI research.

At first glance, DeepSeek's ideas might seem simple when looking at their papers and source code:

"Is this really all that groundbreaking?"—a sense of reassurance.
"Would this idea actually be effective when applied this way?"—a feeling of skepticism.

I felt the same way. However, upon analyzing and running their open-source implementation, that initial reassurance and skepticism quickly turned into admiration.

단순하지만 강력한 구현 방식

DeepSeek이 공개한 코드는 놀라울 정도로 단순하게 구현되었습니다.

· 복잡한 이론은 기존 연구를 활용하면서도, Hugging Face 모듈을 적극적으로 활용.

· 논문에서 다루지 않았지만, 실용적인 최적화 아이디어를 적용하여 더욱 효과적인 모델을 구축.

이러한 점들을 보면, AI 연구에서 "자금이 부족해서 할 수 없다", "전폭적인 지원이 부족하다", "인재가 없다"는 변명이 더 이상 통하지 않을지도 모릅니다.
DeepSeek의 접근 방식은 누구나 생각해볼 수 있는 아이디어들이지만, 결국 그 아이디어들을 제대로 구현하고 실용화한 것이 차별점입니다.

Simple Yet Powerful Implementation

DeepSeek’s open-source code is astonishingly simple and efficient.

Instead of reinventing complex theories, they effectively leverage existing research while incorporating Hugging Face modules.
They apply fine-grained optimizations, even implementing practical ideas that are not explicitly detailed in their papers.

Considering this, excuses like "lack of funding," "insufficient support," or "shortage of talent" may no longer hold as barriers to AI research.
DeepSeek’s approach is built on ideas that anyone could conceive, yet their true innovation lies in executing and refining these ideas into practical, high-performing systems.

DeepSeek의 연구를 정리하며

구슬도 꿰어야 보물이 됩니다. DeepSeek은 이론과 기술을 체계적으로 연결하여, 실제 AI 시스템에 적용 가능한 형태로 그 구슬을 꿰어냈습니다.

이번에 V3와 R1의 문서를 정독하고, 보다 상세하게 내용을 정리했습니다.
이 글은 강화 학습이나 인공지능에 대한 기초 지식이 있는 분이라면 쉽게 이해할 수 있도록 구성하였습니다.

저 역시 공부하는 마음으로 주말을 비워 정리했고, 이를 공유합니다.

DeepSeek의 기술 발전 과정과 핵심 아이디어들을 함께 탐구해 보시죠.

Reflections on DeepSeek’s Research

As the saying goes, "A pearl must be strung to become a treasure."
DeepSeek has successfully strung together the right technologies to create a system that is truly applicable in AI practice.

After carefully reading through the V3 and R1 documentation, I have organized and detailed my findings.
This article is structured to be accessible to those with basic knowledge of reinforcement learning and artificial intelligence.

I, too, took the time to study and compile this over the weekend, and now I share it with you.
Let’s explore DeepSeek’s technological advancements and core innovations together.

Untold Stories of Intellectual Property