Untold Stories of Intellectual Property: The central issue in the copyright infringement controversy surrounding generative AI

Sunday, April 13, 2025

The central issue in the copyright infringement controversy surrounding generative AI

생성형 AI의 저작권 침해 논란의 핵심 쟁점 (The central issue in the copyright infringement controversy surrounding generative AI)

1. 서론 (Introduction)

최근 AI 기술의 급속한 발전은 저작권 침해 및 법적 책임과 관련하여 중대한 법적 논쟁을 촉발하고 있습니다. 특히 생성형 AI의 작동 방식이 기존 저작권 체계에 그대로 적용되기 어려운 구조적 특성을 지니면서, 해석상의 불명확성과 함께 기존 법리에 대한 근본적인 도전을 제기하고 있습니다. 이제 AI는 단순한 기술 혁신의 대상이 아니라, 저작권법상 책임 귀속과 판단 기준이라는 보다 본질적인 쟁점의 중심에 서게 되었습니다. 본 글에서는 이러한 논의에서 핵심이 되는 주요 쟁점들을 간략히 정리해 보고자 합니다.

The rapid advancement of AI technology has recently sparked intense legal debates concerning copyright infringement and liability. In particular, the way generative AI operates poses structural challenges that make it difficult to apply existing copyright frameworks directly. This has introduced significant interpretive ambiguities and fundamentally challenged established legal doctrines. As a result, AI is no longer merely a subject of technological innovation—it now lies at the heart of core legal issues surrounding authorship, liability, and the allocation of rights under copyright law. This article briefly outlines the key issues at the center of these ongoing discussions.

2. AI 저작권 침해의 핵심 쟁점 (Key Issues in AI-Related Copyright Infringement)

2.1. 학습 데이터 입력 단계에서의 침해 가능성 (Infringement Risks at the Training Data and Input Stage (In-Training Phase)

Generative AI models frequently utilize copyrighted works as training data without obtaining the copyright holders’ permission, raising concerns about potential violations of the reproduction right[1].

대표적인 사례로, 2023년 9월 제기된 Authors Guild v. OpenAI 사건에서는 Authors Guild와 17명의 작가들이 자신의 저작물이 무단으로 사용되어 AI 모델이 훈련되었다고 주장하며 OpenAI를 상대로 소송을 제기했습니다. 이후 2023년 12월에는 Microsoft가 OpenAI의 주요 투자자로서 해당 침해 가능성을 인지하고도 AI 훈련을 재정적으로 지원했다는 이유로 피고에 추가되었습니다.

A notable case highlighting this issue is Authors Guild v. OpenAI, filed in September 2023. In this lawsuit, the Authors Guild and 17 individual authors alleged that OpenAI had trained its language models using their copyrighted books without authorization. Later, in December 2023, Microsoft was added as a co-defendant based on allegations that it financially supported the training of these models while being aware of the potential copyright violations, given its role as a major investor in OpenAI.

이와 같은 AI 학습 과정에서는 ‘중간 복제(intermediate copies)’ (예: 텍스트 전체를 RAM에 저장하거나 임베딩 과정 중에 발생하는 임시 혹은 숨겨진 복제물)가 종종 발생됩니다. 이는 전통적인 저작권 원칙을 현대적인 머신 러닝 아키텍처에 어떻게 적용할지 재해석해야 하므로 법적 불확실성을 야기합니다.

AI training processes like these often involve the creation of intermediate copies—temporary or hidden reproductions of copyrighted content (such as storing entire texts in RAM or during embedding processes). This phenomenon introduces legal uncertainty, as it requires reinterpreting how traditional copyright principles apply to modern machine learning architectures.

미국 법원은 이러한 중간 복제가 기술적으로 불가피하더라도, 특정 조건에서 침해로 판단될 수 있음을 인정했습니다:

U.S. courts have acknowledged that even if such intermediate copying is technically necessary, it may still constitute infringement under certain conditions:

저작물 전체의 실질적 사용
상업적 목적
비보호적 저장(암호화 또는 통제 수단 없이 저장)

Substantial use of the copyrighted work in its entirety
Commercial purpose behind the use
Non-protected storage, such as saving unencrypted or uncontrolled copies without safeguards

These developments suggest a need to revisit the application of conventional fair use factors—purpose and character of the use, nature of the work, amount and substantiality used, and market impact—within the technological realities of large-scale AI training systems.

이러한 발전은 대규모 AI 훈련 시스템의 기술적 현실 속에서, 전통적인 공정 이용 요건(사용의 목적과 성격, 저작물의 성격, 사용된 양과 중요성, 시장 영향)을 재검토할 필요성을 시사합니다.

다음은 관련 주요 판례입니다:

Relevant Case Law

아래 사례에서 ‘다운로드’라는 용어는 AI 내부 프로세스를 설명할 때, 사용자 측에서 파일을 다운로드하는 것으로 잘못 해석되지 않도록 ‘임시 디지털 저장 또는 복제’라고 대체할 수 있습니다.

The phrase “download” in the below cases could be refined to “temporary digital storage or reproduction” when describing AI internal processes to avoid misinterpretation as user-level file downloads.

2.1.1. 침해로 인정한 판례 (1900년대) (Cases Recognizing Infringement (1990s)): [2]

MAI Systems Corp. v. Peak Computer, Inc. (9th Cir., 1993)
NLFC, Inc. v. Devcom Mid-America, Inc. (7th Cir., 1995)
Stenograph L.L.C. v. Bossard Associates, Inc. (D.C. Cir., 1998)

이들 판례에서 법원은 다음과 같은 논거를 통해 단순 다운로드조차 복제 행위로 판단했습니다:

In these cases, courts held that even downloading software or files without permission can constitute copyright infringement. The key legal rationales included:

디지털 복제물의 본질: 다운로드된 파일이 원본과 동일한 기능을 수행할 수 있으므로 복제 행위에 해당함. Nature of digital copies: Downloaded files can perform the same functions as the original, thus qualifying as reproductions under copyright law.

저작권자의 권리 보호: 무단 다운로드는 저작권자의 배타적 권리를 침해하는 행위로 간주. Protection of exclusive rights: Unauthorized downloads were seen as violations of the copyright holder’s exclusive right to reproduce the work.

기술적 보호조치 회피 여부: 저작권 보호 기술을 우회한 다운로드는 직접적인 침해에 해당. Circumvention of technological protection measures: In some cases, bypassing technical safeguards to access and download content was viewed as direct infringement.

2.1.2. 침해로 인정하지 않은 판례(2000년대) (Cases Not Recognizing Infringement (2000s)): [2]

CoStar Group, Inc. v. LoopNet, Inc. (4th Cir., 2004)
Cartoon Network LP v. CSC Holdings, Inc. (2d Cir., 2008)

이들 판례는 다음과 같은 이유로 일시적 다운로드를 복제행위로 보지 않았습니다:

In contrast, courts in these cases concluded that temporary or incidental copies may not amount to infringement, based on the following reasoning:

일시적 저장의 비고정성: RAM 상의 임시 파일은 법적으로 ‘고정된 복제물’로 간주되지 않을 수 있음. Ephemeral nature of storage: Temporary copies in RAM or cache were not considered “fixed” under copyright law and thus did not qualify as unauthorized reproductions.

이용자의 의도와 활용방식: 단순 저장만으로는 저작물의 실질적 이용이 있었는지 판단하기 어려움. User intent and use context: Mere storage without further distribution or commercial use was insufficient to establish actionable infringement.

공정 이용 가능성: 연구·분석·비상업적 목적에 해당할 경우 공정 이용 원칙에 따라 침해가 성립하지 않을 수 있음. Potential fair use: If the copies were made for research, analysis, or other non-commercial purposes, the courts considered them more likely to fall under the fair use doctrine.

이러한 가운데, AI 학습 단계에서의 저작물 사용이 ‘변형적 이용(transformative use)’에 해당하는지 여부가 '공정이용'여부를 판단하는 핵심 쟁점으로 떠오르고 있습니다. 즉, AI가 기존 저작물을 단순 복제하는 것이 아니라 새로운 목적, 의미 또는 메시지를 부여하는 방식으로 활용할 경우 공정 이용으로 인정될 여지가 있습니다.

As legal debates unfold, a key question in determining whether AI training constitutes fair use is whether the use of copyrighted materials qualifies as transformative use. In other words, if an AI system does more than merely replicate original works—such as repurposing them to serve a new objective, convey a different meaning, or generate novel messages—it may fall under the umbrella of fair use.

또한 AI 모델 훈련이 원저작물이 원래 의도한 목적과 다른 방식으로 활용되어 교육, 연구 또는 창작 촉진이라는 사회적 가치를 지니고 있고, 이로 인해 저작물의 시장 가치를 실질적으로 훼손하지 않는다면 공정 이용이 인정될 수도 있을 것입니다. 그러나 이는 명확한 판단 기준이 없고 사안마다 사실 관계에 따라 달라질 수 있는 복잡한 영역입니다.

Moreover, if AI model training applies copyrighted content in a way that diverges from the original intent of the work—such as for educational, research, or creative purposes—and does not cause material harm to the market value of the original work, fair use may be more likely to be recognized. However, this area remains highly fact-specific and lacks clear judicial standards, making outcomes unpredictable.

In the European Union, while a text and data mining (TDM) exception exists, copyright holders retain the right to opt out explicitly, which means AI training that relies on copyrighted materials may still be deemed infringing under EU law[3].

일부 AI 기업들은 법적 위험을 줄이기 위해 저작권 보호된 자료를 학습 데이터에서 제외하는 옵션을 제공하고 있으며, 향후 법원의 판결에 따라 AI 학습 방식이 변화할 가능성이 있습니다. 결국, AI 기술 발전과 저작권 보호 간의 균형을 어떻게 설정할 것인지는 향후 주요 판례들을 통해 형성될 법리가 결정적인 영향을 미칠 것입니다.

To mitigate legal risks, some AI companies now offer options to exclude copyrighted content from training datasets. Nevertheless, the future direction of AI training practices will likely hinge on upcoming court decisions. Ultimately, how the balance is struck between the advancement of AI and the protection of copyright will depend heavily on the evolving body of case law.

2.2. 산출 단계(Output phase)에서의 침해 가능성 (Infringement Risks at the Output Stage (Generation Phase))

생성형 AI 모델이 사용자에 의해 활용되어 출력물을 생성하는 산출 단계에서는, 해당 출력물이 기존 저작물과 ‘실질적으로 유사’(substantially similar)하고, AI가 학습 중 해당 저작물에 ‘접근 가능성’(accessibility)을 가졌다면, 이는 전통적인 저작권 침해의 두 가지 요건('실질적 유사성'과 '의거성')에 따라 침해로 간주될 수 있습니다[2-2].

At the output stage—where a generative AI model is used by an end user to create content—copyright infringement may be found if the generated output is substantially similar to an existing copyrighted work and if the AI model had access to that work during training[2-3]. These two elements—substantial similarity and access—form the basis of traditional copyright infringement analysis.

2.2.1. 관련 사례: Andersen v. Stability AI (N.D. Cal. 2023, Case No. 3:23-cv-00201-WHO)

Case Example: Andersen v. Stability AI (N.D. Cal. 2023, Case No. 3:23-cv-00201-WHO)

2023년, 원고 Sarah Andersen, Karla Ortiz, Kelly McKernan 등 3인의 시각 예술가는 Stability AI, Midjourney, DeviantArt, Runway AI 등을 상대로 미국 캘리포니아 북부지방법원에 집단소송을 제기했습니다. 피고들은 Stable Diffusion 등 생성형 AI 모델을 훈련시키는 과정에서 인터넷상에 게시된 다수의 이미지(원고들의 저작물 포함)를 무단으로 수집하여 훈련 데이터로 사용했으며, 이후 사용자의 프롬프트 입력에 따라 생성된 이미지가 원고들의 작품 또는 스타일과 실질적으로 유사하다는 이유로 저작권 침해가 발생했다고 주장했습니다.

In 2023, three visual artists—Sarah Andersen, Karla Ortiz, and Kelly McKernan—filed a class-action lawsuit in the U.S. District Court for the Northern District of California against Stability AI, Midjourney, DeviantArt, and Runway AI. The plaintiffs alleged that the defendants unlawfully scraped millions of images from the internet—including their own works—to train generative AI models such as Stable Diffusion.

They claimed that the models generated images that were substantially similar to their original works or imitated their distinctive artistic styles when users entered prompts referencing their names. On this basis, the plaintiffs asserted that the defendants committed direct copyright infringement.

2.2.2. 법적 맥락: 스타일 대 표현 (Legal Context: Style vs. Expression)

미국 저작권법은 아이디어의 구체적 표현(즉, 해당 저작물 자체)을 보호하지만, 작가의 ‘스타일’ 그 자체에 대한 보호는 명시적으로 제공하지는 않습니다. 그러나 원고들은 AI 모델이 자신의 스타일을 대량 복제함으로써 작품의 상업적 가치를 훼손하고, 이는 시장 침해(market harm)라고 주장하였습니다. 일부 법원은 식별 가능한 표현 요소와 결합된 경우, 스타일 기반 유사성을 실질적 유사성(substantial similarity)을 판단하는 요소로 인정해왔습니다.

While U.S. copyright law protects specific expressions of ideas (i.e., the works themselves), it does not extend protection to an artist’s style per se. However, the plaintiffs argue that mass replication of their styles by AI models undermines the commercial value of their work, constituting a form of market harm. Courts have, in some circumstances, recognized style-based similarity as a factor in determining substantial similarity, particularly when it is coupled with identifiable expressive elements.

2.2.3. 피고 측 주장 (Defendants' Arguments)

Andersen v. Stability AI 사건에서는 피고들은 다음과 같이 항변했습니다:

In Andersen v. Stability AI, the defendants raised several key defenses:

AI 모델은 특정 저작물을 직접 저장하거나 복제하지 않았으며, 결과물은 기존 저작물을 단순 복제한 것이 아닌 독창적 생성물이다. The AI models do not store or reproduce specific copyrighted works, and the outputs are novel and independently generated.

생성된 출력물이 시각적으로 유사해 보일 수는 있으나, 직접적인 실질적 복제나 아차 저작물 작성이 아니라, 스타일적 영향의 결과일 뿐이다. Any visual similarity in the output is the result of stylistic influence rather than direct copying or derivative use.

AI 학습은 변형적인 목적(transformative purpose)과 혁신적 활용성을 고려할 때 공정 이용에 해당한다. The training of AI models qualifies as fair use, particularly given its transformative purpose and innovative application.

2.2.4. 법원의 예비적 판단 (Court’s Preliminary Ruling):

법원은 다음과 같이 예비적으로 판단하고 심리를 계속하고 있습니다.

The court held that:

원고의 저작물이 학습 데이터에 포함되었을 가능성이 존재하고, There is a plausible basis to believe that the plaintiffs’ copyrighted works were included in the training dataset.

사용자들이 원고의 이름을 프롬프트에 입력했을 때 원고의 예술적 스타일과 유사한 이미지가 생성되었음을 인정하고 (스타일의 유사성저작권 침해에 해당하는 지에 대한 본안 판단은 심리 계속 중), When users prompted the model with the plaintiffs’ names, the AI generated images that resembled their artistic styles.

이러한 사실들은 소송 초기에 직접 침해 및 유도 침해(induced infringement) 주장을 뒷받침하기에 충분하며, 이에 따라 사건은 사실심리를 거쳐 본안 소송이 진행될 수 있게 되었습니다. 따라서 스타일의 유사성이 저작권 침해에 해당하는 지에 대한 본안 판단은 심리를 계속하고 있습니다[4]. These facts were sufficient to support claims for direct and induced infringement at the pleading stage, allowing the case to proceed to discovery and further litigation on the merits[4].

2.3. 학습단계에서 공정 이용과 변형적 이용 (Fair Use and the Transformative Purpose Defense in AI Training)

생성형 AI 모델의 개발사들은 학습 과정에서 저작권 보호 자료를 사용하는 것이 혁신과 연구라는 변형적 목적(transformative purpose)을 충족하므로, 이는 공정 이용(fair use)에 해당된다고 주장하고 있습니다[1][4]. 그러나 이러한 주장을 법원이 어떻게 해석하느냐에 따라 판단이 엇갈리고 있으며, 특히 기존 저작물을 '분석하거나 인용하는 것'과 '기계적으로 대규모 학습에 사용하는 것' 사이의 경계가 쟁점이 되고 있습니다.

Developers of generative AI models commonly argue that their use of copyrighted materials during training qualifies as fair use, on the grounds that it serves a transformative purpose, such as innovation or research[1-3][4-3]. However, how courts interpret this claim remains unsettled. A key legal question lies in distinguishing between the analysis or citation of copyrighted works—which has traditionally been allowed under fair use—and the wholesale ingestion of such works for large-scale machine training, which raises novel concerns.

2.3.1. 미국 저작권법상 공정 이용 판단의 네 가지 요소 (The Four Factors of Fair Use Under U.S. Copyright Law)

미국 저작권법상 공정 이용 판단은 다음 네 가지 요소를 기준으로 이루어집니다[5]:

Under U.S. copyright law (17 U.S.C. §107), fair use is assessed based on the following four factors[5]:

1) 이용의 목적 및 성격 (Purpose and Character of the Use)

공익적 목적(예: 교육, 연구, 비평, 보도)에 가까울수록 공정 이용 인정 가능성이 높습니다. Uses with a public interest—such as education, research, criticism, or news reporting—are more likely to be deemed fair use.

상업적 목적이더라도 ‘변형적 이용(transformative use)’일 경우 공정 이용으로 인정될 수 있습니다. Even if a use is commercial in nature, it may still qualify if it is transformative—that is, if it adds new expression, meaning, or purpose to the original work.

2) 저작물의 성질 (Nature of the Copyrighted Work)

창작성이 높은 저작물(예: 소설, 음악, 예술작품)은 보호 강도가 높습니다. Creative works like novels, music, and visual art are afforded stronger protection under copyright law.

사실 중심의 자료(예: 기술 문서, 공공기록)는 공정 이용 인정 가능성이 상대적으로 큽니다. Conversely, factual or functional works (e.g., technical manuals, public records) are more likely to fall within the scope of fair use.

3) 이용된 부분의 양과 중요성 (Amount and Substantiality of the Portion Used)

사용된 양이 전체 저작물에서 차지하는 비중과, 핵심적인 부분인지 여부가 고려됩니다. Courts consider both the quantity and the qualitative importance of the material used.

핵심적인 부분을 광범위하게 사용한 경우 공정 이용으로 인정받기 어렵습니다. Copying substantial portions of the "heart" of a work—even if a small percentage overall—may weigh against fair use.

4) 시장 가치에 미치는 영향 (Effect on the Market for the Original Work)

이용이 원저작물의 시장을 대체하거나 수익에 부정적 영향을 미치는 경우, 공정 이용으로 인정되기 어렵습니다. If the new use harms the market for the original work or its potential licensing markets, this factor weighs heavily against a finding of fair use. Courts often assess whether the use acts as a substitute, thereby reducing demand for the original.

대법원 2011도5835 판결 (대한민국)은 한국의 저작권법 제35조에 따라 공정이용의 요건으로 이용 목적과 성격, 저작물의 성격, 이용량과 대체 가능성, 이용이 저작물의 시장에 미치는 영향에 따라 심리하였습니다. 미국 법원의 심리 요건과 다르지 않습니다.

In South Korea, the Supreme Court’s 2011Do5835 decision applied similar criteria under Article 35 of the Korean Copyright Act. These include:

The purpose and character of the use
The nature of the work
The amount and significance of the portion used
The potential impact on the market for the original work

Although the statutory language differs, the substantive standards for assessing fair use under Korean law closely mirror those in the United States.

공정 이용은 저작권자에게 부여된 배타적 권리와 접근·교육·혁신이라는 더 광범위한 공익 사이에서 법적 균형을 유지하기 위한 시험 역할을 합니다. 생성형 AI 시대에는 어떤 이용이 변형적 사용(transformative use)에 해당하는지, 그리고 혁신과 무단 차용(appropriation) 사이의 경계가 어디에 있는지를 가르는 일이 주요한 법적 과제로 떠오를 것입니다.

Fair use serves as a legal balancing test between the exclusive rights of copyright holders and the broader public interest in access, education, and innovation. In the age of generative AI, determining what qualifies as transformative use—and where the line is drawn between innovation and appropriation—will be one of the defining legal challenges.

2.3.2. 공정 이용이 인정된 주요 사례 (Key Cases Where Fair Use Was Affirmed)

Google LLC v. Oracle America, Inc. (2021)

구글의 Java API 사용은 새로운 소프트웨어 개발이라는 변형적 목적을 갖고 있었으며, API는 기능적 도구로서 창작성보다 실용성이 강조되므로 공정 이용으로 판단되었습니다. 사용된 부분은 전체 중 극히 일부분이었고, Oracle의 시장을 직접적으로 대체하지 않아 생태계 발전에 기여한 점이 인정되었습니다.

The U.S. Supreme Court held that Google’s use of Java API code in the Android operating system qualified as fair use. The Court emphasized that:

The use served a transformative purpose—namely, developing a new and innovative software platform.
APIs function primarily as utilitarian tools, not expressive works, thus weighing in favor of fair use.
Only a small portion of the total API codebase was used.
Google’s use did not supplant Oracle’s market, and instead contributed to broader software ecosystem development.

Authors Guild v. Google, Inc. (2015)

Google Books가 수백만 권의 책을 스캔하고 발췌 정보를 검색 가능하도록 제공한 행위는 공정 이용으로 인정받았습니다. 법원은 다음과 같은 근거를 제시했습니다.

구글의 행위는 공익에 기여하는 변형적 검색 기능(transformative search functionality)을 제공했다.
책 전체를 스캔했지만, 이용자가 확인할 수 있는 것은 일부분(스니펫)으로 제한되었다.
다만, 법원은 책 전체를 스캔하는 행위가 중간 복제(intermediate copying)로 간주될 여지가 있으며, 상황에 따라(예: AI 훈련 맥락) 제한이 따를 수 있음을 언급했습니다.

Google Books was deemed fair use for scanning millions of books and making excerpts searchable. The court found that:

Google’s use facilitated transformative search functionality that benefited the public.
Although the full texts were scanned, only small snippets were made accessible.
However, the court acknowledged that scanning full texts could constitute intermediate copying, suggesting that the practice could carry limits under different contexts (e.g., AI training).

Campbell v. Acuff-Rose Music (1994)

대법원은 로이 오비슨의 “Oh, Pretty Woman”을 상업적으로 패러디한 사례가 공정 이용에 해당한다고 판결했습니다. 이는 상업성과 공익성만으로 판단하지 않고, 창작적 기여의 유무가 중요함을 보여줍니다. 이 판결을 통해 다음 사항이 명확해졌다.

상업적 의도만이 아니라 변형적 이용(transformative use)이 핵심 고려 요소이다.
새로운 표현, 의미, 메시지를 담고 있다면 상업적인 사용도 공정 이용이 될 수 있다.
이 판례를 통해, 상당한 양의 복제가 이루어지는 경우라도 패러디 분야에서는 공정 이용이 인정될 수 있음을 분명히 했다.

The Supreme Court ruled that a commercial parody of Roy Orbison’s “Oh, Pretty Woman” constituted fair use. The decision clarified that:

Transformative use, not commercial intent alone, is the central consideration.
Even commercial uses can be fair if they add new expression, meaning, or message.
The case established parody as a category where fair use may apply even when substantial copying occurs.

2.3.3. 공정 이용이 부정된 주요 사례 (Key Cases Where Fair Use Was Rejected)

Thomson Reuters v. Ross Intelligence (2025)

법률 AI 검색엔진 Ross Intelligence가 자체 AI 법률 검색 엔진을 훈련하기 위해 Westlaw의 독점적 법률 콘텐츠를 사용함으로써 저작권을 침해했다는 판결이 나왔습니다. 법원은 다음과 같은 이유를 들었습니다. 특히 Ross의 기능이 Westlaw의 시장을 실질적으로 잠식할 수 있다는 점도 중요하게 고려되었습니다.

Ross Intelligence was found to have infringed copyright by using Westlaw's proprietary legal content to train its AI legal search engine. The court found that:

Ross의 AI는 새로운 콘텐츠를 생성하지 않고, 기존 법률 텍스트의 실질적 발췌 내용을 그대로 재현하고 표시했다. Ross's AI did not generate new content, but instead reproduced and displayed substantive excerpts of existing legal texts.

해당 사용은 Westlaw의 핵심 기능을 단순히 복제한 것이므로 변형적(transformative)이지 않다고 보았다. The use was not transformative, as it merely replicated Westlaw's core functionality.

법원은 시장 대체(market substitution)를 강조하며, Ross의 서비스가 Westlaw의 사업 모델을 직접적으로 위협한다고 지적했다. The court emphasized market substitution, noting that Ross’s services directly threatened Westlaw’s business model.

American Geophysical Union v. Texaco Inc. (1995)

Texaco의 사내 연구원들이 과학 학술지를 복사해 사용한 사례에 대해 법원은 공정 이용을 인정하지 않았습니다. 이는 상업적 조직에서의 내부적 연구 목적도 침해로 간주될 수 있음을 보여줍니다. 주된 이유는 다음과 같습니다.

The court rejected fair use for photocopying scientific articles by Texaco’s in-house researchers. Key points included:

복제 목적이 교육 전반의 이익보다 내부 비즈니스 용도에 집중되어 있었다. The copying served internal business purposes, not broader educational goals.

연구 기관 내이더라도, 라이선스 없이 상업적 용도로 이용하면 침해가 될 수 있다. Even within research institutions, commercial use without licensing may constitute infringement.

판결은 복제 규모, 사용 맥락, 그리고 라이선스 대안 존재 여부 등이 공정 이용 분석에서 중요하게 고려되어야 함을 시사했다. The ruling suggested that scale, context, and licensing alternatives are relevant to fair use analysis.

Harper & Row v. Nation Enterprises (1985)

대법원은 The Nation 잡지가 포드 대통령의 회고록 중 일부를 무단으로 발췌해 게재한 행위가 공정 이용에 해당하지 않는다고 판결했습니다. 법원은 다음과 같은 점을 강조했습니다.

The Supreme Court held that The Nation magazine’s unauthorized publication of excerpts from President Ford’s memoirs was not fair use. The Court emphasized:

무단 사용이 작품의 ‘핵심(heart of the work)’ 부분을 포함해 특히 피해가 컸다. The unauthorized use included the “heart of the work,” which was particularly damaging.

공정 이용을 판단할 때는 복제 분량뿐 아니라 사용된 부분의 중요성도 결정적인 요소다. The importance of the portion used, not just its quantity, is a critical fair use factor.

이번 판결은 ‘보도 가치(newsworthiness)’만으로 작품의 핵심 콘텐츠를 복제하는 행위를 정당화할 수 없음을 다시 한 번 확인해 주었다. The decision reinforced the notion that newsworthiness alone does not justify copying core content.

2.4. 복수 관여자에 대한 책임 귀속 (Liability Concerns)

생성형 AI와 저작권이 충돌하는 영역에서는 AI 모델 개발자, 사용자, 플랫폼 등 다양한 주체들이 얽혀 있으며, 특히 네트워크 기반의 복합 시스템 구조로 인해 직접 침해자(direct infringer)가 누구인지 명확히 특정하기 어려운 경우가 많습니다.

In the context of generative AI and copyright law, various stakeholders—such as model developers, end users, and platform providers—may be simultaneously involved in activities that raise questions of infringement. Due to the complex, networked architecture of AI systems, it is often difficult to pinpoint a single direct infringer.

AI가 사용자의 고의 없이도 침해적 결과물을 생성하는 특성상, 법적 책임이 사용자, 개발자, 플랫폼 중 누구에게 귀속되는지, 그리고 기계적 행위가 인간 중심의 법률 책임 체계에 어떻게 편입되는지가 핵심 쟁점으로 부상하고 있습니다.

Because AI systems can generate infringing content without any intent or knowledge on the part of the user, a core legal challenge is determining how liability should be distributed among users, developers, and platforms, and how the actions of autonomous systems are reconciled within a human-centric legal framework.

전통적 법리에 따르면, 다수의 당사자가 저작권 침해에 관여한 경우에도 책임 귀속의 경중을 구별해야 하며, AI와 관련된 주요 주체별 책임은 다음과 같이 정리할 수 있습니다:

Under traditional legal doctrine, when multiple actors are implicated in a copyright infringement claim, courts typically distinguish between levels of liability. In the context of AI, responsibilities can be broken down as follows:

AI 사용자: AI를 통해 저작권 침해 콘텐츠를 생성하거나 활용한 경우, 직접 침해 책임이 발생할 수 있습니다. AI Users may be directly liable if they use the system to generate or exploit infringing content.

AI 개발자: 모델의 설계나 기능이 침해를 조장하거나(promote) 방조(aid)한 경우, 유도 침해 또는 대리 책임이 부과될 수 있습니다. AI Developers may be held responsible if their model design or functionality promotes or facilitates infringement (e.g., inducement or contributory infringement).

AI 플랫폼: 침해적 결과물을 유통하거나 이를 알고도 방치한 경우, 플랫폼 운영자 역시 책임의 주체가 될 수 있습니다. AI Platforms may face liability if they knowingly distribute infringing outputs or fail to remove them despite having the ability to do so.

특히 대리 침해(vicarious infringement) 이론에 따라, AI 사용자뿐 아니라 개발자와 플랫폼까지도 법적 통제력 및 이익 귀속 여부에 따라 책임이 확장될 수 있습니다. 예컨대, AI 서비스 제공자가 사용자 활동을 통제할 수 있는 위치에 있으며, 침해로부터 경제적 이득을 얻는 경우, 전통적인 ISP 책임 구조(Napster 사건 등)와 유사한 법적 귀속이 이루어질 수 있습니다.

Under the doctrine of vicarious liability, developers and platform providers may also be held liable if they have the right and ability to supervise infringing activity and derive a direct financial benefit from it. This is conceptually similar to the liability framework applied to ISPs in early file-sharing cases (e.g., Napster).

한편, 미국 법제에서는 복수의 침해 관여자가 존재할 경우 일반 불법행위법(common law)의 공동불법행위(joint tortfeasor) 원칙보다는, 특허법의 유도 침해(induced infringement) 개념이 유사하게 적용되는 경향이 강합니다. 특히 특허/저작권 분야에서는 침해 유도(inducement) 또는 방조(contributory infringement) 책임이 일반 불법행위 이론보다 선호되는 경향이 있습니다. 즉, 침해를 실질적으로 유도하거나 이를 가능하게 만든 행위자에게 중점을 두고 책임을 부과하는 구조입니다.

In U.S. jurisprudence, when multiple actors are involved, courts tend to apply frameworks derived from patent law—particularly induced infringement and contributory infringement—rather than the common law rule of joint tortfeasors. The emphasis is on identifying the party who materially enabled or encouraged the infringing act.

2.4.1. 주요 사례(Key Case): Andersen v. Stability AI (N.D. Cal. 2023, Case No. 3:23-cv-00201-WHO)

2023년, Sarah Andersen, Karla Ortiz, Kelly McKernan 등 3인의 시각 예술가는 Stability AI, Midjourney, DeviantArt, Runway AI 등을 상대로 미국 캘리포니아 북부지방법원에 소송을 제기하며 다음 세 가지 책임을 주장했습니다:

In this class-action lawsuit, three visual artists—Sarah Andersen, Karla Ortiz, and Kelly McKernan—sued Stability AI, Midjourney, DeviantArt, and Runway AI in the Northern District of California, asserting the following theories of liability:

직접 침해 (Direct Infringement): 원고의 저작물을 무단 수집하여 AI 훈련 데이터로 사용한 행위 Direct Infringement: The defendants allegedly scraped the plaintiffs' copyrighted works without authorization and used them as training data.

유도 침해 (Induced Infringement): 침해 가능성이 있는 모델을 배포하여 제3자의 침해를 유도한 행위 Induced Infringement: The AI models were allegedly distributed in a way that encouraged or enabled third-party users to create infringing outputs.

대리 책임 (Vicarious Liability): 침해 행위를 통제할 수 있었음에도 불구하고 이를 방치하고 경제적 이득을 얻은 행위 Vicarious Liability: The defendants allegedly failed to prevent infringement despite having the ability to do so, while profiting from the infringing activity.

2.4.2. AI SaaS 기업의 책임 회피 전략 (Risk Mitigation Strategies for AI SaaS Providers)

AI SaaS (Software as a Service) 기업들은 ChatGPT, Midjourney, Notion AI 등과 같이 클라우드 기반 API 또는 웹 인터페이스를 통해 AI 기능을 제공합니다. 이러한 기업들은 사용자의 침해 행위로부터 스스로를 보호하기 위해 다음과 같은 계약적 장치를 활용하고 있습니다:

AI SaaS companies such as ChatGPT, Midjourney, and Notion AI typically offer generative services via cloud-based APIs or web platforms. To limit exposure to copyright liability, these providers often incorporate contractual risk-allocation clauses, including:

입력 및 출력에 대한 책임 귀속 조항: 사용자에 의한 프롬프트 및 출력물 이용에 따른 법적 책임은 사용자에게 전적으로 귀속된다는 약관 삽입. Responsibility for Inputs and Outputs: Terms of service that assign full legal responsibility for prompt inputs and resulting outputs to the user.

면책 조항(Disclaimer): 생성물의 정확성, 비침해성, 합법성에 대해 명시적으로 보증하지 않음. Disclaimers: Statements denying warranties regarding the accuracy, legality, or non-infringing nature of generated content.

Indemnification Clause: 제3자의 침해 주장 발생 시, 사용자가 개발사를 방어하고 손해를 배상해야 한다는 조항 포함. Indemnification Clauses: Provisions requiring users to defend and indemnify the provider in the event of third-party copyright claims.

지적재산 귀속 조항: 출력물은 사용자의 소유로 간주하되, 비식별화된 데이터에 한하여 모델 개선을 위한 비독점적 사용을 개발자가 할 수 있도록 허용. Intellectual Property Ownership Clauses: Output content is generally assigned to the user, while the developer retains a non-exclusive right to use anonymized output data for model improvement and further development.

이러한 구조는 책임 분산 및 리스크 최소화를 위한 전형적인 AI 서비스 계약 전략이며, 향후 AI 법제의 변화에 따라 표준 계약 조항으로 정착될 가능성이 높습니다.

This framework reflects a common contractual strategy to distribute liability and minimize legal risk. As legal standards around generative AI continue to evolve, these contract terms are likely to become standardized industry practice.

2.5. AI 생성물에 대한 소유권 귀속의 모호성 (Uncertainty in Ownership of AI-Generated Works)

현행 저작권법은 AI를 저작자로 인정하지 않기 때문에, AI가 생성한 결과물의 법적 소유권이 불분명한 상태입니다. 핵심 쟁점은 생성형 AI의 출력물에 대한 저작권 귀속이 사용자에게 있는지, 또는 개발자에게 있는지를 어떻게 해석할 것인가에 있으며, 이 판단은 계약, 대리관계, 공동저작 원칙 등을 통해 이루어질 수 있습니다. 현재로서는 계약을 통해 귀속 관계를 사전에 규정하는 방식이 가장 현실적이기 때문에, 기업 간 기술이전, 라이선스 계약에서 이 문제는 핵심적인 법률 쟁점으로 떠오르고 있습니다[6][7][8].

Under current copyright law, AI systems cannot be recognized as legal authors, which leaves the ownership of AI-generated content in a legal gray area. The central issue is whether copyright in the outputs of generative AI belongs to the user, the developer, or some combination of both. Resolution of this issue may hinge on contract terms, principles of agency, or co-authorship doctrines. In practice, pre-contractual allocation of ownership has emerged as the most viable approach, making this a pivotal issue in technology transfer and licensing agreements between businesses[6][7][8].

한편, 2023년 유럽연합 집행위원회 연구 보고서는 사용자의 창작 기여도를 기준으로 ‘공동 저작자(co-authorship)’ 이론을 도입할 가능성을 제시한 바 있습니다.

In a 2023 study, the European Commission proposed the potential application of a co-authorship theory based on the extent of user contribution to the creative process.

2.5.1. 생성형 AI의 출력물에 대해 ‘공동저작물’ 이론을 적용할 경우 (Application of the Co-Authorship Doctrine to AI-Generated Content)

(1) 공동저작물의 법적 정의 (Legal Definition of Joint Works)

“둘 이상의 사람이 공동으로 창작한 저작물로서, 각자의 이바지가 분리되어 이용될 수 없는 것”

Under Article 2(11) of Korea’s Copyright Act:

“A work created jointly by two or more persons, the individual contributions of which are inseparable from the final product.”

If an AI output is the result of human-machine collaboration, and the user's prompt input or post-editing constitutes substantial creative contribution, courts may recognize the human user as a co-author.

예시:

프롬프트 창작 기여: 특정 스타일, 장르, 화법을 지시(예: "봄 햇살과 녹색 톤 강조")
반복적 수정 지시: 다양한 프롬프트를 통해 결과물 조정
후속 편집: 이미지 조합, 재배열, 색보정, 문장 재구성 등

Examples of user contributions that may establish authorship:

Prompt engineering: Detailed instructions specifying style, genre, tone, or aesthetic (e.g., “a warm spring scene with emphasis on green tones”).
Iterative prompt revision: Using multiple refined prompts to guide and influence the outcome.
Post-generation editing: Curating, recomposing, recoloring, or rewriting the generated output.

Ownership in co-authored works can be allocated based on different approaches:

기여도 기반 분할(Contribution-based allocation)

창의적 기여도, 독창성, 편집적 통제 정도를 고려해 권리를 분할한다.

예: 사용자에게 80%, 모델 제공자에게 20% 배분

Contribution-based allocation:

Rights are divided according to the level of creative input, originality, and editorial control.
Example: 80% to the user, 20% to the model provider.

계약 우선주의(Contractual primacy)

서비스 이용약관 또는 라이선스 계약을 통해 소유권을 사전에 명시한다.

예: SaaS 계약서에 “생성된 모든 콘텐츠의 소유권은 전적으로 사용자에게 귀속된다”라고 명시

Contractual primacy:

Ownership is defined in advance through terms of service or license agreements.
Example: A SaaS agreement might state that “All generated content is solely owned by the user.”

독립성(분리 가능성) 판단(Separability test)

사용자가 자동화된 생성 과정을 넘어, 식별 가능한 인간적 저작 행위를 했는지를 평가한다.

예: 단순하고 일반적인 텍스트 프롬프트만 입력하고, 사용자 특유의 디자인적 기여가 없는 경우에는 사용자 귀속도가 낮을 수 있다.

Separability test:

Evaluates whether the user’s role involved any distinguishable human authorship beyond automated generation.
Example: Outputs created purely from generic text prompts without user-specific design input may result in low attribution to the user.

2.5.2. 실무 적용 예시 (B2B 환경) (Practical Use Case – B2B Context):

In B2B environments, a commonly used ownership framework is:

“The copyright in all outputs shall vest in the user, while the developer retains a non-exclusive, non-public license to use de-identified data for model improvement and technical refinement.”

Currently, Article 2(1) of Korea’s Copyright Act limits protection to works that express the thoughts or feelings of a human. As a result, works generated solely by AI or those where human input is minimal may fall outside copyright protection.

그러나 AI가 단순 도구를 넘어 창작 과정에서 일정 수준의 기계 개입이 있었던 경우, 이를 ‘AI 보조 창작물’로 분류하여 법적 보호를 가능하게 하려는 논의가 필요합니다.

However, where AI acts not as an autonomous creator but as an assistive tool within a human-directed creative process, some scholars and policymakers propose recognizing a new category of AI-assisted works.

이는 입법적을 ‘AI 보조 창작물’ 정의 삽입하고, AI 보조 창작물에 대한 저작물의 보호요건 신설할 필요가 있습니다.

To do so, legislative reform may be necessary to:

Introduce a statutory definition of AI-assisted work
Establish clear criteria for the level of human input required for protection

Comparative insights:

Japan: The 2023 commentary on its Copyright Act suggests openness to protecting AI-assisted works.

EU: Under the AI Act and Digital Services Act (DSA), emerging frameworks emphasize “human-centric control” as a core standard for evaluating authorship and responsibility.

3. 결론 (Conclusion)

The intersection of AI technology and traditional copyright law represents a legally sensitive area—one where judicial interpretation could significantly shape the future trajectory of technological development. Key legal risks facing both AI developers and users arise from unresolved issues, including reproduction during training, substantial similarity in generated outputs, attribution of liability, and the uncertainty surrounding ownership of AI-generated content.

Going forward, achieving a workable balance between copyright protection and technological innovation will require both legislative refinement and carefully structured contractual frameworks. As legal standards evolve, so too must the mechanisms by which rights and responsibilities are allocated in the AI ecosystem.

[미주(Footnotes): 참고문헌(References)]

A Case Study of Authors Guild v. OpenAI and Microsoft - 4iP Council https://www.4ipcouncil.com/application/files/7517/2189/4919/Copyright_Infringement_and_AI__A_Case_Study_of_Authors_Guild_v._OpenAI_and_Microsoft.pdf]↩︎↩︎↩︎↩︎
NOTE Copyright Infringement in AI-Generated Artworks https://lawreview.law.ucdavis.edu/sites/g/files/dgvnsk15026/files/media/documents/53-5_Gillotte.pdf ↩︎↩︎↩︎↩︎
Copyright and AI: the Cases and the Consequences https://www.eff.org/deeplinks/2025/02/copyright-and-ai-cases-and-consequences ↩︎↩︎↩︎↩︎
Artificial Intelligence, Copyright, and Copyright Infringement https://scholarship.law.marquette.edu/cgi/viewcontent.cgi?article=1355&context=iplr ↩︎↩︎
Artificial intelligence and copyright. from Thaler's dilemma https://www.institutoautor.com/wp-content/uploads/2024/01/inteligencia_eng_final.pdf ↩︎↩︎
Copyright Protection for AI-Generated Works: Exploring Originality ... https://www.cambridge.org/core/journals/asian-journal-of-international-law/article/copyright-protection-for-aigenerated-works-exploring-originality-and-ownership-in-a-digital-landscape/12B8B8D836AC9DDFFF4082F7859603E3 ↩︎↩︎
ARTIFICIAL INTELLIGENCE, COPYRIGHT LAW, other related legal ... https://digitalpolicy.ie/wp-content/uploads/2024/05/AI-copyright-law-and-digital-news.pdf ↩︎↩
BEATRIZ BOTERO ARCILA (2025), Policy Report: AI Liability Along the Value Chain, https://table.media/wp-content/uploads/2025/04/01152117/AI-Liability-Along-the-Value-Chain_Beatriz-Arcila.pdf︎