Learn how Large Language Models (LLMs) are revolutionizing AI through advanced NLP, powering chatbots, content creation, and more. Explore the core concepts!
Large Language Models (LLMs) represent a significant advancement in the field of Artificial Intelligence (AI), particularly within Natural Language Processing (NLP). These models are characterized by their immense scale, often containing billions of parameters, and are trained on vast datasets comprising text and code. This extensive training enables LLMs to understand context, generate coherent and human-like text, translate languages, answer questions, and perform a wide array of language-based tasks with remarkable proficiency. They are a specific type of Deep Learning (DL) model, driving innovation across numerous applications and forming a cornerstone of modern Generative AI.
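As a concrete sketch of what "generating human-like text" looks like in code, the snippet below uses the Hugging Face transformers library; this library choice is an illustrative assumption rather than the interface of any particular LLM named here, and "gpt2" stands in for today's far larger models:

```python
# Minimal text-generation sketch using the Hugging Face transformers library.
# "gpt2" is a small stand-in for modern billion-parameter LLMs.
from transformers import pipeline

generator = pipeline("text-generation", model="gpt2")

# Prompt the model and sample a short continuation.
result = generator("Large Language Models are", max_new_tokens=30)
print(result[0]["generated_text"])
```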
A Large Language Model is fundamentally a sophisticated neural network (NN), typically based on the Transformer architecture, introduced in the influential paper "Attention Is All You Need". The "large" in LLM refers to the huge number of parameters—variables adjusted during training—that can range from billions to even trillions. Generally, a higher parameter count allows the model to learn more complex patterns from the data.
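The core operation that paper introduced is scaled dot-product attention: Attention(Q, K, V) = softmax(QK^T / sqrt(d_k)) V. The toy NumPy sketch below is a deliberate simplification (real Transformers add learned projections, multiple heads, and causal masking), but it shows the computation for a single head:

```python
# Toy single-head scaled dot-product attention, the Transformer's core operation.
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    d_k = Q.shape[-1]
    # Score each query against every key, scaled to keep the softmax well-behaved.
    scores = Q @ K.T / np.sqrt(d_k)
    # Softmax turns scores into attention weights that sum to 1 per query.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    # Each output row is a weighted average of the value vectors.
    return weights @ V

rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((4, 8)) for _ in range(3))  # 4 tokens, dim 8
print(scaled_dot_product_attention(Q, K, V).shape)  # (4, 8)
```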
LLMs learn these patterns through self-supervised learning (often loosely described as unsupervised) on massive text corpora gathered from the internet, books, and other sources, often referred to as Big Data. This process helps them grasp grammar, facts, reasoning abilities, and even nuances like tone and style, though it can also lead them to absorb biases present in the training data. The core capability developed during training is predicting the next word, or token, in a sequence; this is the language modeling objective itself, and it underpins more complex tasks such as text generation and question answering.
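To make next-token prediction tangible, the hedged sketch below inspects the probability distribution a small causal language model assigns to the next token; GPT-2 via the transformers library is assumed here purely for illustration:

```python
# Inspect next-token probabilities from a small causal language model.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")

inputs = tokenizer("The capital of France is", return_tensors="pt")
with torch.no_grad():
    logits = model(**inputs).logits  # (batch, sequence_length, vocab_size)

# Probabilities over the whole vocabulary for the very next token.
probs = torch.softmax(logits[0, -1], dim=-1)
top = torch.topk(probs, k=5)
print([tokenizer.decode([i]) for i in top.indices.tolist()])
```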
Well-known examples include the GPT series from OpenAI (like GPT-4), Llama models from Meta AI such as Llama 3, Gemini from Google DeepMind, and Claude from Anthropic.
The versatility of LLMs allows them to be applied across many domains, from conversational chatbots to automated content creation.
Understanding LLMs also involves a handful of related concepts, such as tokenization (sketched below), embeddings, and the attention mechanism discussed above.
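For instance, tokenization determines the units of text a model actually sees. The small sketch below uses the GPT-2 tokenizer from the transformers library, an illustrative choice rather than the tokenizer of any specific model named above:

```python
# Show how text is split into subword tokens and mapped to integer IDs.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
tokens = tokenizer.tokenize("Large Language Models are transforming AI.")
print(tokens)                                   # subword pieces
print(tokenizer.convert_tokens_to_ids(tokens))  # IDs fed to the embedding layer
```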
While LLMs excel at language tasks, they differ significantly from models primarily designed for Computer Vision (CV). CV models, such as Ultralytics YOLO models (e.g., YOLOv8, YOLOv9, YOLOv10, and YOLO11), are specialized for interpreting visual information from images or videos. Their tasks include object detection, image classification, and instance segmentation.
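By way of contrast with the text-in, text-out interface of an LLM, a minimal object detection sketch with the ultralytics Python package might look like the following (the chosen weights file and sample image URL are illustrative):

```python
# Minimal object detection with the Ultralytics package.
from ultralytics import YOLO

model = YOLO("yolo11n.pt")  # pretrained weights download automatically

# Run inference; results hold bounding boxes, class labels, and confidences.
results = model("https://ultralytics.com/images/bus.jpg")
results[0].show()  # visualize the detections
```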
However, the boundary is blurring with the rise of Multi-modal Models and Vision Language Models (VLMs). These models, like OpenAI's GPT-4o or Google's Gemini, integrate understanding across different modalities (e.g., text and images), enabling tasks like describing images or answering questions about visual content.
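A hedged sketch of such a multimodal request, using the OpenAI Python client, is shown below; it assumes an OPENAI_API_KEY in the environment, and the image URL is a placeholder:

```python
# Ask a multimodal model a question about an image via the OpenAI API.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{
        "role": "user",
        "content": [
            {"type": "text", "text": "What is happening in this image?"},
            {"type": "image_url", "image_url": {"url": "https://example.com/street.jpg"}},
        ],
    }],
)
print(response.choices[0].message.content)
```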
Platforms like Ultralytics HUB provide tools and infrastructure for training and deploying various AI models, including those for vision tasks, facilitating the development of diverse AI applications. As LLMs and other AI models become more powerful, considerations around AI Ethics, algorithmic bias, and data privacy become increasingly important. For more information on AI concepts and model comparisons, explore the Ultralytics documentation and model comparison pages.