My dev journey

CPU Scheduling: From Basics to Execution Flow

Heesu Noh — Thu, 09 Apr 2026 14:00:03 GMT

1️⃣ Basic Concepts and Necessity of CPU Scheduling
2️⃣ Management Structure and Selection Criteria of CPU Scheduling
3️⃣ CPU Scheduling Execution Flow

1️⃣ Basic Concepts and Necessity of CPU Scheduling

CPU Scheduling; Who Does the OS Give the CPU to First?

Last week, we examined the concepts of processes and threads, looking at how execution flows are divided and how the operating system manages execution units.

Once you understand these concepts, a natural question arises: when multiple processes and threads exist simultaneously, who should the CPU be allocated to first? If all processes want to run at the same time, what criteria does the operating system use to make that decision?

This week's topic, CPU scheduling, starts from exactly these questions.

CPU 스케줄링; 운영체제는 누구에게 먼저 CPU를 줄까?

지난주에는 프로세스와 스레드의 개념을 살펴보며, 실행의 흐름이 어떻게 나뉘는지 그리고 운영체제가 실행 단위를 어떻게 관리하는지 확인했다.

개념을 익히고 나면 자연스럽게 이런 의문이 생긴다. 여러 프로세스와 스레드가 동시에 존재할 때, CPU는 과연 누구에게 먼저 할당되어야 할까? 모든 프로세스가 동시에 실행되길 원한다면, 운영체제는 어떤 기준으로 CPU를 나눠줄까?

이번 주제인 CPU 스케줄링은 바로 이 질문에서 출발한다.

Why Things Appear to Run Simultaneously

While listening to music, web pages still open instantly and messenger notifications keep coming through. To our eyes, all of these tasks seem to be happening at the same time.

In reality, the CPU can only process one task at a single moment. So are these multiple tasks truly running simultaneously?

The answer lies in rapid context switching. The operating system allocates the CPU to each task in extremely short intervals, taking turns. Because these switches happen so fast, it feels to the human eye as though everything is running at once.

So how does the operating system decide which task to run next? This is the core problem of CPU scheduling.

동시에 실행되는 것처럼 보이는 이유

음악을 듣는 동안에도 웹페이지는 바로 열리고, 메신저 알림은 끊기지 않는다. 우리 눈에는 이 작업들이 모두 동시에 이루어지는 것처럼 보인다.

그런데 사실 CPU는 한 순간에 단 하나의 작업만 처리할 수 있다. 그렇다면 이 여러 작업들은 정말로 동시에 실행되고 있는 걸까?

답은 빠른 전환(Context Switching) 이다. 운영체제는 각 작업에 CPU를 아주 짧은 시간 동안 번갈아 가며 할당한다. 이 전환이 너무 빠르기 때문에 사람의 눈에는 마치 모든 작업이 동시에 돌아가는 것처럼 느껴지는 것이다.

그렇다면 운영체제는 어떤 기준으로 다음 실행할 작업을 결정할까? 이것이 바로 CPU 스케줄링의 핵심 문제다.

The Core Problem of CPU Scheduling

Let's look at the situation more concretely. Opening a web page requires the CPU to process data. Displaying a messenger notification also uses the CPU. Even playing music involves CPU processing.

If all three of these tasks demand the CPU at the same moment, what choice should the operating system make? Should it run whichever task arrived first? Should it process the task that finishes fastest? Should it prioritize tasks that directly interact with the user?

The problem of deciding who gets the CPU, when, and for how long is the essence of CPU scheduling. The criteria chosen will significantly affect the system's response time, processing efficiency, and user experience.

CPU 스케줄링의 핵심 문제

상황을 조금 더 구체적으로 살펴보자. 웹페이지를 열 때는 데이터 처리를 위해 CPU가 필요하고, 메신저 알림을 표시할 때도, 음악이 재생되는 동안에도 CPU 처리가 이루어진다.

만약 이 세 가지 작업이 동시에 CPU를 요구하는 순간이 온다면 운영체제는 어떤 선택을 해야 할까?

먼저 도착한 작업을 실행할까?
가장 빨리 끝나는 작업을 먼저 처리할까?
사용자와 직접 상호작용하는 작업을 우선시할까?

이처럼 CPU를 누구에게, 언제, 얼마나 줄 것인가를 결정하는 문제가 바로 CPU 스케줄링의 핵심이다. 어떤 기준을 선택하느냐에 따라 시스템의 반응속도, 처리 효율, 사용자 경험이 크게 달라질 수 있다.

The Multi-Programming Environment

Modern operating systems keep multiple programs loaded in memory at the same time. Browsers, messengers, music players, and document editors all exist in a ready state simultaneously- meaning there are always multiple candidates requesting execution.

However, the CPU cannot process multiple programs at once. It can only handle one execution flow at a time, so even when many programs exist, the CPU must take turns processing them in order.

The operating system selects one of the ready processes and assigns the CPU to it. In a multi-programming environment, the process of deciding in what order the CPU is used — that is, scheduling — becomes essential.

다중 프로그래밍 환경

현대 운영체제는 여러 프로그램을 동시에 메모리에 올려둔다. 브라우저, 메신저, 음악 재생기, 문서 편집기 등 여러 프로세스가 동시에 준비 상태에 존재한다는 뜻이다. 즉, 실행을 요구하는 대상이 여러 개인 상황이 항상 발생한다.

그러나 CPU는 동시에 여러 프로그램을 처리할 수 있는 장치가 아니다. 한 시점에는 하나의 실행 흐름만 처리할 수 있기 때문에, 여러 프로그램이 존재하더라도 CPU는 순서를 정해 번갈아 가며 처리할 수밖에 없다.

이때 운영체제는 준비 상태에 있는 여러 프로세스 중 하나를 선택해 CPU를 할당한다. 결국 다중 프로그래밍 환경에서는 CPU를 어떤 순서로 사용할지 결정하는 과정, 즉 스케줄링이 필수적으로 요구된다.

The Illusion of Simultaneous Execution

The execution order is never directly exposed to the user. As a result, each program appears to run independently. The browser opens pages on its own; the messenger displays notifications on its own.

But in reality, the operating system is coordinating all of these execution flows behind the scenes. It continuously decides which process to keep in the running state and when to switch to another.

Because this process is invisible to the user, we feel as though multiple programs are running simultaneously. This is what we call the illusion of concurrent execution — and making this illusion feel natural is one of the key roles of CPU scheduling.

사용자가 느끼는 동시 실행의 착시

실행 순서는 사용자에게 직접 노출되지 않는다. 그 결과 각 프로그램은 마치 독립적으로 실행되는 것처럼 보인다. 브라우저는 브라우저대로 페이지를 열고, 메신저는 메신저대로 알림을 표시한다.

하지만 실제로는 운영체제가 이 모든 실행 흐름을 보이지 않는 곳에서 조율하고 있다. 어떤 프로세스를 실행 상태로 둘 것인지, 언제 다른 프로세스로 전환할 것인지를 운영체제가 끊임없이 결정한다.

이 과정이 사용자 눈에 보이지 않기 때문에, 우리는 여러 프로그램이 동시에 실행되는 것처럼 느끼게 된다. 이것이 바로 동시 실행의 착시다. 그리고 이 착시를 자연스럽게 만드는 것이 곧 CPU 스케줄링의 역할이기도 하다.

The Secret of Apparent Simultaneous Execution

Why does it look like multiple programs are running at the same time? As mentioned, the CPU handles only one task at a time. Execution targets are processed one by one in order, not simultaneously.

The secret is in execution switching. The operating system pauses the currently running process and switches to another. When needed, it returns to the previous process. Because this switching repeats at very high speed, it feels to the user as though multiple programs are operating at once.

The key point is that multiple processes are not truly running at the same time. The operating system rapidly swaps out which process is using the CPU — this is the actual mechanism behind multi-programming environments.

실행 전환: 동시 실행의 비밀

왜 여러 프로그램이 동시에 실행되는 것처럼 보이는 걸까? 앞서 말했듯 CPU는 한 시점에 하나의 작업만 처리한다. 실행 대상은 동시에 처리되는 것이 아니라 순서를 정해 하나씩 처리된다.

그 비밀은 실행 전환에 있다. 운영체제는 현재 실행 중인 프로세스를 잠시 멈추고 다른 프로세스로 전환한다. 그리고 다시 필요해지면 이전에 실행하던 프로세스로 돌아온다. 이 전환이 매우 빠른 속도로 반복되기 때문에, 사용자 입장에서는 여러 프로그램이 동시에 작동하는 것처럼 느껴지는 것이다.

여기서 중요한 점은, 여러 프로세스를 진짜로 동시에 실행하는 것이 아니라는 사실이다. 운영체제가 실행 대상을 빠르게 바꾸면서 CPU를 나눠 쓰는 방식, 이것이 다중 프로그래밍 환경의 실제 동작 원리다.

How Is the Next Process Selected?

We now know the CPU alternates between multiple execution targets. A natural question follows: among the many processes in the ready state, who gets the CPU first?

This is not simply a matter of ordering. Which task runs first directly affects the overall behavior and performance of the system. If a heavy task occupies the CPU for a long time, lighter tasks the user is waiting for will be delayed accordingly.

So who makes this decision? The operating system does. It judges what criteria to use when selecting the next process to run, and this judgment directly affects the response speed and efficiency that users experience.

Ultimately, how execution order is determined governs the system's performance and capability. This is why CPU scheduling is considered a core responsibility of the operating system, not just a routine management task.

실행 대상은 어떻게 선택될까?

CPU가 여러 실행 대상을 번갈아 처리한다는 것을 알았다. 그렇다면 자연스럽게 다음 질문이 생긴다. 준비 상태에 있는 여러 프로세스 중 누가 먼저 CPU를 사용할 것인가?

이 선택은 단순한 순서의 문제가 아니다. 어떤 작업을 먼저 실행하느냐에 따라 시스템 전체의 동작 방식과 성능이 직접적으로 달라진다. 예를 들어 무거운 작업이 CPU를 오래 점유하면, 사용자가 기다리는 가벼운 작업은 그만큼 늦게 처리된다.

그렇다면 이 순서는 누가 결정하는 걸까? 바로 운영체제다. 운영체제는 어떤 기준으로 다음 실행할 프로세스를 선택할지 판단하며, 이 판단이 사용자가 체감하는 반응 속도와 시스템 효율에 직결된다.

결국 실행 순서를 어떻게 정하느냐가 시스템의 성능과 가능성을 좌우한다. 이것이 CPU 스케줄링이 단순한 관리 기법을 넘어 운영체제의 핵심 역할로 다뤄지는 이유다.

The Ready Queue and CPU Scheduling

A process does not stay in one place — it moves through a flow of states.

At the center of that flow is the Ready Queue. The ready queue is where processes waiting to be assigned the CPU gather. Processes that are prepared to execute but have not yet received the CPU wait here.

When a process requests I/O, it leaves the CPU and moves to an I/O device queue. Once the I/O completes, it returns to the ready queue to wait for the CPU again. Processes continuously move between these two queues based on their state.

The important point is that CPU scheduling happens in the ready queue. The operating system selects one process from the ready queue and assigns it the CPU — and this selection process is CPU scheduling itself.

준비 큐와 CPU 스케줄링

프로세스는 한 곳에 머무르는 것이 아니라, 상태에 따라 이동하는 흐름으로 구성되어 있다.

그 흐름의 중심에 준비 큐(Ready Queue) 가 있다. 준비 큐는 CPU를 할당받기 위해 기다리는 프로세스들이 모여 있는 공간이다. 실행 준비는 완료되었지만 아직 CPU를 받지 못한 상태의 프로세스들이 여기에 대기한다.

한편 프로세스가 입출력을 요청하면 CPU를 떠나 입출력 장치 큐로 이동한다. 입출력 작업이 끝나면 다시 준비 큐로 돌아와 CPU 할당을 기다린다. 이처럼 프로세스는 준비 큐와 입출력 장치 큐 사이를 상태에 따라 오가는 구조다.

중요한 점은 CPU 스케줄링은 준비 큐에서 일어난다는 것이다. 운영체제는 준비 큐에 있는 프로세스 중 하나를 선택해 CPU를 할당하며, 이 선택 과정 자체가 바로 CPU 스케줄링이다.

What CPU Scheduling Means

In an environment where multiple processes share a single CPU, many processes may be ready to execute. But only one can actually use the CPU at any given moment. The operating system must therefore choose which of the candidate processes to transition into the running state.

This process of deciding which process to run next among many candidates is CPU scheduling. Put simply, CPU scheduling is the operating system's decision-making process for determining which process executes first.

CPU 스케줄링의 의미

지금까지의 내용을 바탕으로 CPU 스케줄링의 의미를 정리해보자.

하나의 CPU를 여러 프로세스가 함께 사용하는 환경에서는 실행 가능한 프로세스가 여러 개 존재한다. 하지만 실제로 CPU를 사용하는 대상은 한 시점에 단 하나다. 따라서 운영체제는 여러 프로세스 후보 중에서 어떤 프로세스를 실행 상태로 전환할지 선택해야 한다.

이처럼 여러 프로세스 중에서 실행할 대상을 결정하는 과정이 바로 CPU 스케줄링이다. 다시 말해, CPU 스케줄링이란 어떤 프로세스를 먼저 실행할지를 판단하는 운영체제의 결정 과정이라고 할 수 있다.

When Does CPU Scheduling Occur?

A process stays in the ready state until it is assigned the CPU. The operating system selects one ready process and transitions it to the running state. In other words, CPU scheduling occurs at the moment a process transitions from the ready state to the running state.

An important nuance: scheduling does not happen at every state change. It only occurs when the CPU needs to be newly assigned — when a new process must be chosen to run.

CPU 스케줄링은 언제 발생할까?

CPU 스케줄링이 무엇인지 알았다면, 이제 그것이 언제 일어나는지를 살펴보자.

프로세스는 CPU를 할당받기 전까지 준비 상태에 머문다. 운영체제는 준비 상태에 있는 프로세스 중 하나를 선택해 실행 상태로 전환한다. 즉, CPU 스케줄링은 프로세스가 준비 상태에서 실행 상태로 전환되는 순간에 발생한다.

여기서 중요한 점은, 모든 상태 변화에서 스케줄링이 발생하는 것이 아니라는 것이다. CPU를 새로 할당해야 하는 시점, 즉 실행할 프로세스를 새로 골라야 하는 순간에만 스케줄링이 이루어진다.

When Does Scheduling Repeat?

After a process is created, it moves through various states. For example, when a running process transitions from the running state to the waiting state due to an I/O request or time expiration, the CPU becomes idle. The operating system then selects another process from the ready state and assigns the CPU to it. Once the I/O completes, the process returns to the ready state and becomes a candidate for the next execution.

In this way, scheduling occurs repeatedly every time a process undergoes a state change.

It is worth noting that throughout this process, the long-term, medium-term, and short-term schedulers each intervene at different points. For now, however, it is enough to understand the overall structure. The specific role of each scheduler will be covered separately in the next session.

스케줄링은 언제 반복되는가?

프로세스는 생성 이후 여러 상태를 오가게 된다. 예를 들어 실행 중이던 프로세스가 입출력 요청이나 시간 만료로 인해 실행 상태에서 대기 상태로 이동하면, CPU는 비게 된다. 그러면 운영체제는 준비 상태에 있는 다른 프로세스를 선택해 CPU를 할당한다. 입출력이 끝난 프로세스는 다시 준비 상태로 돌아와 다음 실행 대상의 후보가 된다.

이처럼 프로세스의 상태 변화가 일어날 때마다 스케줄링은 반복적으로 발생한다.

한편 이 과정에는 단기, 중기, 장기 스케줄러가 각각 서로 다른 지점에서 개입한다. 다만 이번 내용에서는 전체적인 구조만 이해하는 것으로 충분하다. 각 스케줄러의 구체적인 역할은 다음 시간에 따로 다룰 예정이다.

The Repeated Cycle of Process State Changes and Scheduling

After creation, a process continuously cycles between ready, running, and waiting states. Each time a state change occurs, scheduling happens again.

Scheduling is not a one-time event. It is a continuous decision process that repeats every time a process changes state. When a running process transitions to a waiting state due to an I/O request or time expiration, the CPU becomes free, and a process that returns to the ready state becomes a candidate again.

Ultimately, scheduling is required every time the CPU becomes idle. The operating system selects the next process from the ready queue at each such moment, and this cycle repeats continuously for as long as the system runs.

프로세스 상태 변화와 스케줄링

프로세스는 생성 이후 준비, 실행, 대기 상태 사이를 계속해서 오간다. 그리고 이 상태 변화가 일어날 때마다 스케줄링은 반복적으로 발생한다.

즉, 스케줄링은 한 번으로 끝나는 과정이 아니다. 프로세스의 상태 전환이 일어날 때마다 계속 이루어지는 결정 과정이다. 실행 중이던 프로세스가 입출력 요청이나 시간 만료로 인해 상태가 바뀌면 CPU는 비게 되고, 준비 상태로 돌아온 프로세스는 다시 선택 대상이 된다.

결국 CPU가 비는 순간마다 스케줄링이 필요하다. 운영체제는 그 순간마다 준비 큐에서 다음 실행할 프로세스를 선택하고, 이 과정이 시스템이 동작하는 내내 끊임없이 반복된다.

Why Is CPU Scheduling Necessary?

The CPU can only process one task at a time. Yet execution requests continuously arrive from multiple sources simultaneously. Many tasks demand the CPU at once, and the CPU cannot handle them all immediately.

Therefore, a process for deciding which task to handle first — determining execution order — is absolutely necessary. Without establishing execution order, the system cannot function normally.

Deciding execution order is not optional. CPU scheduling is a structurally necessary process in any multi-programming environment.

CPU 스케줄링은 왜 필요한가?

CPU는 한 시점에 하나의 작업만 처리할 수 있다. 그러나 실행 요청은 계속해서 여러 곳에서 동시에 발생한다. 여러 작업이 동시에 CPU를 요구하지만, CPU는 그 모든 요청을 즉시 처리할 수 없다.

따라서 어떤 작업을 먼저 처리할 것인지, 실행 순서를 결정하는 과정이 반드시 필요하다. 만약 실행 순서를 정하지 않는다면 시스템은 정상적으로 동작할 수 없게 된다.

결국 실행 순서를 정한다는 것은 선택의 문제가 아니다. CPU 스케줄링은 다중 프로그래밍 환경에서 구조적으로 반드시 필요한 과정이다.

Why Execution Order Matters

The result of scheduling ultimately manifests as waiting time. Depending on which task is selected first, the time other tasks must wait before receiving the CPU changes — and so does when they get a response.

Users have no visibility into which algorithm the system uses internally. Instead, they evaluate the system based on how quickly the screen responds and how fast requests are answered. Even with the same resources and tasks, the performance users perceive can vary significantly depending on execution order.

In other words, scheduling is not merely about setting an order of execution. It is a critical factor that determines the performance users experience.

실행 순서가 중요한 이유

그렇다면 실행 순서는 왜 중요할까? 스케줄링의 결과는 결국 대기 시간으로 나타난다. 어떤 작업을 먼저 선택하느냐에 따라 다른 작업이 CPU를 받을 때까지 기다려야 하는 시간이 달라지고, 그에 따라 응답 시점도 달라지기 때문이다.

사용자는 시스템 내부에서 어떤 알고리즘이 사용되는지 알지 못한다. 대신 화면이 얼마나 빨리 반응하는지, 요청에 얼마나 빠르게 응답하는지를 기준으로 시스템을 평가한다. 같은 자원, 같은 작업이라도 실행 순서에 따라 사용자가 체감하는 성능은 크게 달라질 수 있다.

즉, 스케줄링은 단순히 실행 순서를 정하는 문제가 아니다. 사용자가 느끼는 시스템의 성능을 결정하는 핵심 요소다.

The Purpose of CPU Scheduling

The CPU is the busiest resource in any system. Since multiple tasks are constantly competing for it, minimizing idle CPU time is essential. How execution opportunities are divided among tasks is equally important.

The goal is not simply to finish any particular task quickly. How execution opportunities are distributed directly determines the overall performance of the system.

The purpose of CPU scheduling is to use the CPU efficiently while improving total system performance. Scheduling is the operating system's strategy for maximizing performance without wasting CPU resources.

CPU 스케줄링의 목적

지금까지 왜 실행 순서를 정해야 하는지 살펴보았다. 그렇다면 스케줄링은 무엇을 잘하기 위해 존재하는 걸까?

CPU는 시스템에서 가장 바쁜 자원이다. 여러 작업이 동시에 CPU를 기다리는 상황이 반복되기 때문에, CPU가 쉬고 있는 상황을 최소화하는 것이 중요하다. 동시에 여러 작업에게 실행 기회를 어떻게 나눌지도 핵심적인 문제다.

여기서 중요한 점은, 단순히 어떤 작업을 빨리 끝내는 것이 목표가 아니라는 것이다. 실행 기회를 어떻게 분배하느냐에 따라 전체 시스템의 성능이 달라지기 때문이다.

결국 CPU 스케줄링의 목적은 CPU를 효율적으로 사용하면서 전체 시스템의 성능을 향상시키는 것이다. 즉, 스케줄링은 CPU를 낭비하지 않으면서 시스템 성능을 높이기 위한 운영체제의 전략이라고 볼 수 있다.

Another Goal: Fairness and Stability

The goal of CPU scheduling is not performance alone. Efficiency matters, but in an environment where many processes share the CPU, fairness is also critical.

No single process should be allowed to monopolize the CPU indefinitely. Nor should any process go unselected for so long that it waits forever — this situation is called starvation, and when it recurs, trust in the system erodes.

As the number of competing tasks grows, competition intensifies. If scheduling criteria are designed to favor certain tasks, others will consistently be pushed back. When this happens repeatedly, it becomes difficult to predict when any given task will execute, and responsiveness suffers.

Scheduling is therefore not just a matter of increasing throughput. It is equally a matter of maintaining fairness and stable responsiveness.

스케줄링의 또 다른 목표: 공정성과 안정성

CPU 스케줄링의 목표는 성능 향상만이 아니다. CPU를 효율적으로 사용하는 것도 중요하지만, 여러 프로세스가 함께 사용하는 환경에서는 공정성도 중요한 요소다.

특정 프로세스가 CPU를 계속 독점하는 상황은 발생해선 안 된다. 또한 어떤 프로세스가 오랫동안 선택받지 못해 계속 대기하는 상황도 문제가 된다. 이를 기아 상태(Starvation) 라고 하며, 이 상태가 반복되면 시스템에 대한 신뢰가 떨어질 수밖에 없다.

실행을 요구하는 작업이 많아질수록 작업 간 경쟁도 심해진다. 이때 스케줄링 기준이 특정 작업에 유리하게 설계되어 있다면, 일부 작업은 계속 밀릴 수밖에 없다. 이런 상황이 반복되면 어떤 작업이 언제 실행될지 예측하기 어려워지고, 응답성에도 부정적인 영향을 미치게 된다.

따라서 스케줄링은 단순히 처리 속도를 높이는 문제가 아니라, 공정성과 안정적인 응답성을 함께 유지하는 문제이기도 하다.

Why Multiple Scheduling Approaches Are Needed

Can a single execution-order criterion apply to every situation? The priorities vary by system purpose. Sometimes fast response is critical; sometimes total throughput matters more; sometimes fairness is the top priority. The most appropriate scheduling approach depends on which criterion takes precedence.

The operating system selects an execution-order criterion to match the system's purpose, and the system's performance characteristics change accordingly. This is why many different scheduling approaches exist rather than just one.

The specific behavior of each scheduling algorithm will be examined in detail in the next session.

왜 다양한 스케줄링 방식이 필요한가?

지금까지 스케줄링이 CPU의 효율적 사용뿐 아니라 공정성과 안정성도 함께 추구해야 한다는 것을 살펴보았다. 여기서 한 가지 질문이 생긴다. 모든 상황에 동일한 실행 순서 기준을 적용할 수 있을까?

시스템의 목적에 따라 우선순위는 달라진다. 빠른 응답이 중요한 경우도 있고, 전체 처리량이 더 중요한 경우도 있으며, 공정성이 최우선인 경우도 있다. 어떤 기준을 우선하느냐에 따라 더 적합한 스케줄링 방식이 달라지는 것이다.

따라서 운영체제는 시스템의 목적에 맞는 실행 순서 기준을 선택하게 되고, 그 기준에 따라 시스템의 성능 특성도 달라진다. 이것이 스케줄링 방식이 하나가 아니라 여러 가지 존재하는 이유다.

각각의 스케줄링 알고리즘이 구체적으로 어떻게 동작하는지는 다음 시간에 자세히 살펴볼 예정이다.

2️⃣ Management Structure and Selection Criteria of CPU Scheduling

Why the Operating System Manages Execution Order

Multiple processes simultaneously request CPU execution. But because the CPU is a single resource, it cannot handle all requests at once. The result is that processes compete for one CPU.

Without a criterion for which process to run first, certain processes may monopolize the CPU and execution order becomes unpredictable. The operating system must therefore directly select and manage which ready process runs next.

What criterion guides that selection? This is the central question of CPU scheduling, and the various algorithms that answer it will be explored in the next session.

CPU 실행 순서는 왜 운영체제가 관리하는가?

현재 시스템에는 여러 프로세스가 동시에 CPU 실행을 요청한다. 그러나 CPU는 하나의 자원이기 때문에 모든 요청을 동시에 처리할 수 없다. 결국 여러 프로세스가 CPU 하나를 두고 경쟁하는 구도가 만들어진다.

이때 어떤 프로세스를 먼저 실행할지에 대한 기준이 없다면, 특정 프로세스가 CPU를 독점하거나 실행 순서를 예측하기 어려운 상황이 발생한다. 따라서 운영체제가 준비 상태에 있는 프로세스 중 어떤 것을 먼저 실행할지를 직접 선택하고 관리해야 한다.

그렇다면 그 선택은 어떤 기준으로 이루어질까? 이것이 바로 CPU 스케줄링의 핵심 질문이며, 다음 시간에는 이 기준들, 즉 다양한 스케줄링 알고리즘을 구체적으로 살펴볼 예정이다.

Why Execution Order Must Be Managed

In a modern system, multiple processes can request CPU execution at the same time. However, because the CPU is a single shared resource, all processes must take turns using it. Furthermore, execution requests can overlap in time — meaning situations where multiple processes simultaneously compete for the same CPU can arise at any moment.

In such an environment, conflicts over execution order arise naturally. Without a criterion for deciding which process runs first, certain processes may end up holding the CPU for an extended period, and execution results become difficult to predict.

For this reason, a structure that systematically manages CPU execution order is absolutely necessary. This is the fundamental reason why the operating system is responsible for CPU scheduling.

실행 순서가 관리되어야 하는 이유

현재 시스템에서는 여러 프로세스가 동시에 CPU 실행을 요청할 수 있다. 그러나 CPU는 하나의 자원이기 때문에 여러 프로세스가 함께 공유해서 사용해야 한다. 더불어 실행 요청은 시간적으로 겹쳐서 발생할 수 있다. 즉, 여러 프로세스가 같은 CPU를 두고 동시에 실행을 요구하는 상황이 언제든 생길 수 있다.

이런 환경에서는 자연스럽게 실행 순서 충돌이 발생한다. 어떤 프로세스를 먼저 실행할지에 대한 기준이 없다면 특정 프로세스가 CPU를 오래 독점하게 되고, 실행 결과 역시 예측하기 어려워진다.

따라서 이러한 환경에서는 CPU 실행 순서를 체계적으로 관리하는 구조가 반드시 필요하다. 이것이 운영체제가 CPU 스케줄링을 담당하는 근본적인 이유다.

What Does the Operating System Manage, and How?

We established that a structure for managing CPU execution order is necessary. So what exactly does the operating system manage, and how does it do so?

First, the subject of management is the pool of candidate processes registered in the ready queue - processes that are in an executable state but have not yet been assigned the CPU. These processes are not simply waiting; they are candidates that the operating system can select as the next execution target.

Next, looking at the method of management: the operating system maintains a waiting order based on the ready queue and selects one process from it to transition into the running state. The entity that performs this selection is the scheduler.

To summarize, CPU scheduling is the process of selecting the next execution target from among the candidates in the ready queue.

운영체제는 무엇을, 어떻게 관리하는가?

CPU 실행 순서를 관리하는 구조가 필요하다고 했다. 그렇다면 운영체제는 구체적으로 무엇을, 어떻게 관리하는 걸까?

먼저 관리 대상은 준비 큐에 등록된 실행 후보 프로세스들이다. 실행 가능한 상태에 있으면서도 아직 CPU를 할당받지 못한 프로세스들이 여기에 해당한다. 이 프로세스들은 단순히 대기하는 것이 아니라, 운영체제가 다음 실행 대상으로 선택할 수 있는 후보들이다.

다음으로 관리 방식을 살펴보면, 운영체제는 준비 큐를 기반으로 대기 순서를 유지하고 그 중 하나의 프로세스를 선택해 실행 상태로 전환한다. 이 선택을 수행하는 주체가 바로 스케줄러다.

정리하면, CPU 스케줄링이란 준비 큐에 있는 실행 후보 중에서 다음 실행 대상을 선택하는 과정이다.

The Scheduling Queue

The scheduling queue is a data structure that collects and manages processes waiting to execute. In data structures, a queue follows FIFO (First In, First Out) — the first element in is the first one out. The operating system uses this structure to manage process waiting states.

There are two fundamental types of scheduling queues. The ready queue holds processes in an executable state that are waiting for CPU allocation — CPU scheduling operates on these processes. The I/O device queue is where processes go when they issue an I/O request during execution; once I/O completes, they return to the ready queue.

Both are queues in structure, but they are managed separately based on what is being waited for and what role each plays.

스케줄링 큐란 무엇인가?

앞서 실행 후보 프로세스들이 준비 큐를 중심으로 관리된다고 했다. 이처럼 실행을 기다리는 프로세스들을 모아서 관리하는 자료구조를 일반적으로 스케줄링 큐(Scheduling Queue) 라고 부른다.

스케줄링 큐에는 CPU를 바로 사용할 수는 없지만 실행 가능한 상태에 있는 프로세스들이 등록된다. 즉, 스케줄링 큐는 CPU 할당 대상이 되는 프로세스들의 집합이며, 운영체제는 이 큐에 있는 프로세스들을 대상으로 다음 실행 대상을 선택한다.

정리하면, 스케줄링 큐는 실행 대기 프로세스를 체계적으로 관리하고 실행 순서를 결정하기 위한 기준이 되는 구조다. CPU 스케줄링은 결국 이 큐를 어떻게 운용하느냐의 문제라고도 볼 수 있다.

Process Execution Flow Based on the Scheduling Queue

Let's trace how a process flows through the system centered on the scheduling queue.

When one of the processes waiting in the ready queue is selected, it is assigned the CPU and transitions to the running state. If the process requires I/O during execution, it temporarily leaves the CPU and moves to the I/O device queue. Once the I/O completes, the process returns to the ready queue and becomes a candidate for execution again. When all of its work is finished, the process moves to the terminated state.

Throughout this flow, the operating system continuously selects the next execution target from the processes in the ready queue. CPU scheduling is ultimately the core mechanism that sustains this entire cyclical flow.

스케줄링 큐 기반 프로세스 실행 흐름

스케줄링 큐를 중심으로 프로세스가 어떻게 흘러가는지 정리해보자.

준비 큐에서 대기 중인 프로세스 중 하나가 선택되면 CPU를 할당받아 실행 상태로 전환된다. 실행 중에 입출력이 필요해지면 프로세스는 잠시 CPU를 떠나 입출력 장치 큐로 이동한다. 입출력이 완료되면 해당 프로세스는 다시 준비 큐로 돌아와 실행 후보가 된다. 그리고 모든 작업이 끝나면 프로세스는 종료 상태로 이동한다.

이 흐름 속에서 운영체제는 준비 큐에 있는 프로세스 중 다음 실행 대상을 끊임없이 선택한다. 결국 CPU 스케줄링은 이 순환적인 흐름 전체를 지탱하는 핵심 메커니즘이라고 할 수 있다.

Types of Scheduling Queues

In data structures, a queue follows a First In, First Out (FIFO) principle — the first element to enter is the first to leave. The operating system uses this structure to manage the waiting states of processes.

However, there is not just one queue. There are two fundamental types.

The first is the Ready Queue. This is where processes in an executable state wait for CPU allocation. CPU scheduling operates on the processes held in this queue.

The second is the I/O Device Queue. This is where a process goes when it issues an I/O request during execution. It waits here until the I/O completes, at which point it returns to the ready queue.

Both share the same queue structure, but the key point is that they are managed as separate queues based on what is being waited for and what role each serves.

스케줄링 큐의 종류

자료구조에서 큐(Queue)는 먼저 들어온 데이터가 먼저 나가는 선입선출(FIFO, First In First Out) 구조다. 운영체제도 이 큐를 활용해 프로세스의 대기 상태를 관리한다.

다만 이 큐가 하나만 존재하는 것이 아니다. 대표적인 기본 큐는 두 가지다.

첫 번째는 준비 큐(Ready Queue) 다. CPU 할당을 기다리는 실행 가능한 상태의 프로세스들이 모여 있는 공간이며, CPU 스케줄링은 바로 이 준비 큐에 있는 프로세스들을 대상으로 이루어진다.

두 번째는 입출력 장치 큐(I/O Device Queue) 다. 실행 도중 입출력을 요청한 프로세스가 이동하는 공간이다. 입출력이 완료될 때까지 여기서 대기하다가, 완료되면 다시 준비 큐로 돌아온다.

구조는 모두 큐이지만, 기다리는 대상과 역할에 따라 서로 다른 큐로 나누어 관리된다는 점이 핵심이다.

Scheduling Is Not a Single Decision

Scheduling does not occur only once. The operating system manages execution at multiple points in time, each serving a different purpose.

For example, when a new job enters the system, there is a stage for deciding whether to accept it right now. When the number of executable processes grows large, there is a stage for deciding which processes to keep in memory. And when the CPU actually becomes idle, a stage is needed to select which ready process to run immediately.

Scheduling is therefore not a single decision — it is a structure composed of multiple management stages. Each stage intervenes at a different point in time and for a different purpose, collectively coordinating the overall execution flow of the system.

스케줄링은 하나의 결정이 아니다

스케줄링은 하나의 순간에만 이루어지는 과정이 아니다. 운영체제는 시스템의 여러 시점에서 서로 다른 목적을 가지고 실행을 관리한다.

예를 들어 시스템에 새로운 작업이 들어올 때, 이 작업을 지금 받아들일지 결정하는 단계가 있다. 실행 가능한 프로세스가 많아지면 어떤 프로세스를 메모리에 유지할지 조절하는 단계도 존재한다. 그리고 실제로 CPU가 비는 순간이 되면, 준비 상태에 있는 프로세스 중 지금 당장 실행할 대상을 선택하는 단계가 필요하다.

이처럼 스케줄링은 단일한 결정이 아니라, 여러 관리 단계로 나누어 이루어지는 구조다. 각 단계는 서로 다른 시점에, 서로 다른 목적을 위해 개입하며 전체 시스템의 실행 흐름을 함께 조율한다.

The Long-Term Scheduler

When a program requests execution, the job first waits on disk. However, because memory capacity is limited, not all jobs can be loaded into memory immediately. This is where the long-term scheduler intervenes.

The long-term scheduler selects which jobs on disk to load into memory. Rather than deciding who gets the CPU, it decides which jobs to admit into the system as execution candidates. Because the number of processes loaded into memory determines how many processes can exist in the ready state at any given time, the long-term scheduler effectively controls the degree of multi-programming.

It is also worth noting that this decision does not happen frequently — it occurs intermittently, only as new jobs arrive.

장기 스케줄러

프로그램이 실행을 요청하면 해당 작업은 우선 디스크에서 대기한다. 그러나 메모리 용량에는 한계가 있기 때문에 모든 작업을 바로 메모리에 올릴 수는 없다. 이때 개입하는 것이 장기 스케줄러(Long-term Scheduler) 다.

장기 스케줄러는 디스크에 존재하는 작업 중 어떤 작업을 메모리에 올릴지 선택한다. 즉, CPU를 누구에게 줄지 결정하는 것이 아니라, 어떤 작업을 실행 후보로 시스템 안에 들여보낼 것인가를 결정하는 단계다. 메모리에 올라온 프로세스의 수에 따라 동시에 준비 상태로 존재하는 프로세스의 수가 달라지기 때문에, 장기 스케줄러는 곧 다중 프로그래밍의 정도를 결정하는 역할을 한다.

또한 이 결정은 자주 일어나는 것이 아니라, 새로운 작업이 유입될 때마다 간헐적으로 발생한다는 점도 기억해두자.

Summary of the Long-Term Scheduler's Role

Let's trace the long-term scheduler's flow of operation.

When a job arrives on disk, the long-term scheduler intervenes and selects which jobs to load into memory. Selected jobs are loaded into memory and registered as processes in the ready queue. From there, the short-term scheduler selects one of the processes in the ready queue and has it executed by the CPU.

In summary, the long-term and short-term schedulers divide their responsibilities across different stages. The long-term scheduler decides which jobs enter the system, while the short-term scheduler selects the actual CPU execution target. The specific behavior of the short-term scheduler will be examined in a later session.

장기 스케줄러의 역할 정리

장기 스케줄러의 동작 흐름을 정리해보자.

작업이 디스크에 도착하면 장기 스케줄러가 개입해 디스크에 있는 작업 중 어떤 것을 메모리에 올릴지 선택한다. 선택된 작업은 메모리에 올라와 준비 큐에 등록된 프로세스가 된다. 이후 단기 스케줄러(Short-term Scheduler) 가 준비 큐에 있는 프로세스 중 하나를 선택해 CPU에 의해 실행되도록 한다.

결론적으로 장기 스케줄러와 단기 스케줄러는 서로 다른 단계에서 역할을 나눠 담당한다. 장기 스케줄러는 시스템에 들어올 작업을 결정하고, 단기 스케줄러는 실제 CPU 실행 대상을 선택한다. 단기 스케줄러의 구체적인 동작은 추후에 살펴볼 예정이다.

The Medium-Term Scheduler

Even after the long-term scheduler loads jobs into memory, the state of memory continues to change. As the number of running or ready processes grows, memory can become scarce. This is where the medium-term scheduler intervenes.

The medium-term scheduler temporarily suspends some of the processes currently in memory, a process known as swap-out. Conversely, when memory space becomes available, it reloads suspended processes back into memory — a process known as swap-in.

In other words, the medium-term scheduler is responsible for regulating the number of processes maintained in memory. If the long-term scheduler controls how much work enters the system, the medium-term scheduler balances memory resources among the jobs already inside.

중기 스케줄러

장기 스케줄러가 작업을 메모리로 들여보낸 이후에도 메모리 상태는 계속 변한다. 실행 중이거나 준비 중인 프로세스가 많아지면 메모리가 부족해질 수 있는데, 이때 개입하는 것이 중기 스케줄러(Medium-term Scheduler) 다.

중기 스케줄러는 메모리에 올라와 있는 프로세스 중 일부를 일시적으로 중단 상태로 전환한다. 이 과정을 스왑 아웃(Swap-out) 이라고 한다. 반대로 메모리 공간에 여유가 생기면 중단된 프로세스를 다시 메모리로 불러오는데, 이 과정을 스왑 인(Swap-in) 이라고 한다.

즉, 중기 스케줄러는 메모리에 유지되는 프로세스의 수를 조절하는 역할을 담당한다. 장기 스케줄러가 시스템에 들어올 작업의 양을 결정한다면, 중기 스케줄러는 이미 들어온 작업들 사이에서 메모리 자원을 균형 있게 유지하는 역할을 한다고 이해하면 된다.

Summary of the Medium-Term Scheduler's Role

The medium-term scheduler manages processes that are already in memory — including both running and ready processes. When the number of processes causes memory to become overloaded, it suspends some of them and removes them from memory.

The defining characteristic of the medium-term scheduler is that it acts as a balancer between the long-term and short-term schedulers. While the long-term scheduler controls the volume of incoming work and the short-term scheduler determines which process gets the CPU, the medium-term scheduler adjusts the number of processes kept in memory to relieve system load.

중기 스케줄러의 역할 정리

중기 스케줄러의 관리 대상은 이미 메모리에 올라와 있는 프로세스다. 실행 중이거나 준비 상태에 있는 프로세스 모두가 여기에 포함된다. 시스템의 프로세스가 많아져 메모리가 과부하 상태가 되면, 중기 스케줄러는 이들 중 일부를 중단 상태로 전환해 메모리에서 내보낸다.

중기 스케줄러의 핵심 특징은 장기 스케줄러와 단기 스케줄러 사이에서 균형을 맞추는 조정자 역할을 한다는 점이다. 장기 스케줄러가 시스템에 들어오는 작업의 규모를 조절하고, 단기 스케줄러가 CPU 실행 대상을 결정한다면, 중기 스케줄러는 메모리에 유지되는 프로세스 수를 조정해 시스템 부하를 완화하는 역할을 담당한다.

The Short-Term Scheduler

The short-term scheduler selects which process in the ready queue to assign the CPU to. In other words, it is the stage that decides which of the many ready processes to transition into the running state right now.

This decision occurs very frequently. Every time a process terminates, an I/O request is made, or a time quantum expires, the short-term scheduler intervenes and selects the next execution target.

Among the three schedulers, the short-term scheduler has the most direct impact on system responsiveness and performance. All of the scheduling criteria and algorithms covered in later sessions are applied to this scheduler.

단기 스케줄러

단기 스케줄러(Short-term Scheduler)는 준비 큐에 있는 프로세스 중에서 CPU를 할당할 프로세스를 선택하는 역할을 한다. 즉, 준비 상태에 있는 여러 프로세스 중 지금 당장 실행 상태로 전환할 하나를 결정하는 단계다.

이 결정은 매우 자주 발생한다. 프로세스가 종료되거나, 입출력 요청이 발생하거나, 시간 할당량이 끝나는 순간마다 단기 스케줄러가 개입해 다음 실행 대상을 선택한다.

단기 스케줄러는 세 가지 스케줄러 중 시스템의 응답성과 성능에 가장 직접적인 영향을 미치는 스케줄러다. 이후에 배우게 될 다양한 스케줄링 기준과 알고리즘도 바로 이 단기 스케줄러에 적용된다.

The Dispatcher: Actually Handing Over the CPU

Once the short-term scheduler selects which process will use the CPU, the dispatcher intervenes in the next step. The dispatcher is the execution module responsible for actually transferring CPU control to the selected process. During this process, a context switch takes place, and the system transitions to user mode to begin actual execution.

To summarize the roles: the short-term scheduler decides who runs, and the dispatcher handles the transition to the running state. In other words, at the moment a process moves from the ready state to the running state, the short-term scheduler and the dispatcher work together.

디스패처: CPU를 실제로 넘기는 역할

단기 스케줄러가 준비 큐에 있는 프로세스 중 CPU를 사용할 프로세스를 선택하면, 그 다음 단계에서 디스패처(Dispatcher) 가 개입한다. 디스패처는 선택된 프로세스에게 CPU 제어를 실제로 넘기는 실행 담당 모듈이다. 이 과정에서 문맥 교환(Context Switching)이 이루어지며, 사용자 모드로 전환해 실제 실행이 시작된다.

역할을 정리하면 다음과 같다. 단기 스케줄러는 누구를 실행할지 결정하고, 디스패처는 실제 실행 상태로 전환하는 역할을 담당한다. 즉, 준비 상태에서 실행 상태로 넘어가는 순간에 단기 스케줄러와 디스패처가 함께 작동한다.

When the Short-Term Scheduler Intervenes

The short-term scheduler intervenes every time a specific event occurs. Concretely, it operates in the following situations.

When a running process terminates, when a process relinquishes the CPU due to an I/O request, when I/O completes and a process returns to the ready state, and when a time quantum expires or an interrupt occurs.

In this way, the short-term scheduler intervenes at every moment a process undergoes a state transition. Among the three schedulers — long-term, medium-term, and short-term — the short-term scheduler operates by far the most frequently.

단기 스케줄러의 개입 시점

단기 스케줄러는 특정 사건이 발생하는 순간마다 개입한다. 구체적으로는 다음과 같은 상황에서 작동한다.

실행 중인 프로세스가 종료될 때, 프로세스가 입출력 요청으로 인해 CPU를 반납할 때, 입출력이 완료되어 프로세스가 다시 준비 상태로 돌아올 때, 그리고 시간 할당량이 만료되거나 인터럽트가 발생할 때가 이에 해당한다.

이처럼 단기 스케줄러는 프로세스의 상태 전환이 이루어지는 순간마다 개입한다. 장기, 중기, 단기 세 가지 스케줄러 중에서 가장 빈번하게 동작하는 것이 바로 단기 스케줄러다.

Summary: Who Manages CPU Scheduling

Let's take a consolidated look at where each of the three schedulers intervenes in the process state transition flow.

The long-term scheduler decides which jobs to admit into memory at the creation stage — it selects which jobs on disk to load. The medium-term scheduler regulates the number of processes kept in memory by temporarily suspending some and reloading others. The short-term scheduler intervenes at the moment a process transitions from the ready state to the running state and selects which process to assign the CPU to.

These three schedulers intervene at different points in time, but they all operate together on a single process state transition flow. The system as a whole can only run stably when each scheduler's role interlocks properly with the others.

CPU 스케줄링의 관리 주체 정리

지금까지 배운 세 가지 스케줄러가 프로세스 상태 전이 흐름 속에서 어디에 개입하는지 한눈에 정리해보자.

장기 스케줄러는 생성 단계에서 메모리 진입을 결정한다. 디스크에 있는 작업 중 어떤 것을 메모리에 올릴지 선택하는 역할이다. 중기 스케줄러는 메모리 안에서 프로세스를 일시 중단하거나 다시 불러오는 방식으로 메모리에 유지되는 프로세스 수를 조절한다. 단기 스케줄러는 준비 상태에서 실행 상태로 넘어가는 순간에 개입해 CPU를 할당할 프로세스를 선택한다.

이 세 스케줄러는 서로 다른 시점에 개입하지만, 하나의 프로세스 상태 전이 흐름 위에서 함께 작동한다. 각자의 역할이 맞물려야 비로소 시스템 전체가 안정적으로 동작할 수 있다.

Preemptive vs. Non-Preemptive Scheduling

We said that the short-term scheduler intervenes every time a process state change occurs. This raises an important question: can the short-term scheduler forcibly stop a running process at any time?

The answer to this question divides scheduling into two major types. Preemptive scheduling can forcibly halt a running process and hand the CPU to another, while non-preemptive scheduling waits until the running process voluntarily relinquishes the CPU on its own.

Which approach is chosen significantly affects the system's responsiveness, fairness, and processing efficiency.

선점 vs 비선점 스케줄링

단기 스케줄러는 프로세스 상태 변화가 발생할 때마다 개입한다고 했다. 여기서 한 가지 중요한 질문이 생긴다. 단기 스케줄러는 실행 중인 프로세스를 언제든지 강제로 중단시킬 수 있을까?

이 질문에 대한 답에 따라 스케줄링 방식은 크게 두 가지로 나뉜다. 실행 중인 프로세스를 강제로 중단하고 다른 프로세스에게 CPU를 넘길 수 있는 선점 스케줄링(Preemptive Scheduling) 과, 실행 중인 프로세스가 스스로 CPU를 반납할 때까지 기다리는 비선점 스케줄링(Non-preemptive Scheduling) 이다.

어떤 방식을 선택하느냐에 따라 시스템의 응답성, 공정성, 그리고 처리 효율이 달라진다.

Non-Preemptive Scheduling

In non-preemptive scheduling, a process that has been assigned the CPU continues running until it terminates on its own or transitions to a waiting state due to an I/O request. The operating system does not intervene midway to forcibly stop the running process.

The advantage of this approach is that it is structurally simple and incurs low management overhead. Because context switches do not occur frequently, overall overhead remains low.

However, if a long-running task claims the CPU first, all other processes must wait until that task finishes. This can result in response delays, which is the key drawback.

Ultimately, non-preemptive scheduling can be understood as a structure that allows a process to keep running until it voluntarily gives up the CPU.

비선점 스케줄링

비선점 스케줄링(Non-preemptive Scheduling)은 CPU를 할당받은 프로세스가 스스로 종료되거나 입출력 요청으로 인해 대기 상태로 전환될 때까지 계속 실행되는 방식이다. 즉, 운영체제가 중간에 개입해 실행 중인 프로세스를 강제로 중단시키지 않는다.

이 방식의 장점은 구조가 단순하고 관리 부담이 적다는 것이다. 문맥 교환이 자주 발생하지 않기 때문에 오버헤드도 낮다.

그러나 실행 시간이 긴 작업이 먼저 CPU를 점유하면, 그 작업이 끝날 때까지 다른 프로세스는 계속 기다려야 한다. 이로 인해 응답 지연이 발생할 수 있다는 것이 단점이다.

결국 비선점 방식은 프로세스가 CPU를 자발적으로 반납할 때까지 실행을 계속 허용하는 구조라고 이해하면 된다.

Preemptive Scheduling

In preemptive scheduling, the operating system can reclaim the CPU from a running process whenever it judges this to be necessary. If the time quantum expires or a higher-priority process becomes ready, the currently running task can be halted and replaced with another process.

Rather than waiting for a process to give up the CPU on its own, the operating system directly intervenes and changes the execution flow. This approach prevents any single task from monopolizing the CPU for too long, making it well-suited for improving system responsiveness. The downside is that a context switch occurs every time a process is replaced, which can introduce additional overhead.

To summarize the difference simply: non-preemptive scheduling never takes the CPU away, while preemptive scheduling can reclaim it when needed. The various CPU scheduling algorithms that determine actual execution order are built on top of these two approaches, and they will be examined one by one in the next session.

선점 스케줄링

선점 스케줄링(Preemptive Scheduling)에서는 운영체제가 필요하다고 판단하면 실행 중인 프로세스의 CPU를 중간에 회수할 수 있다. 시간 할당량이 만료되었거나 더 높은 우선순위의 프로세스가 준비 상태가 되면, 현재 실행 중인 작업을 중단시키고 다른 프로세스로 교체할 수 있다.

즉, 프로세스가 스스로 CPU를 반납하기를 기다리는 것이 아니라, 운영체제가 직접 개입해 실행 흐름을 바꿀 수 있는 구조다. 이 방식은 특정 작업이 CPU를 오래 독점하는 것을 막을 수 있어 시스템의 응답성을 높이는 데 유리하다. 다만 프로세스가 교체될 때마다 문맥 교환이 발생하기 때문에 추가적인 오버헤드가 생길 수 있다는 단점이 있다.

두 방식의 차이를 간단히 정리하면, 비선점은 CPU를 빼앗지 않고, 선점은 필요하다면 CPU를 회수할 수 있는 방식이다. 이 두 방식을 토대로 실제 실행 순서를 결정하는 다양한 CPU 스케줄링 알고리즘이 만들어지며, 이에 대해서는 다음 시간에 하나씩 살펴볼 예정이다.

CPU Scheduling Execution Flow

We have now examined the structure and management entities of CPU scheduling. Let's look at actual execution examples to see how the execution flow of processes plays out in practice.

In particular, we will explore why execution order in situations that appear to involve simultaneous multi-process operation can differ from what we might expect.

CPU 스케줄링 실행 흐름

지금까지 CPU 스케줄링의 구조와 관리 주체를 살펴보았다. 이번에는 실제 실행 예제를 통해 프로세스의 실행 흐름이 어떻게 나타나는지 확인해보자.

특히 여러 프로세스가 동시에 작동하는 것처럼 보이는 상황에서, 실행 순서가 왜 우리가 예상하는 것과 다르게 나타나는지도 함께 살펴볼 것이다.

We Do Not Decide Execution Order

Although multiple processes appear to run simultaneously, the CPU actually handles only one execution at a time. The CPU alternates between processes in very short time intervals, which makes it feel to the user as though everything is running concurrently.

The important point here is that we do not directly specify this execution order. The operating system decides it. As a result, execution outcomes may not always be identical. Even when running the same program, the operating system may make different choices depending on the state of the system at that moment.

실행 순서는 우리가 정하지 않는다

여러 프로세스가 동시에 실행되는 것처럼 보이지만, 실제 CPU는 한 번에 하나의 실행만 처리한다. CPU는 매우 짧은 시간 단위로 프로세스를 번갈아 실행하기 때문에, 사용자 입장에서는 동시에 실행되는 것처럼 느껴지는 것이다.

여기서 중요한 점은, 우리가 이 실행 순서를 직접 지정하지 않는다는 것이다. 실행 순서는 운영체제가 결정한다. 따라서 실행 결과가 항상 동일하게 나타나지 않을 수 있다. 같은 프로그램을 실행하더라도 그 순간의 시스템 상태에 따라 운영체제가 다른 선택을 할 수 있기 때문이다.

Apparent Concurrency in Practice

When fork() creates parent and child processes, both processes execute a loop, print output, and wait briefly.

Let's look at usleep(1000000) used in the example. While the familiar sleep() waits in units of seconds, usleep() waits in microseconds. usleep(1000000) means 1,000,000 microseconds — that is, 1 second. If the wait time is shortened further, the parent and child processes alternate CPU usage more frequently, causing their output to appear more densely interleaved. This example deliberately uses a short wait time to make the effect of scheduling-driven execution order changes more clearly visible.

The key thing to notice is the output order. In some runs, parent appears first; in others, child appears first. We did not specify this order, yet the operating system alternates between the two processes and their output becomes mixed.

This is an example of apparent concurrency. In reality the processes take turns, but the output looks as though they are running simultaneously.

실행 예제로 보는 겉보기 동시성

fork()를 통해 부모 프로세스와 자식 프로세스가 생성되면, 두 프로세스는 각각 반복문을 실행하며 출력하고 잠시 대기한다.

여기서 사용된 usleep(1000000)을 살펴보자. 기존에 사용하던 sleep()은 초 단위로 대기하지만, usleep()은 마이크로초 단위로 대기한다. usleep(1000000)은 1,000,000 마이크로초, 즉 1초를 의미한다. 만약 대기 시간을 더 짧게 줄이면 부모와 자식 프로세스가 더 자주 CPU를 번갈아 사용하게 되어 출력이 더 촘촘하게 섞여 나온다. 이 예제는 의도적으로 짧은 대기 시간을 활용해 스케줄링에 의해 실행 순서가 바뀌는 모습을 더 명확하게 보여주기 위한 것이다.

여기서 주목해야 할 점은 출력 순서다. 어떤 경우에는 parent가 먼저 출력되고, 어떤 경우에는 child가 먼저 출력된다. 우리가 이 순서를 지정하지 않았음에도 운영체제가 두 프로세스를 번갈아 실행하면서 결과가 섞여 나타나는 것이다.

이것이 바로 겉보기 동시성의 예다. 실제로는 번갈아 실행되지만, 출력은 마치 동시에 실행되는 것처럼 보인다.

The Non-Determinism of Execution Order

Apparent concurrency reveals an important characteristic: even with an identical program, the output order of execution results may differ from run to run.

Why does this happen? Because we did not directly specify which process receives the CPU first. Among the processes in the ready state, which one runs first depends on the scheduling decision made at that moment. In other words, the selection of the execution target is not fixed.

As a result, running the same program again may produce a different order of results. This property is called the non-determinism of execution order.

실행 순서의 비결정성

겉보기 동시성에서 한 가지 중요한 특징이 드러난다. 동일한 프로그램이더라도 실행 결과의 출력 순서가 매번 같지 않을 수 있다는 것이다.

왜 이런 일이 발생할까? CPU를 먼저 할당받을 프로세스를 우리가 직접 지정하지 않았기 때문이다. 준비 상태에 있는 프로세스 중 누가 먼저 실행될지는 그 순간의 스케줄링 판단에 따라 달라진다. 즉, 실행 대상의 선택은 고정되어 있지 않다.

따라서 동일한 프로그램을 다시 실행하면 결과 순서가 달라질 수 있다. 이를 실행 순서의 비결정성이라고 한다.

Observing Non-Determinism Through an Example

Consider an example that uses fork() to create parent and child processes. This time, rather than focusing on the fact that each process has an independent execution flow, we pay attention to the fact that output order differs between runs even with the same program.

The child process prints child and the parent process prints parent. The code has no loops and no wait functions. Yet in some runs parent appears first, and in others child appears first.

This is not because the code differs. It is because the process that receives the CPU first can vary at each execution. The selection of the execution target is not fixed — it is determined by the scheduling decision made at that moment.

This property — whereby execution result order can differ from run to run even with an identical program — is called the non-determinism of execution order. This example demonstrates that non-determinism in its simplest possible form.

실행 순서의 비결정성 — 예제로 확인하기

fork()를 이용해 부모와 자식 프로세스를 생성하는 예제를 살펴보자. 이번에는 두 프로세스가 각각 독립적인 실행 흐름을 가진다는 점보다, 같은 프로그램을 실행해도 출력 순서가 매번 같지 않다는 점에 주목한다.

자식 프로세스는 child를 출력하고, 부모 프로세스는 parent를 출력한다. 이 코드에는 반복문도 없고 대기 함수도 없다. 그럼에도 어떤 경우에는 parent가 먼저, 어떤 경우에는 child가 먼저 출력될 수 있다.

이 현상은 코드가 달라서가 아니다. CPU를 먼저 할당받는 대상이 실행 시점마다 달라질 수 있기 때문이다. 실행 대상의 선택은 고정되어 있지 않으며, 그 순간의 스케줄링 판단에 따라 결정된다.

이처럼 동일한 프로그램이더라도 실행 결과의 순서가 매번 달라질 수 있는 성질을 실행 순서의 비결정성이라고 한다. 이 예제는 그 비결정성을 가장 단순한 형태로 보여주는 코드다.

The Repeated Cycle of Execution and Waiting

To understand why execution order is not fixed, we need to revisit process state changes.

A process that enters the running state does not continue using the CPU indefinitely. When an I/O request occurs, the running process transitions to the waiting state. Once the wait completes, it returns to the ready state and becomes a candidate for execution again. The flow of running → waiting → running is not a one-time sequence — it is a repeating structure.

The key point is that the moment a process transitions to the waiting state, it can no longer use the CPU. As a result, the CPU becomes idle and the operating system selects another ready process.

In a structure where execution and waiting repeat like this, the entity using the CPU constantly changes. That is why CPU execution order is not fixed and continues to vary.

실행과 대기의 반복

실행 순서가 고정되어 있지 않은 이유를 이해하려면 프로세스의 상태 변화를 다시 생각해볼 필요가 있다.

프로세스는 한 번 실행 상태가 되었다고 해서 계속 CPU를 사용하는 것이 아니다. 실행 중이던 프로세스는 입출력 요청이 발생하면 대기 상태로 전환된다. 대기 작업이 끝나면 다시 준비 상태로 돌아와 실행 대상 후보가 된다. 즉, 실행 → 대기 → 실행의 흐름이 한 번으로 끝나는 것이 아니라 반복되는 구조다.

여기서 중요한 점은, 프로세스가 대기 상태로 전환되는 순간 해당 프로세스는 더 이상 CPU를 사용할 수 없게 된다는 것이다. 그 결과 CPU는 비게 되고, 운영체제는 준비 상태에 있는 다른 프로세스를 선택한다.

이처럼 실행과 대기가 반복되는 구조에서는 CPU를 사용하는 주체가 계속 바뀔 수밖에 없다. 그래서 CPU 실행 순서 역시 고정되지 않고 계속 달라질 수 있는 것이다.

Execution and Waiting in Practice

In the example code, the process prints CPU running, waits briefly, then prints I/O waiting, and waits again — this flow repeats.

The sleep() that follows printf("CPU running\n") does not mean the process keeps holding the CPU — it means the process transitions to a waiting state for a period. The repeating structure is: print CPU running → wait → print I/O waiting → wait.

This example simply illustrates the alternating structure of process execution and waiting. Each time such a state change occurs, the CPU can select a different execution target, and as a result execution order continues to vary.

예제로 보는 실행과 대기의 반복

예제 코드에서는 CPU running을 출력한 뒤 잠시 대기하고, 이어서 I/O waiting을 출력한 뒤 다시 대기하는 흐름이 반복된다.

여기서 printf("CPU running\n") 뒤에 오는 sleep()은 프로세스가 CPU를 계속 점유하는 것이 아니라, 잠시 대기 상태로 전환되는 것을 의미한다. 즉, CPU running 출력 → 대기 → I/O waiting 출력 → 대기의 흐름이 반복되는 구조다.

이 예제는 프로세스의 실행과 대기가 번갈아 나타나는 구조를 단순하게 보여준다. 이러한 상태 변화가 발생할 때마다 CPU는 다른 실행 대상을 선택할 수 있고, 그 결과 실행 순서는 계속 달라질 수 있다.

Preemptive Scheduling: Switching Without Waiting

The examples seen so far involved the CPU passing to another process when a process transitioned to the waiting state. In preemptive scheduling, however, the CPU can be switched even when a process is not in the waiting state.

In preemptive scheduling, the operating system can reclaim the CPU and reassign it to another process even if the current task has not finished. This allows shorter tasks or higher-priority tasks to be handled more quickly, making it well-suited for improving system responsiveness.

The downside is that a context switch occurs every time a process is swapped out, which can introduce additional overhead.

The key point here is that the operating system can intervene and reassign the CPU even when a process is not in the waiting state. This is the most significant difference from the non-preemptive approach.

선점 스케줄링: 대기 없이도 교체된다

앞서 살펴본 예제는 프로세스가 대기 상태로 전환될 때 CPU가 다른 프로세스에게 넘어가는 구조였다. 그런데 선점 스케줄링에서는 프로세스가 대기 상태가 아니더라도 CPU가 교체될 수 있다.

선점 스케줄링에서는 작업이 아직 끝나지 않았더라도 운영체제가 판단에 따라 CPU를 회수하고 다른 프로세스에게 재할당할 수 있다. 이 방식은 짧은 작업이나 높은 우선순위의 작업을 더 빠르게 처리할 수 있어 시스템의 응답성을 향상시키는 데 유리하다.

다만 실행 중이던 작업을 중단하고 다른 작업으로 전환할 때마다 문맥 교환이 발생하기 때문에 추가적인 오버헤드가 생길 수 있다.

여기서 핵심은 대기 상태가 아니어도 운영체제가 개입해 CPU를 재할당할 수 있다는 점이다. 이것이 비선점 방식과의 가장 큰 차이다.

Preemptive Scheduling Example Code

This example does not directly implement preemptive scheduling. It is an example designed to observe execution results in an environment where preemptive scheduling is operating.

Looking at the code, the parent and child processes each run a while loop and continuously print their own messages. Neither process terminates on its own, and no instruction to yield the CPU has been written. In other words, the code contains no command to stop execution or hand off to another process.

Yet the execution results show output alternating between the two. This switching is not caused by the code — it is caused by the operating system's preemptive behavior. The operating system uses timer interrupts to reclaim the CPU from the running process after a set interval and switch to another.

In short, this example demonstrates that even when a process does not voluntarily relinquish the CPU, the operating system can intervene and swap execution.

선점 스케줄링 예제 코드

이 예제는 선점 스케줄링을 직접 구현한 코드가 아니다. 선점 스케줄링이 동작하는 환경에서 실행 결과를 관찰하기 위한 예제다.

코드를 보면 부모와 자식 프로세스가 각각 while 반복문을 수행하며 계속해서 자신의 메시지를 출력한다. 두 프로세스 모두 스스로 종료하지 않으며, CPU를 반납하는 명령도 작성되어 있지 않다. 즉, 코드 안에는 실행을 중단하거나 다른 프로세스로 넘기라는 명령이 전혀 없다.

그럼에도 실행 결과를 보면 출력이 번갈아 나타난다. 이 교체는 코드 때문이 아니라 운영체제의 선점 동작 때문이다. 운영체제는 타이머 인터럽트를 통해 일정 시간이 지나면 실행 중인 프로세스의 CPU를 회수하고 다른 프로세스로 전환한다.

즉, 이 예제는 프로세스가 스스로 CPU를 내려놓지 않더라도 운영체제가 개입해 실행을 교체할 수 있다는 것을 보여주는 코드다.

Non-Preemptive Scheduling: Only Voluntary Yielding

In contrast to preemptive scheduling, non-preemptive scheduling does not allow the operating system to forcibly stop a running process. Once a process is assigned the CPU, it continues running until its task finishes or it voluntarily transitions to a waiting state.

This structure is simple and incurs low management overhead. However, if a long-running task claims the CPU first, all other tasks must wait until it completes. This can lead to response delays that directly affect the performance users perceive.

비선점 스케줄링: 자발적 반납만 허용

선점 스케줄링과 대비되는 비선점 스케줄링을 살펴보자. 비선점 스케줄링에서는 운영체제가 실행 중인 프로세스를 강제로 중단시키지 않는다. 한 번 CPU를 할당받은 프로세스는 작업이 끝나거나 스스로 대기 상태로 전환될 때까지 계속 실행된다.

이 구조는 단순하고 관리 부담이 적다는 장점이 있다. 그러나 실행 시간이 긴 작업이 먼저 CPU를 점유하면, 그 작업이 끝날 때까지 다른 작업은 계속 기다려야 한다는 단점이 있다. 결국 응답 지연이 발생할 수 있으며, 이는 사용자 체감 성능에도 직접적인 영향을 미친다.

Non-Preemptive Scheduling Example Code

This example also does not directly implement non-preemptive scheduling. It is designed to observe the execution flow under non-preemptive conditions by creating a situation where forced switching does not occur.

After using fork() to create parent and child processes, the parent calls sleep(6) so that it waits until the child finishes. The important point is that sleep() is not simply a function that passes time — it is a call that transitions the process to a waiting state for a period. In other words, the parent does not continue using the CPU; it voluntarily enters a waiting state and becomes unable to use the CPU. During that time, the child process runs its loop and completes all of its output.

The execution results show all of the child's output appearing first, followed by the parent's output. This is not because the operating system forcibly stopped a running process — it is because the parent voluntarily transitioned to the waiting state, causing the CPU to pass to the child.

In short, this example shows how execution flow changes when a process voluntarily relinquishes the CPU.

비선점 스케줄링 예제 코드

이 예제 역시 비선점 스케줄링을 직접 구현한 코드가 아니다. 강제 교체가 발생하지 않는 상황을 만들어 비선점 방식에서의 실행 흐름을 관찰하는 예제다.

fork()를 이용해 부모와 자식 프로세스를 생성한 뒤, 부모 프로세스는 sleep(6)을 호출해 자식이 끝날 때까지 기다리도록 구성했다. 여기서 중요한 점은 sleep()이 단순히 시간을 보내는 함수가 아니라, 해당 프로세스를 일정 시간 동안 대기 상태로 전환시키는 호출이라는 것이다. 즉, 부모는 CPU를 계속 사용하는 것이 아니라 스스로 대기 상태로 들어가 CPU를 사용할 수 없게 된다. 그 사이에 자식 프로세스는 반복문을 통해 자신의 출력을 끝까지 수행한다.

실행 결과를 보면 자식의 출력이 모두 나타난 뒤 부모의 출력이 이어지는 것을 확인할 수 있다. 운영체제가 실행 중인 프로세스를 강제로 중단시킨 것이 아니라, 부모가 스스로 대기 상태로 전환되었기 때문에 CPU가 자식에게 넘어간 것이다.

즉, 이 예제는 프로세스가 CPU를 자발적으로 반납할 때 실행 흐름이 어떻게 달라지는지를 보여주는 코드다.

Summary

Process execution is not controlled by the user — it is managed by the operating system. Which process uses the CPU and when is not fixed in advance; it varies with the scheduling policy applied, and that policy determines the responsiveness users experience.

CPU scheduling is the mechanism that decides and controls the execution flow of processes. In the next session, we will examine the specific criteria used to determine execution order — the various scheduling algorithms — one by one.

정리

프로세스의 실행은 사용자가 직접 제어하는 것이 아니라 운영체제가 관리한다. 어떤 프로세스가 언제 CPU를 사용할지는 고정되어 있지 않으며, 적용되는 스케줄링 방식에 따라 실행 순서가 달라진다. 그리고 그 순서에 따라 사용자가 느끼는 응답성도 달라진다.

이처럼 프로세스의 실행 흐름을 결정하고 제어하는 것이 바로 CPU 스케줄링이다. 다음 시간에는 이 실행 순서를 결정하는 구체적인 기준, 즉 다양한 스케줄링 알고리즘을 하나씩 살펴볼 예정이다.

Practice: Observing Apparent Concurrency

Let's directly observe what happens when fork() creates parent and child processes that each run independently.

The execution results show the messages of the parent and child interleaved in the output. Although it looks as though the two processes are running simultaneously, the CPU handles only one task at a time. The reason output alternates is that the operating system switches execution targets at very high speed.

What this practice demonstrates is that the two processes are not truly running at the same time, what we see is apparent concurrency, produced by rapid execution switching.

실습으로 확인하는 겉보기 동시성

fork()를 호출했을 때 부모 프로세스와 자식 프로세스가 각각 생성되어 실행되는 것을 직접 확인해보자.

실행 결과를 보면 부모와 자식의 메시지가 섞여서 출력되는 것을 볼 수 있다. 겉으로 보기엔 두 프로세스가 동시에 실행되는 것처럼 보이지만, CPU는 한 번에 하나의 작업만 처리한다. 출력이 번갈아 나오는 이유는 운영체제가 실행 대상을 매우 빠르게 전환하기 때문이다.

이 실습을 통해 확인할 수 있는 것은, 실제로 두 프로세스가 동시에 실행되는 것이 아니라 빠른 실행 전환에 의해 만들어지는 겉보기 동시성이라는 점이다.

Practice: Observing Non-Determinism of Execution Order

This example is a simple program with no loops and no wait functions — the parent and child each print one message.

Running it shows that sometimes child appears first, and sometimes parent appears first. The code is identical, but who receives the CPU first is not fixed.

What this practice demonstrates is that even with an identical program, the order of execution results can differ from run to run. The selection of the execution target is not predetermined — non-determinism is inherent in CPU scheduling.

실습으로 확인하는 실행 순서의 비결정성

이 예제는 반복문도, 대기 함수도 없이 부모와 자식 프로세스가 각각 한 번씩 메시지를 출력하는 단순한 코드다.

실행해보면 어떤 경우에는 child가 먼저 출력되고, 어떤 경우에는 parent가 먼저 출력된다. 코드는 동일하지만 누가 먼저 CPU를 할당받는지는 고정되어 있지 않기 때문이다.

이 실습을 통해 확인할 수 있는 것은, 동일한 프로그램이더라도 실행 결과의 순서가 매번 달라질 수 있다는 점이다. 실행 대상의 선택은 미리 결정되어 있지 않으며, CPU 스케줄링에는 비결정성이 존재한다.

Practice: Observing Repeated Execution and Waiting

This example includes a call to sleep(). Remember that sleep() does not merely delay time — it transitions the process to a waiting state for a period.

Running it shows that after one process prints a message and appears to pause, the other process's output follows. This is because the process transitions to a waiting state via sleep() and can no longer use the CPU, at which point the operating system selects another ready process.

What this lab demonstrates is that processes repeat cycles of execution and waiting, and after each wait period, they pass through the ready state before running again.

실습으로 확인하는 실행과 대기의 반복

이 예제는 sleep() 호출을 포함한다. sleep()은 단순히 시간을 지연시키는 것이 아니라, 해당 프로세스를 일정 시간 동안 대기 상태로 전환시키는 호출이라는 점을 기억하자.

실행해보면 한 프로세스가 메시지를 출력한 뒤 잠시 멈추는 것처럼 보이고, 그 사이에 다른 프로세스의 출력이 이어지는 것을 확인할 수 있다. 실행 중이던 프로세스가 sleep()과 함께 대기 상태로 전환되면서 CPU를 사용할 수 없게 되고, 그 순간 운영체제는 준비 상태에 있는 다른 프로세스를 선택하기 때문이다.

이 실습을 통해 확인할 수 있는 것은, 프로세스는 실행과 대기를 반복하며 대기 이후에는 준비 상태를 거쳐 다시 실행된다는 흐름이다.

Practice: Observing Preemptive Scheduling

This example has both parent and child processes running while loops. The code contains no instruction to stop execution. Yet the execution results show the parent and child output alternating infinitely. Press Ctrl+C to terminate.

This switching does not happen because the processes voluntarily give up the CPU. It happens because the operating system forcibly swaps the running process at fixed time intervals.

What this lab demonstrates is the core characteristic of preemptive scheduling: even when execution is not finished, the operating system can reclaim the CPU when it judges this necessary.

실습으로 확인하는 선점 스케줄링

이 예제는 부모와 자식 프로세스가 모두 while 반복문을 수행하는 구조다. 코드 안에는 실행을 중단하라는 명령이 없다. 그럼에도 실행 결과를 보면 부모와 자식의 출력이 무한히 번갈아가며 나타난다. 종료하려면 Ctrl+C를 누르면 된다.

이 교체는 프로세스가 스스로 CPU를 내려놓은 것이 아니다. 운영체제가 일정 시간 단위로 실행 중인 프로세스를 강제로 교체하기 때문에 나타나는 결과다.

이 실습을 통해 확인할 수 있는 것은, 실행이 끝나지 않았더라도 운영체제가 필요하다고 판단하면 CPU를 회수할 수 있다는 선점 스케줄링의 핵심 특징이다.

Practice: Observing Non-Preemptive Scheduling

This example is configured so that one process calls sleep() and temporarily enters a waiting state.

Running it shows that one side finishes first, and then the other side executes. No forced switching occurs mid-execution. This is because the next process only runs once the running process voluntarily transitions to a waiting state or terminates.

What this practice allows us to directly observe is the defining characteristic of non-preemptive scheduling: execution flow only transfers when a process voluntarily relinquishes the CPU.

실습으로 확인하는 비선점 스케줄링

이 예제는 한 프로세스가 sleep()을 호출해 일시적으로 대기 상태로 들어가도록 구성한 코드다.

실행해보면 한쪽 작업이 먼저 끝난 뒤 다른 쪽 작업이 실행되는 것을 확인할 수 있다. 실행 도중 강제로 교체되는 모습은 나타나지 않는다. 실행 중인 프로세스가 스스로 대기 상태로 전환되거나 종료되어야 비로소 다음 프로세스의 실행이 이루어지기 때문이다.

이 실습을 통해 프로세스가 자발적으로 CPU를 반납할 때만 실행 흐름이 전환되는 비선점 스케줄링의 특징을 직접 확인할 수 있다.

Linux Text File & Editor: Vim

Heesu Noh — Wed, 08 Apr 2026 13:03:46 GMT

1️⃣ Text File
2️⃣ Use a Text Editor

1️⃣ Text File

Linux and Unix: Are They the Same?

Linux and Unix are not completely identical, but they can be considered very similar from a user's perspective. Looking at their internal structure, there are real differences, and there is also the distinction that Unix is paid while Linux is free. However, since they are not significantly different in terms of actual usage, learning how to use Linux will allow you to adapt to a Unix environment without much difficulty.

In this session, we will look at how to edit text files in a Linux environment.

텍스트 파일 편집: 리눅스와 유닉스는 같은가?

리눅스와 유닉스는 완전히 동일하지는 않지만, 사용자 관점에서는 매우 유사하다고 할 수 있다. 내부 구조를 들여다보면 실질적인 차이가 존재하고, 유닉스가 유료인 반면 리눅스는 무료라는 차이점도 있다. 하지만 실제 사용법 측면에서는 크게 다르지 않기 때문에, 리눅스를 중심으로 운영체제 사용법을 익혀두면 유닉스 환경에서도 무리 없이 적응할 수 있다.

이번 시간에는 실제 리눅스 환경에서 텍스트 파일을 어떻게 편집하는지 살펴본다.

What is a Text File?

The most familiar example of a text file is the .txt file. By definition, a text file is a file that stores electronically represented characters arranged in sequence.

The difference becomes clear when compared to a PPT file. PPT requires a specific company's editor which is PowerPoint - to open it, whereas text files are not tied to any specific format. There is a dedicated viewer for text files called a text editor, but since it comes pre-installed on computers at no additional cost, it is referred to as a basic editor.

So why is this basic editor necessary? The simplest reason is that it is needed for handling program inputs and modifying various system settings. Since text files are stored in a human-readable format, opening one in an editor lets you see characters like A, B, and C directly.

Early computers were developed around the alphabet, so non-alphabetic characters like Korean were not supported. Today, however, the underlying structure has evolved to support a wide range of languages.

텍스트 파일이란?

텍스트 파일은 평소에 익숙하게 접해온 .txt 파일이 대표적인 예다. 정의하자면, 전자적인 문자가 나열된 형태로 저장된 파일을 텍스트 파일이라고 한다.

PPT 파일과 비교하면 차이가 명확해진다. PPT는 파워포인트라는 특정 회사의 편집기가 있어야만 열람이 가능한 반면, 텍스트 파일은 특정 형식에 종속되지 않는다. 텍스트 파일을 열기 위한 전용 뷰어, 즉 텍스트 편집기가 존재하지만, 추가 비용 없이 컴퓨터에 기본으로 설치되어 있기 때문에 '기본 편집기'라고 부른다.

그렇다면 이 기본 편집기는 왜 필요할까? 가장 간단한 이유는 프로그램의 입력값을 다루거나 각종 설정을 변경할 때 필요하기 때문이다. 텍스트 파일은 사람이 읽기 쉬운 형식으로 되어 있어, 편집기로 열면 A, B, C와 같은 문자를 그대로 확인할 수 있다.

초기 컴퓨터는 알파벳 중심으로 발전해왔기 때문에 한국어와 같은 비알파벳 문자는 지원되지 않았다. 그러나 현재는 기본 구조 수준에서 다양한 언어를 지원할 수 있도록 발전하였다.

How to Check a Text File

To check whether a file is in text format, use the file command.

file ~/.profile

This will return something like .profile: ASCII text, confirming it is a text file. On the other hand,

file /usr/bin/ls

returns something like ELF 64-bit LSB pie executable .... This is an executable file, commonly referred to as a binary file. A binary file is represented in binary and is formatted for computers to process rather than for humans to read.

텍스트 파일 확인 방법

현재 다루는 파일이 텍스트 형식인지 확인하려면 file 명령어를 사용한다.

file ~/.profile

위 명령어를 입력하면 .profile: ASCII text와 같이 텍스트 형식임을 알려준다. 반면,

file /usr/bin/ls

를 입력하면 ELF 64-bit LSB pie executable ...과 같은 결과가 출력된다. 이는 실행 가능한 파일로, 통상 바이너리 파일이라고 부른다. 바이너리 파일은 이진수로 표현된 파일로, 사람이 읽기보다는 컴퓨터가 처리하기에 적합한 형식이다.

ASCII and Encoding

As mentioned earlier, Korean was added to computers later, which relates to ASCII - the early character encoding system. ASCII is composed of 7 bits and divides characters into two categories:

Printable characters: 32 ~ 126
Control characters (non-printable): 0 ~ 31, 127

Computers recognize everything as numbers. For example, uppercase A is stored as 65 and lowercase a as 97. These numbers are converted into human-readable characters according to ASCII-based encoding and decoding rules. Both 65 and 97 fall within the printable character range.

ASCII와 인코딩

한글은 컴퓨터에 나중에 적용되었다고 언급했는데, 이는 초기 컴퓨터 문자 체계인 ASCII와 관련이 있다. ASCII는 7비트로 구성되며, 문자는 크게 두 종류로 나뉜다.

출력 가능한 문자: 32 ~ 126
제어 문자 (출력되지 않는 문자): 0 ~ 31, 127

컴퓨터는 모든 것을 숫자로 인식한다. 예를 들어 대문자 A는 65, 소문자 a는 97로 저장되며, ASCII 기반의 인코딩·디코딩 규칙에 따라 이 숫자들이 사람이 읽을 수 있는 문자로 변환된다. 65와 97은 모두 출력 가능한 문자 범위에 해당한다.

Viewing File Contents: `head`

head is a command that outputs the beginning (head) of a file. It is used when you only want to check the first part of a text file that is organized line by line.

The reason for viewing only the beginning goes back to the limitations of early monitors. While modern monitors can display large amounts of text on a single screen, early monitors could only display around 40–80 characters wide and about 20 lines tall. Before that, there were no screens at all — punch card devices were used to output one line at a time. Because what could be shown on screen was so limited, and like a TV where content that has scrolled past cannot be seen again, commands were needed to view just a portion of a file.

Usage: head ~/.profile outputs the default 10 lines, and head -n 7 ~/.profile outputs only 7 lines using the -n option. The -n can be omitted and it will work the same way.

텍스트 파일 내용 보기: `head`

head는 파일의 앞부분(머리 부분)을 출력하는 명령어다. 텍스트 파일은 문자들이 줄 단위로 구성되어 있는데, 그 중 앞부분만 확인하고 싶을 때 사용한다.

굳이 앞부분만 보는 이유는 초기 모니터의 한계에서 비롯된다. 현재의 모니터는 많은 양의 텍스트를 한 화면에 표시할 수 있지만, 초기 모니터는 가로 40~80자, 세로 20줄 내외만 표현 가능했다. 그 이전에는 화면 자체가 없었고 펀치카드 장치로 한 줄씩 출력하는 방식을 사용했다. 이처럼 한 화면에 표시할 수 있는 내용이 제한적이었기 때문에, TV 화면처럼 한번 지나간 내용은 다시 볼 수 없는 환경에서 파일의 일부분만 확인하기 위한 명령어가 필요했다.

사용법은 다음과 같다.

head ~/.profile을 입력하면 기본값인 10줄이 출력되며, head -n 7 ~/.profile과 같이 -n 옵션으로 출력할 줄 수를 직접 지정할 수도 있다. -n은 생략해도 동일하게 동작한다.

Viewing File Contents: `tail`

tail is the counterpart to head - just like the relationship between a head and a tail, it outputs the end of a file. Entering tail -n 7 ~/.profile outputs the last 7 lines of the file. The usage is the same as head, but the result is different. In summary, head outputs from the beginning and tail outputs from the end.

Note that depending on the user's environment, the .profile file may not exist or its contents may differ. If the file has not been modified from its initial state, the results from head and tail may look nearly the same, but if changes have been made, the results will differ.

텍스트 파일 내용 보기: `tail`

tail은 head와 상응하는 명령어로, 머리와 꼬리의 관계처럼 파일의 뒷부분을 출력한다. tail -n 7 ~/.profile을 입력하면 파일의 마지막 7줄을 출력한다는 점에서 head와 사용법은 같지만 결과는 다르다. 정리하자면 head는 앞에서부터, tail은 뒤에서부터 지정한 줄 수만큼 출력한다고 이해하면 된다.

단, 사용자 환경에 따라 .profile 파일이 없거나 내용이 다를 수 있다. 초기 설정 그대로라면 head와 tail의 결과가 거의 동일하게 보일 수 있지만, 중간에 내용을 수정했다면 결과에 차이가 생긴다.

The Meaning of `~`

Why do we use ~ in commands? The ~ symbol represents the home directory of the currently logged-in user. It is both a relative and absolute reference. In most cases, users continue with the account they initially logged in with, but administrators can switch to another user without logging out. In that case, ~ points to the home directory of the switched user. In other words, ~ always refers to the home directory of the currently active user.

`~`의 의미

명령어에서 ~를 사용하는 이유는 무엇일까? ~는 현재 로그인한 사용자의 홈 디렉토리를 의미한다. 상대적인 표시이면서도 절대적인 위치를 가리킨다는 특징이 있다. 대부분의 경우 처음 로그인한 계정을 계속 사용하지만, 관리자는 로그아웃 없이도 다른 사용자로 전환할 수 있다. 이 경우 ~는 전환된 사용자의 홈 디렉토리를 가리키게 된다. 즉, ~는 항상 현재 활성화된 사용자의 홈 디렉토리를 의미한다.

Viewing File Contents: `cat`

cat stands for "concatenate files and print on the standard output." It is a command that joins files together and outputs their contents to the standard output device.

What is the standard output device? In modern terms, it is natural to think of a monitor, but this has not always been the case. In the early days of computing, the standard output device was paper — a printer. Therefore, saying "output to the monitor" is specific to a particular situation. The more accurate expression is "output to the standard output device." More details on standard output will be covered later.

Usage: entering cat ~/.profile outputs the entire contents of that file to the standard output device. In a typical environment, this will appear on the monitor.

텍스트 파일 내용 보기: `cat`

cat은 "concatenate files and print on the standard output" 파일을 이어 붙여 표준 출력장치로 내용을 출력하는 명령어다.

여기서 표준 출력장치란 무엇일까? 현재 기준으로는 모니터라고 생각하면 자연스럽지만, 이것이 항상 당연한 것은 아니다. 컴퓨터 초창기에는 표준 출력장치가 모니터가 아닌 종이, 즉 프린터였다. 따라서 "모니터에 출력된다"는 표현은 특정 상황에 한정된 표현이며, 보다 정확하게는 "표준 출력장치로 출력된다"고 하는 것이 맞다. 표준 출력에 대한 자세한 내용은 이후에 다룰 예정이다.

사용법은 다음과 같다. cat ~/.profile을 입력하면 해당 파일의 전체 내용이 표준 출력장치로 출력된다. 일반적인 환경에서는 모니터에 출력되는 것을 확인할 수 있다.

Viewing File Contents: `more`

more is a command that displays file contents divided according to the terminal screen size. Unlike cat, which outputs everything at once, more shows only as much as the current screen can display.

Running more ~/.profile shows --More-- (32%) at the bottom of the screen, meaning 32% of the total document is currently displayed. The amount shown varies depending on the terminal window size — whether it shows 24 lines or 50 lines depends on the current screen. Since it allows you to read through the entire content in order rather than cutting off a portion like head or tail, it is more useful when you actually want to read a file.

텍스트 파일 내용 보기: `more`

Display the contents of a file in a terminal

more는 파일의 내용을 터미널 화면 크기에 맞춰 나눠서 출력하는 명령어다. cat처럼 파일 전체를 한꺼번에 출력하는 것이 아니라, 현재 화면에 표시할 수 있는 만큼만 보여준다.

more ~/.profile을 실행하면 화면 하단에 --More-- (32%)와 같은 표시가 나타나는데, 이는 전체 문서 중 32%가 현재 화면에 표시되었다는 의미다. 화면 크기는 사용자마다 다를 수 있어 24줄짜리 화면이든 50줄짜리 화면이든 현재 터미널 창의 크기에 따라 표시되는 양이 달라진다. head나 tail처럼 일부만 잘라서 보는 것이 아니라 전체 내용을 순서대로 읽어나갈 수 있기 때문에, 파일 내용을 실제로 읽어볼 때 더 유용한 명령어다.

Viewing File Contents: `less`

Despite its name, less has more features than more. While more only allows forward navigation, less allows you to move both forward and backward freely. Like the scrollbar on the right side of a web browser, less lets you scroll up and down through the file content, making it more convenient for reading long files.

To exit less, press q.

텍스트 파일 내용 보기: `less`

less는 이름과 달리 more보다 기능이 많은 명령어다. more가 앞으로만 이동 가능한 반면, less는 앞뒤로 자유롭게 이동할 수 있다. 웹 브라우저 오른쪽의 스크롤 바처럼 파일 내용을 위아래로 자유롭게 탐색할 수 있어 긴 파일을 읽을 때 더욱 편리하다.

Using `cat` with `<<` (Here Document)

In addition to standalone use, cat can be used together with the << operator. << is called the here document operator and allows you to input multiple lines of text at once.

Usage: type cat << end and press Enter, then input the desired content line by line. When finished, type end on the last line and press Enter to terminate input.

`cat`과 `<<` (Here Document) 활용

cat은 단독으로 사용하는 것 외에도 << 연산자와 함께 복합적으로 활용할 수 있다. <<는 here document 연산자라고 부르며, 여러 줄의 텍스트를 한 번에 입력할 수 있게 해준다.

사용법은 다음과 같다. cat << end를 입력하고 엔터를 누른 뒤, 원하는 내용을 한 줄씩 입력한다. 입력이 끝나면 마지막 줄에 end를 입력하고 엔터를 누르면 입력이 종료된다.

Using `cat`, `<<`, and `>` Together

Adding > (redirect output) to the here document operator << allows you to save the input content to a file.

The > is the redirect output operator, meaning send the output to the specified file instead of the monitor. So > new.txt means save the input content to a file named new.txt. By using cat, <<, and > together, you can easily create a text file.

`cat`, `<<`, `>`를 함께 활용하기

앞서 살펴본 here document 연산자 <<에 >(redirect output)를 추가하면 입력한 내용을 파일로 저장할 수 있다.

여기서 >는 redirect output 연산자로, 출력 결과를 모니터 대신 지정한 파일로 보내라는 의미다. 즉 > new.txt는 입력한 내용을 new.txt라는 이름의 텍스트 파일로 저장하라는 뜻이 된다. 이처럼 cat, <<, >를 함께 사용하면 간단하게 텍스트 파일을 생성할 수 있다.

Using `echo` with `>` and `>>`

echo is a command that outputs the text you type. When used with >, it saves the content to a file.

Here, > overwrites any existing file content. In contrast, >> appends to the existing content. If > is used instead of >>, the existing sample-text would be gone and only another-line would remain. In summary, > means overwrite and >> means append. One thing to note: if >> is used when there is no existing content, since there is nothing to append to, a new file will be created with the input as the first line.

`echo`와 `>`, `>>` 활용

echo는 입력한 텍스트를 출력하는 명령어로, >와 함께 사용하면 내용을 파일로 저장할 수 있다.

여기서 >는 기존 파일 내용을 덮어쓴다. 반면 >>는 기존 내용에 이어붙이는 역할을 한다. 만약 >>대신 >를 사용했다면 기존의 sample-text는 사라지고 another-line만 출력된다. 정리하자면 >는 덮어쓰기, >>는 이어붙이기라고 이해하면 된다. 한 가지 주의할 점은, 아무 내용도 없는 상태에서 >>를 사용하면 이어붙일 내용이 없으므로 새 파일이 생성되며 입력한 내용이 첫 줄로 저장될 것이다.

Editing Text Files: `sed`

Once a file is created, editing becomes necessary. As content grows, it is not practical to recreate the file every time a single character needs to be fixed. This is where sed comes in, defined as "stream editor for filtering and transforming text."

cat sample.txt | sed 's/-/*/'

The meaning of 's/-/*/' is as follows: s stands for substitute, meaning replace - with *. Running the command outputs the following:

sample*text
another*line

Importantly, the original file is not changed. Running cat sample.txt again still outputs sample-text and another-line. The result of sed is only shown on the standard output device and is not applied to the original file.

텍스트 파일 편집: `sed`

파일을 만들었다면 이제 편집이 필요하다. 내용이 많아질수록 한 글자 때문에 파일을 매번 새로 만들 수는 없기 때문이다. 이때 사용하는 것이 sed로, "stream editor for filtering and transforming text"로 정의된다. 즉 텍스트를 걸러내고 변환하는 스트림 편집기다.

여기서 's/-/*/'의 의미는 다음과 같다. s는 substitute(치환)를 뜻하며, -를 *로 바꾸라는 의미다. 위 명령어를 실행하면 결과는 아래와 같이 출력된다.

sample*text
another*line

단, 중요한 점은 원본 파일은 변경되지 않는다는 것이다. cat sample.txt를 다시 실행하면 여전히 sample-text, another-line으로 출력된다. sed의 결과는 표준 출력장치에 보여지기만 할 뿐, 원본 파일에 적용되지 않기 때문이다.

Saving to a File: `tee`

To save the result of sed to a file, use the tee command. tee is defined as "read from standard input and write to standard output and files," meaning it can simultaneously write to both the standard output device and a file. In other words, the result is displayed on screen and saved to a file at the same time — two benefits in one.

Running cat sample.txt outputs sample-text and another-line. Piping this to tee smpl.txt displays the same content on screen while also saving it to a new file called smpl.txt. Running cat smpl.txt afterward will show the same result as cat sample.txt.

파일로 저장하기: `tee`

sed의 결과를 파일에 저장하려면 tee 명령어를 활용한다. tee는 "read from standard input and write to standard output and files"로 정의되며, 표준 입력에서 읽어온 내용을 표준 출력장치와 파일 양쪽에 동시에 쓸 수 있다. 즉, 화면에도 결과가 출력되면서 파일에도 저장되는 일석이조의 명령어다.

cat sample.txt를 실행하면 sample-text, another-line이 출력된다. 여기에 tee smpl.txt를 연결하면 같은 내용이 화면에 출력됨과 동시에 smpl.txt라는 새 파일에도 저장된다. 이후 cat smpl.txt를 실행하면 sample.txt의 결과와 동일하게 sample-text, another-line이 출력되는 것을 확인할 수 있다.

2️⃣ Use a Text Editor

Installing `vim`

Before installing vim, the package list needs to be updated. When Ubuntu was first installed, it may have only known about version 1.0, but version 1.5 with bug fixes may have since been released. Running sudo apt update refreshes the package list to its latest state. Then install vim with sudo apt install vim, and check the installed version by running vim --version.

`vim` 설치

vim을 설치하기 전에 먼저 패키지 목록을 최신화해야 한다. 우분투를 처음 설치했을 때는 1.0 버전만 알고 있었더라도, 이후 버그 패치 등을 거쳐 1.5 버전이 출시되었을 수 있기 때문이다. sudo apt update를 실행하면 패키지 목록이 최신 상태로 갱신된다. 이후 sudo apt install vim으로 vim을 설치하고, vim --version을 입력하면 현재 설치된 vim의 버전을 확인할 수 있다.

The `vim` Launch Screen

Type vi or vim at the prompt and press Enter to launch the editor. On the launch screen, each line begins with a ~ symbol — but be careful here. The ~ used in file paths means the current user's home directory, while the ~ at the start of each line in the vim screen means an empty line with no content. They look the same but mean completely different things, so be careful not to confuse them.

The numbers displayed on the right side of the screen indicate the current cursor position, representing the line (row) and column. In the center of the screen, a brief explanation and manual for using vim is displayed.

`vim` 실행 화면

프롬프트에서 vi 또는 vim을 입력하고 엔터를 누르면 편집기 화면이 실행된다. 실행 화면에서 각 줄 맨 앞에 ~ 표시가 보이는데, 여기서 주의할 점이 있다. 경로에서 사용하는 ~는 현재 사용자의 홈 디렉토리를 의미하지만, vim 화면에서 줄 맨 앞에 표시되는 ~는 아무 내용도 없는 빈 줄을 의미한다. 같은 모양이지만 전혀 다른 의미이므로 혼동하지 않도록 주의해야 한다.

화면 오른쪽에 표시되는 숫자는 현재 커서의 위치를 나타내며, 줄(row, line)과 칸(column)을 의미한다. 화면 가운데에는 vim 사용 시 참고할 수 있는 간단한 설명과 매뉴얼이 표시된다.

`vim`'s Two Modes

vim is divided into two main modes. One is INSERT mode, where you can directly type text or program code, indicated by -- INSERT -- at the bottom of the screen. The other is NORMAL mode, where instead of typing, you navigate - moving the cursor and finding specific locations.

When first encountering vim, this mode concept can be very confusing. Pressing keys does not type characters; the cursor just moves. However, this mode distinction is actually one of vim's strengths.

`vim`의 두 가지 모드

vim은 크게 두 가지 모드로 구분된다. 하나는 INSERT 모드로, 텍스트나 프로그램 코드를 직접 입력할 수 있는 상태다. 화면 하단에 -- INSERT --라고 표시된다. 다른 하나는 NORMAL 모드로, 텍스트를 입력하는 것이 아니라 커서를 이동하거나 원하는 위치를 찾는 등의 탐색을 하는 상태다.

처음 vim을 접하면 이 모드 개념이 굉장히 혼란스럽게 느껴진다. 키보드를 눌러도 글자가 입력되지 않고 커서만 움직이기 때문이다. 그러나 이 모드 구분이 오히려 vim의 장점으로 작용하기도 한다.

NORMAL Mode and EX Mode

NORMAL mode, as mentioned, is a state for navigation — moving the cursor, scrolling the screen, deleting content, pasting, and so on. Pressing : in NORMAL mode switches to EX mode, where you can enter commands.

EX mode originates from the concept of a line editor, derived from the era before visual interfaces when text was processed one line at a time.

To return to NORMAL mode from INSERT mode, press ESC. Pressing ESC twice guarantees a return to NORMAL mode regardless of the current state.

Entering INSERT Mode

INSERT mode is the state where text can be directly entered. From NORMAL mode, pressing any of i, I, a, A, o, O activates INSERT mode, shown by -- INSERT -- at the bottom of the screen. All of them enter INSERT mode, but each key differs in where the cursor is positioned when input begins.

i — starts input at the current cursor position
I — starts input at the beginning of the current line
a — starts input one position after the current cursor
A — starts input at the end of the current line
o — creates a new line below the current line and starts input there
O — creates a new line above the current line and starts input there

The cursor shape may appear as an underline, a block, or other forms. It is good practice to press ESC twice to ensure you are in NORMAL mode before trying each key to observe the differences in cursor placement.

`vim` INSERT 모드 진입 방법

INSERT 모드는 텍스트를 직접 입력할 수 있는 상태로, NORMAL 모드에서 i, I, a, A, o, O 중 하나를 누르면 화면 하단에 -- INSERT --가 표시되며 진입할 수 있다. 모두 INSERT 모드로 전환된다는 공통점이 있지만, 각 키마다 커서가 시작되는 위치가 다르다.

각 키의 동작 차이는 다음과 같다. i는 현재 커서 위치에서, I는 현재 줄의 맨 앞에서 입력이 시작된다. a는 현재 커서의 다음 글자 위치에서 입력이 시작되며, A는 현재 줄의 맨 끝에서 시작된다. o는 현재 줄의 다음 줄에 새 줄을 만들어 입력할 수 있고, O는 현재 줄의 이전 줄에 새 줄을 만들어 입력할 수 있다.

커서 모양도 밑줄, 네모 등 여러 형태로 표시될 수 있다. ESC를 습관적으로 두 번 눌러 NORMAL 모드로 확실히 전환한 뒤 각 키를 눌러보면 커서 위치가 달라지는 것을 직접 확인할 수 있다.

Creating an Example File and UTF-8 Encoding

Entering man ls > vimLS.txt saves the manual for the ls command into a file called vimLS.txt. man ls outputs the manual, and > redirects that output to a file. Running file vimLS.txt afterward shows the file's attributes — the result will show UTF-8 text.

This differs from earlier when file ~/.profile returned ASCII. ASCII is composed of 7 bits and can only represent 128 characters, which is not enough to include Korean and many other languages. UTF-8 was developed to overcome this limitation. It is a newly established encoding standard that accommodates languages with large character sets, such as Korean, Japanese, and Chinese, and is now used almost universally. When Korean characters fail to display properly, it is often referred to as a CJK problem - an abbreviation for Chinese, Japanese, Korean - indicating the system cannot represent those characters. The saved file can be opened in vim by running vim vimLS.txt.

`vim` 예시 파일 생성과 UTF-8 인코딩

man ls > vimLS.txt를 입력하면 ls 명령어의 설명서 내용이 vimLS.txt 파일로 저장된다. man ls는 ls 명령어의 매뉴얼을 출력하는 명령어이고, >로 리다이렉트하여 그 내용을 파일에 저장하는 것이다. 이후 file vimLS.txt를 입력하면 해당 파일의 속성을 확인할 수 있는데, 결과로 UTF-8 text가 출력된다.

앞서 file ~/.profile에서는 ASCII로 출력되었던 것과 차이가 있다. ASCII는 7비트로 구성되어 128가지 문자만 표현할 수 있기 때문에 한글을 비롯한 다양한 언어를 표현하지 못한다. 이러한 한계를 극복하기 위해 등장한 것이 UTF-8 인코딩 방식이다. UTF-8은 한글, 일본어, 중국어처럼 표현해야 할 문자 수가 많은 언어를 수용하기 위해 새롭게 제정된 인코딩 방식으로, 요즘은 거의 대부분 UTF-8을 사용한다. 한글이 제대로 표시되지 않을 때 CJK 문제라는 표현을 쓰는데, 이는 Chinese, Japanese, Korean의 약자로 해당 언어들의 문자를 표현하지 못하는 상태를 의미한다. 이렇게 저장된 파일은 vim vimLS.txt를 입력하면 저장된 내용을 vim 편집기에서 확인할 수 있다.

`vim` Cursor Movement

Opening vim vimLS.txt displays information at the bottom such as "vimLS.txt" 249L, 8383B. Here, 249L is the total number of lines including blank lines, and 8383B is the number of bytes the document occupies. The 1,1 in the bottom right indicates the current cursor position — line (row) and column.

One thing to note about bytes: CJK characters — Korean, Japanese, Chinese — typically occupy 2 bytes. For example, the word "한글" appears as two characters visually, but internally it is stored as more bytes than that.

Cursor movement can be done with h, j, k, l instead of the arrow keys — left, down, up, and right respectively. This is a legacy from early keyboards that lacked arrow keys. Since computers at that time were paper-based, directional movement was unnecessary. In the modern era, arrow keys are available, but the h, j, k, l convention remains a vim tradition and is still fully supported.vim 커서 이동

vim vimLS.txt를 입력하면 편집기 화면이 열리며, 화면 하단에 "vimLS.txt" 249L, 8383B와 같은 정보가 표시된다. 여기서 249L은 빈 줄을 포함한 전체 줄 수를 의미하고, 8383B는 문서가 차지하는 바이트 수를 의미한다. 화면 오른쪽 하단의 1,1은 현재 커서의 위치로, 줄(row)과 칸(column)을 나타낸다.

바이트 수와 관련하여 한 가지 알아둘 점이 있다. CJK 문자, 즉 한국어, 일본어, 중국어는 보통 2바이트를 차지한다. 예를 들어 "한글"은 눈에 보이기엔 두 글자지만 컴퓨터 내부에서는 그보다 더 많은 바이트로 처리된다.

커서 이동은 방향키 대신 h, j, k, l 키로 할 수 있다. 각각 왼쪽, 아래, 위, 오른쪽에 해당한다. 이는 초기 키보드에 방향키가 없었던 시절의 흔적이다. 당시 컴퓨터는 종이 기반으로 작동했기 때문에 상하좌우 이동이 필요 없었고, 별도의 키 매핑이 필요했다. 현대에는 키보드가 발전하여 방향키를 사용할 수 있지만, h, j, k, l 방식은 vim의 전통으로 남아 있으며 vim improved에서도 동일하게 사용 가능하다.

Cursor Movement: Beginning and End of a Line

In NORMAL mode, 0, $, and ^ allow quick movement within a line. 0 moves to the very beginning of the line, $ moves to the very end, and ^ moves to the first non-blank character of the line. For example, if there are spaces at the start of a line, 0 moves to the absolute beginning including those spaces, while ^ moves to where the actual content starts. This can also help identify whether the leading space was created with the spacebar or the tab key.

Note that cursor movement commands including h, j, k, l only work in NORMAL mode. Always confirm you have pressed ESC to switch to NORMAL mode before using them.

`vim` 커서 이동: 줄의 시작과 끝

NORMAL 모드에서 0, $, ^ 키를 사용하면 커서를 줄 단위로 빠르게 이동할 수 있다. 0을 누르면 줄의 맨 앞으로, $를 누르면 줄의 맨 끝으로 이동한다. ^를 누르면 줄에서 내용이 시작되는 첫 번째 문자로 이동한다. 예를 들어 줄 앞에 빈칸이 있는 경우, 0은 빈칸을 포함한 맨 앞으로 이동하지만 ^는 실제 내용이 시작되는 위치로 이동한다. 이를 통해 해당 빈칸이 스페이스바로 만들어진 것인지 탭 키로 만들어진 것인지도 확인할 수 있다.

주의할 점은 h, j, k, l을 비롯한 커서 이동 명령은 반드시 NORMAL 모드에서만 작동한다는 것이다. ESC 키를 눌러 NORMAL 모드로 전환된 것을 확인한 뒤 사용하도록 하자.

Screen-Level Movement

Beyond line-by-line movement, you can jump by entire screens. In NORMAL mode, Ctrl+f moves one full screen down, Ctrl+b moves one full screen up, Ctrl+d moves half a screen down, and Ctrl+u moves half a screen up. This is useful for navigating long files much faster than moving one line at a time.

`vim` 화면 단위 이동

한 줄씩 이동하는 것 외에도 화면 단위로 한 번에 이동할 수 있다. NORMAL 모드에서 Ctrl키와 함께 사용하며, Ctrl+f는 한 화면 아래로, Ctrl+b는 한 화면 위로 이동한다. Ctrl+d는 화면의 절반만큼 아래로, Ctrl+u는 화면의 절반만큼 위로 이동한다. 한 줄씩 이동하는 것보다 빠르게 원하는 위치로 점프할 수 있어 긴 파일을 탐색할 때 유용하다.

Jumping to a Specific Line

For files with tens of thousands of lines, even screen-level movement can take a long time. In NORMAL mode, pressing gg jumps to the first line, and pressing G jumps to the last line. Typing a line number followed by G jumps directly to that line.

This is one of the most powerful reasons to use vim — no need to scroll with a mouse to reach a specific location. While the experience varies by user, many people use vim not just because it looks impressive, but because it genuinely meets their needs.

`vim` 특정 줄로 이동

페이지가 수만 줄에 달하는 파일이라면 화면 단위 이동으로도 한참을 이동해야 한다. 이때 특정 줄로 바로 이동하는 기능을 활용할 수 있다. NORMAL 모드에서 gg를 누르면 첫 번째 줄로, 대문자 G를 누르면 맨 마지막 줄로 이동한다. 이동하고 싶은 줄 번호를 입력한 뒤 G를 누르면 해당 줄로 바로 이동할 수 있다.

이것이 vim을 사용하는 가장 강력한 이유 중 하나다. 마우스로 스크롤하지 않아도 원하는 위치로 즉시 이동할 수 있기 때문이다. 사용자에 따라 경험은 다르겠지만, vim을 사용하는 사람들은 단순히 멋있어서가 아니라 실제로 필요하기 때문에 사용하는 경우가 많다.

Word-Level Movement

Movement is also possible at the word level. In NORMAL mode, pressing w moves to the next word and b moves to the previous word.

One important thing to keep in mind is that case matters. For example, |, l, and I look visually similar but are entirely different characters. Catching these differences quickly comes from a developer's eye, and the more you work with computers from a developer's perspective, the more naturally this awareness develops.

`vim` 단어 단위 이동

한 문자씩 혹은 화면 단위 이동 외에도 단어 단위로 이동하는 것도 가능하다. NORMAL 모드에서 w를 누르면 다음 단어로, b를 누르면 이전 단어로 이동한다.

여기서 주의해야 할 점은 대소문자를 반드시 구분해야 한다는 것이다. 예를 들어 |, l, I는 시각적으로 비슷해 보이지만 모두 다른 문자다. 이러한 차이를 빠르게 캐치하는 것은 개발자적 감각에서 비롯되며, 컴퓨터를 개발자 입장에서 많이 다뤄볼수록 자연스럽게 익숙해진다.

Word-Level Movement 2: `B` and `W`

Unlike lowercase b and w, uppercase B and W use whitespace as the word boundary. Lowercase versions recognize special characters and punctuation as word boundaries, while uppercase versions only recognize spaces. Therefore, W moves to the word after the next space, and B moves to the word before the previous space. Since the movement range is broader than the lowercase versions, navigation is faster.

`vim` 단어 단위 이동 2: 대문자 `B`, `W`

앞서 살펴본 소문자 b, w와 달리 대문자 B, W는 빈칸을 기준으로 단어 단위 이동을 한다. 소문자는 특수문자나 구두점 등을 단어의 경계로 인식하지만, 대문자는 오직 빈칸만을 기준으로 삼는다. 따라서 W는 다음 빈칸 너머의 단어로, B는 이전 빈칸 너머의 단어로 이동한다. 이동 범위가 소문자보다 넓기 때문에 더 빠르게 이동할 수 있다.

Entering INSERT Mode 1: `i` and `a`

To type actual text or code in NORMAL mode, you need to switch to INSERT mode. The two most common keys are lowercase i and a. Both enter INSERT mode, but they differ in where input begins.

For example, if the cursor is on the second character of a 5-character text, pressing i starts input at the current position (second character), while pressing a starts input one position after (third character). In short, i starts at the current position and a starts one character to the right.

`vim` INSERT 모드 진입 1: `i`와 `a`

NORMAL 모드에서 실제 문자나 코드를 입력하려면 INSERT 모드로 전환해야 한다. 대표적인 방법이 소문자 i와 a이며, 둘 다 INSERT 모드로 진입하지만 커서 위치에 차이가 있다.

예를 들어 5칸짜리 텍스트에서 커서가 두 번째 칸에 위치해 있다고 가정하면, i를 누르면 현재 커서 위치인 두 번째 칸부터 입력이 시작된다. 반면 a를 누르면 현재 커서의 바로 다음 위치인 세 번째 칸부터 입력이 시작된다. 즉 i는 현재 위치에서, a는 현재 위치의 한 칸 뒤에서 입력이 시작된다는 차이가 있다.

Entering INSERT Mode 2: `o` and `O`

Lowercase o creates a new line below the current cursor position and begins input at the first character of that new line. Uppercase O creates a new line above the current cursor position and begins input at the first character of that new line.

Deleting and Undoing

In NORMAL mode, pressing x deletes the single character at the current cursor position. Pressing uppercase D deletes everything from the current cursor position to the end of the line. Pressing dd deletes the entire line.

For undoing, this is similar to Ctrl+Z in Windows. Pressing u in NORMAL mode undoes one action at a time, stepping back through previous states with each press. Uppercase U restores the entire current line to its state before any changes were made.

`vim` 삭제와 되돌리기

NORMAL 모드에서 x를 누르면 현재 커서 위치의 한 글자가 삭제된다. 대문자 D를 누르면 현재 커서 위치부터 줄의 끝까지 한 번에 삭제된다. dd를 누르면 커서가 있는 줄 전체가 삭제된다.

되돌리기는 윈도우의 Ctrl+Z와 유사한 개념이다. NORMAL 모드에서 u를 누르면 누를 때마다 이전 상태로 한 단계씩 되돌아간다. 대문자 U는 커서가 있는 줄 전체를 변경 이전 상태로 되돌린다.

Replacing a Single Character

As learned, x deletes a single character at the cursor position. To replace rather than delete a character, there are two methods.

Pressing ~ toggles the case of the character at the cursor position between uppercase and lowercase. This is useful in alphabetic language environments and does not apply to CJK characters like Korean, Japanese, or Chinese.

In NORMAL mode, pressing r followed by a desired key replaces the character at the cursor with the newly pressed character — a quick way to swap a single character without the extra step of deleting and retyping.

`vim` 한 글자 바꾸기

앞서 x는 현재 커서 위치의 한 글자를 삭제한다고 배웠다. 삭제가 아닌 한 글자만 바꾸고 싶을 때는 두 가지 방법을 사용할 수 있다.

~를 누르면 커서 위치의 문자 대소문자가 전환된다. 알파벳 기반의 언어권에서 유용하게 활용할 수 있으며, 한국어, 일본어, 중국어와 같은 CJK 문자에는 해당되지 않는다.

NORMAL 모드에서 r을 누른 뒤 원하는 키를 입력하면 현재 커서 위치의 글자가 새로 입력한 문자로 바뀐다. 삭제 후 다시 입력하는 번거로움 없이 한 글자를 빠르게 교체할 수 있다.

Searching for Content

In NORMAL mode, pressing / moves the cursor to the bottom of the screen. Type the search term and press Enter to jump to the matching location. / searches in the forward (downward) direction from the current cursor position. Pressing n continues searching in the same direction, while N searches in the opposite direction.

Pressing ? instead searches in the backward (upward) direction. In this case, n continues in the same direction (upward) and N goes in the opposite direction (downward). Note that forward and backward here refer to the direction of the search, not the visual position on screen.

`vim` 내용 찾기

NORMAL 모드에서 /를 누르면 화면 맨 아래로 커서가 이동한다. 찾고자 하는 문구를 입력하고 엔터를 누르면 해당 내용이 있는 위치로 이동한다. /는 현재 커서 위치에서 뒤쪽 방향으로 탐색을 시작한다. 이때 n을 누르면 같은 방향(뒤쪽)으로 계속 탐색하고, N을 누르면 반대 방향(앞쪽)으로 탐색한다.

반대로 ?를 누르고 문구를 입력하면 앞쪽 방향으로 탐색을 시작한다. 이 경우 n은 진행 방향 그대로 앞쪽으로, N은 반대 방향인 뒤쪽으로 탐색한다. 여기서 앞쪽과 뒤쪽은 화면 기준이 아닌 탐색이 진행되는 방향을 기준으로 한다는 점에 주의해야 한다.

Other `vim` Commands

Commands used after input can be referenced from the ed editor documentation. There are a great many commands, and knowing regular expressions will be an advantage in understanding and using them.

A notable example is :s/short/SHORT/gc, which replaces all instances of short with SHORT in the file. This is similar to the sed 's/-/*/' covered earlier. Here, gc stands for global confirm — it finds and replaces throughout the entire document while asking for confirmation each time.

Regular expressions are a vast topic — broad enough to be a subject on their own — so it is not possible to cover everything in this session. It is recommended to explore them separately when time allows, or through a related course.

For learning vim, the built-in help tutorial is a good resource, and https://vim-adventures.com/ offers a game-based approach to learning vim commands. Note that the site is paid.

`vim` 기타 명령어

입력 후 사용하는 명령어는 ed 편집기 사용법을 참고하면 된다. 사용법이 매우 많으며, regular expression(정규 표현식)을 알고 있다면 이해와 활용에 유리하다.

대표적인 예로 :s/short/SHORT/gc는 파일 안의 모든 short를 SHORT로 변경하는 명령어다. 앞서 배운 sed 's/-/*/'와 유사한 방식이다. 여기서 gc는 global confirm의 약자로, 문서 전체에서 찾아 바꾸되 매번 확인을 거친다는 의미다.

정규 표현식은 그 자체로 하나의 과목이 될 만큼 방대한 주제이므로 이번 시간에 모두 다루기는 어렵다. 관련 과목을 수강하거나 시간이 날 때 별도로 찾아보는 것을 권장한다.

vim 학습에 도움이 되는 자료로는 vim 내장 help의 튜토리얼이 있으며, https://vim-adventures.com/ 에서 게임 형식으로 사용법을 익힐 수도 있다. 단, 해당 사이트는 유료임을 참고하자.

Test Design: Concepts, Techniques, and Black Box Testing

Heesu Noh — Sat, 04 Apr 2026 02:50:08 GMT

1️⃣ Core Concepts of Test Design
2️⃣ Classification of Test Design Techniques
3️⃣ Black Box Test Preview

What is Test Design?

Test design is an activity performed after test planning is complete. It is the stage where you define what to test and how to test it. During this process, the test targets and scope are determined, enabling more efficient test execution.

In software testing, it is practically impossible to verify every possible combination. For example, with just 2 variables that hold True/False values, there are already 4 combinations. As the number of variables grows to 20 or 100, the number of possible combinations increases exponentially, making exhaustive testing completely infeasible. This is precisely why an efficient test strategy is needed — and test design plays that central role.

From a PDCA cycle perspective, test design falls under the P (Plan) phase. This is because it is preparatory work that must be completed before tests are actually executed dynamically. Specifically, this phase includes test case development, test procedure definition, and test environment preparation. Each of these activities will be covered in detail in subsequent posts.

테스트 설계란 무엇인가?

테스트 설계는 테스트 계획이 완료된 이후에 수행되는 활동으로, 무엇을 어떻게 테스트할지를 구체화하는 단계다. 이 과정에서 테스트 대상과 범위가 결정되며, 이를 바탕으로 보다 효율적인 테스트를 수행할 수 있게 된다.

소프트웨어 테스트에서 모든 경우의 수를 전부 검증하는 것은 현실적으로 불가능하다. 예를 들어 True/False 값을 가지는 변수가 단 2개만 있어도 조합은 4가지가 되는데, 변수가 20개, 100개로 늘어나면 가능한 조합의 수는 기하급수적으로 증가해 완전한 테스트 자체가 불가능해진다. 바로 이 때문에 효율적인 테스트 전략이 필요하며, 테스트 설계가 그 핵심 역할을 담당한다.

PDCA 사이클 관점에서 테스트 설계는 P(Plan) 단계에 해당한다. 실제 테스트를 동적으로 실행하기 이전에 사전에 준비해야 하는 작업이기 때문이다. 구체적으로는 테스트 케이스 개발, 테스트 절차 정의, 테스트 환경 준비가 이 단계에 포함된다. 각 세부 활동에 대한 내용은 이후 강의에서 다룰 예정이다.

Test Design as Defined by ISO/IEC/IEEE 29119

ISO/IEC/IEEE 29119 is an international standard for software testing. Let's look at how test design is positioned within this standard.

The standard divides the test process into two main areas: the Test Management Process and the Dynamic Test Process. Test planning takes place within the test management process, and its outputs are passed on to the dynamic test process where actual testing activities occur.

Within the dynamic test process, there are two areas that correspond to test design.

The first is Test Design and Implementation. This stage takes the test plan as input and concretizes what and how to test, with Test Basis analysis as its starting point.

The second is Test Environment Setup and Maintenance. Based on the environment requirements identified during test design and implementation, this activity involves building and managing the actual test environment.

The outputs of these two stages; the test cases and the prepared test environment; are passed together to the Test Execution stage, where dynamic testing finally takes place.

In summary, the key point is that within the 29119 standard, test design is defined as a concept that encompasses not just writing test cases, but also setting up the test environment.

ISO/IEC/IEEE 29119에서 정의하는 테스트 설계

ISO/IEC/IEEE 29119는 소프트웨어 테스팅에 관한 국제 표준으로, 이 표준에서 테스트 설계가 어떻게 위치하는지를 살펴보자.

표준에서 테스트 프로세스는 크게 테스트 관리 프로세스와 동적 테스트 프로세스로 구분된다. 테스트 계획은 테스트 관리 프로세스에서 진행되며, 그 결과가 동적 테스트 프로세스로 전달되어 실제 테스트 활동이 이루어진다.

동적 테스트 프로세스 내에서 테스트 설계에 해당하는 영역은 두 가지다.

첫째, 테스트 설계 및 구현이다. 테스트 계획을 입력으로 받아 테스트 베이시스(Test Basis)를 도출하는 단계로, 무엇을 어떻게 테스트할지 구체화된다.

둘째, 테스트 환경 구성 및 유지다. 테스트 설계 및 구현 단계에서 도출된 환경 요구사항을 바탕으로 실제 테스트 환경을 구성하고 관리하는 활동이다.

이 두 단계의 결과물, 즉 테스트 케이스와 준비된 테스트 환경이 함께 테스트 수행 단계로 전달되어 비로소 동적 테스트가 실제로 실행된다.

정리하면, 29119 표준에서 테스트 설계는 단순히 테스트 케이스를 작성하는 것에 그치지 않고, 테스트 환경 구성까지 포함하는 개념으로 정의된다는 점이 핵심이다.

Test Design and Implementation / Test Environment Setup and Maintenance

ISO/IEC/IEEE 29119 divides test design into two major activities.

Test Design and Implementation is the stage where test cases and procedures are defined based on the test scope and strategy identified in the test plan. The starting point is Test Basis analysis. The Test Basis refers to the artifacts that serve as the foundation for creating test cases. In the V-model, for example, architecture design artifacts serve as the basis for creating integration test cases - the left-side (development) artifacts that are referenced to build right-side (test) cases. By analyzing the Test Basis, the following items are derived: Test requirements, Test conditions, Test coverage criteria and Test cases.

Test Environment Setup and Maintenance requires that an appropriate environment and data be ready before tests can actually be executed. Taking automotive software as an example, testing may begin in a PC environment, gradually expand to a laboratory setting, and ultimately extend to a real road environment where actual vehicles operate. Constructing these various test environments and preparing the necessary data for each is the core activity of this stage.

테스트 설계 및 구현 / 테스트 환경 구성 및 유지

ISO/IEC/IEEE 29119 표준에서 테스트 설계는 크게 두 가지 활동으로 나뉜다.

테스트 설계 및 구현 테스트: 계획에서 식별된 테스트 범위와 전략을 바탕으로 테스트 케이스와 절차를 구체화하는 단계다. 이 과정의 출발점은 테스트 베이시스(Test Basis) 분석이다. 테스트 베이시스란 테스트 케이스를 만들기 위한 기반이 되는 산출물을 말한다. V모델을 예로 들면, 통합 테스트 케이스를 만들 때 아키텍처 설계 산출물이 그 기반이 된다. 즉, 오른쪽(테스트 단계)의 케이스를 만들기 위해 참조하는 왼쪽(개발 단계)의 산출물이 테스트 베이시스에 해당한다. 이 테스트 베이시스를 분석함으로써 다음 항목들을 도출하게 된다.

테스트 요구사항
테스트 조건
테스트 커버리지
기준 테스트 케이스

테스트 환경 구성 및 유지: 테스트를 실제로 실행하려면 적절한 환경과 데이터가 준비되어야 한다. 자동차 소프트웨어를 예로 들면, 초기에는 PC 환경에서 테스트하지만 점차 실험실 환경으로 확장되고, 최종적으로는 실제 차량이 주행하는 도로 환경에서까지 테스트가 이루어질 수 있다. 이처럼 다양한 테스트 환경을 구성하고, 각 환경에서 필요한 데이터를 준비하는 것이 이 단계의 핵심 활동이다.

The Need for Test Design

Why is test design necessary? It can be summarized into two main purposes.

First, to perform efficient and effective testing.

Second, to properly verify software across diverse platforms and environments.

More detailed coverage of each purpose follows below.

테스트 설계의 필요성

테스트 설계는 왜 필요한 걸까? 크게 두 가지 목적으로 정리할 수 있다.

첫째, 효율적이면서 효과적인 테스트를 수행하기 위해서다.

둘째, 다양한 플랫폼과 환경에서 소프트웨어를 제대로 검증하기 위해서다.

각 목적에 대한 더 자세한 내용은 이어지는 내용에서 다룰 예정이다.

Two Reasons Why Test Design is Necessary

Efficient and Effective Test Execution Verifying every single test case is simply not realistic. Projects always operate under limited resources — people, budget, tools, and equipment. Therefore, a systematic approach is needed to secure sufficient coverage with a minimal number of test cases. What matters is not the sheer volume of test cases, but the ability to derive cases with a high probability of detecting defects. Even with tens of thousands of test cases, if only a handful of defects are found, it cannot be considered good testing. Test design provides the systematic techniques to achieve this.
Verification Across Diverse Environments and Platforms No matter how well software runs on a developer's PC, it is meaningless if it does not operate reliably in the actual user environment. To verify behavior across various operating systems, browsers, hardware, and devices, the corresponding environments must be set up in advance. From the V-model perspective, starting to build the test environment only after coding is finished is already too late. Test environments, equipment, and tool configurations must be prepared in parallel with the left-side activities of writing test specifications and specs.

Key Point: Test design does not happen after coding — it proceeds in parallel with the development activities of the V-model.

테스트 설계의 필요성; 두 가지 이유

효율적이고 효과적인 테스트 수행: 모든 테스트 케이스를 일일이 검증하는 것은 현실적으로 불가능하다. 프로젝트에서는 인력, 비용, 도구, 장비 등 자원이 항상 제한되어 있기 때문이다. 따라서 적은 수의 테스트 케이스로 충분한 커버리지를 확보하는 체계적인 접근이 필요하다. 이때 중요한 것은 단순히 테스트 케이스의 수가 아니라 결함을 발견할 확률이 높은 테스트 케이스를 도출하는 것이다. 테스트 케이스가 수만 개라도 발견한 결함이 극히 일부에 불과하다면 좋은 테스트라고 할 수 없다. 테스트 설계는 이를 위한 체계적인 기법을 제공한다.
다양한 환경과 플랫폼에서의 검증: 개발자 PC에서 아무리 잘 동작하는 소프트웨어라도, 실제 사용자 환경에서 안정적으로 동작하지 않으면 의미가 없다. 다양한 운영체제, 브라우저, 하드웨어, 디바이스에서의 동작 여부를 검증하려면 해당 환경이 미리 구축되어 있어야 한다. V모델 관점에서 보면, 테스트 환경 구축은 코딩이 끝난 후에 시작해서는 너무 늦다. 테스트 명세와 테스트 스펙을 작성하는 왼쪽 활동과 병행해서 실제 테스트 환경, 장비, 도구 세팅을 함께 준비해 나가야 한다.

핵심 포인트: 테스트 설계는 코딩 이후가 아닌, V모델의 개발 활동과 함께 진행된다는 점을 반드시 기억하자.

Test Design Activity ① Test Case Development

The first major activity in test design is test case development. Based on the Test Basis; the left-side V-model artifacts such as requirements documents, design documents, and code. This involves writing test cases to verify that the software properly satisfies its requirements.

A test case is a document that systematically records test conditions, input values, expected output values, and actual results. Let's look at a concrete example.

In TC_1, since the lowercase "cuk" is stored in the DB, entering "CUK" in uppercase should produce an "ID not found" warning. If the actual result is "Login successful," it means case sensitivity is not properly implemented, and debugging is required. Once resolved, if "ID not found" warning appears correctly, the defect is fixed.

TC_2's expected output and actual result both show "Incorrect password warning," so it is determined to be functioning correctly.

In this way, test cases are written based on the Test Basis; test specs, design documents, etc. — to systematically verify that the software meets its requirements.

테스트 설계의 주요 활동 ① 테스트 케이스 개발

테스트 설계의 첫 번째 주요 활동은 테스트 케이스 개발이다. 요구사항, 설계서, 코드 등 V모델 왼쪽의 산출물인 테스트 베이시스를 기반으로, 소프트웨어가 요구사항을 제대로 만족하는지 확인하기 위한 테스트 케이스를 작성하는 작업이다.

테스트 케이스는 테스트 조건, 입력값, 예상 출력값, 실행 결과 등을 체계적으로 기록한 문서다. 아래 예시를 통해 구체적으로 살펴보자.

TC_1의 경우 DB에 소문자 cuk로 저장되어 있으므로 대문자 CUK 입력 시 "아이디 없음 경고"가 예상 출력값이다. 그런데 실행 결과가 "정상 로그인"이라면 대소문자 구분이 제대로 구현되지 않은 것이므로 디버깅이 필요하다. 조치 후 "아이디 없음 경고"가 정상 출력되면 결함이 해결된 것이다.

TC_2는 예상 출력값과 실행 결과가 모두 "비밀번호 틀림 경고"로 일치하므로 정상 동작으로 판단한다.

이처럼 테스트 케이스는 테스트 스펙이나 설계서 등 테스트 베이시스를 기반으로, 소프트웨어가 요구사항을 만족하는지를 체계적으로 검증하기 위한 형태로 작성된다.

[Reference] Test Case Components

There are certain components that must be included when writing a test case.

ID is a naming convention used to identify each test case. When there are hundreds or thousands of test cases, a systematic ID system is essential for distinguishing them. For example, unit tests might use UT-1, UT-2, and system tests might use ST-1, ST-2.

Conditions are the preconditions required for the test to run. These specify what must be in place before execution — such as a specific environment being set up, or certain variable values being pre-stored.

Expected output is the anticipated result when the test runs correctly. Just as "ID not found warning" is the expected output when an uppercase ID is entered in the earlier example, each test case must have a clearly defined expected result defined in advance.

In addition, the test purpose, test execution date, and assigned tester should also be recorded when writing test cases.

[참고] 테스트 케이스의 구성 항목

테스트 케이스를 작성할 때 반드시 포함되어야 하는 구성 항목들이 있다.

ID는 테스트 케이스를 식별하기 위한 명명 규칙이다. 테스트 케이스가 수백, 수천 개에 달하는 경우 각각을 구분할 수 있어야 하므로 체계적인 ID 부여가 필수다. 예를 들어 유닛 테스트라면 UT-1, UT-2, 시스템 테스트라면 ST-1, ST-2와 같은 형식으로 작성한다.

조건은 테스트가 수행되기 위한 사전 조건이다. 특정 환경이 갖춰져 있어야 한다거나, 변수값이 미리 저장되어 있어야 하는 등 테스트 실행 전에 반드시 정의되어 있어야 하는 항목들을 명시한다.

예상 출력값은 테스트가 정상적으로 수행될 때 기대되는 결과값이다. 앞선 예시에서 대문자 아이디 입력 시 "아이디 없음 경고"가 출력되어야 하는 것처럼, 테스트 케이스마다 명확한 기대값이 사전에 정의되어 있어야 한다.

이 외에도 테스트 목적, 테스트 수행 날짜, 담당 테스터 등의 항목도 테스트 케이스 작성 시 함께 기록되어야 한다.

Test Design Activity ② Test Procedure Definition

The second major activity in test design is test procedure definition. This involves determining the specific order in which various tests — unit, integration, system, acceptance, etc. — will be performed.

Integration Test Procedure Example

In the V-model, integration testing verifies that interfaces between modules operate correctly, based on the software architecture. A representative artifact used here is the sequence diagram — a diagram that represents the flow of function calls between modules from top to bottom.

For example, when integration testing software composed of Mod_A through Mod_G, the procedure can be defined incrementally as follows. Rather than integrating all modules at once, you first verify that Function_A() between A and D operates correctly, then add E in step 2, followed by F, G, and so on. When software contains dozens or hundreds of modules, integrating everything at once is impossible — which is why defining a step-by-step procedure starting from small units is essential.

Key Point

Test procedure definition is not a concept exclusive to integration testing. Systematically defining the execution order applies to all test types, including unit testing and acceptance testing. From the V-model perspective, test procedure definition must be carried out in parallel with the left-side development activities.

테스트 설계의 주요 활동 ② 테스트 절차 정의

테스트 설계의 두 번째 주요 활동은 테스트 절차 정의다. 단위, 통합, 시스템, 인수 등 다양한 테스트를 어떤 순서로 수행할지 구체적인 절차를 결정하는 활동이다.

통합 테스트 절차 예시

V모델에서 통합 테스트는 소프트웨어 아키텍처를 기반으로 모듈 간 인터페이스가 적절히 동작하는지를 검증한다. 이때 활용되는 대표적인 산출물이 시퀀스 다이어그램이다. 시퀀스 다이어그램은 모듈 간에 수행되는 함수 호출 흐름을 위에서 아래로 표현한 다이어그램이다.

예를 들어 Mod_A ~ Mod_G로 구성된 소프트웨어를 통합 테스트한다면, 절차는 다음과 같이 단계적으로 정의할 수 있다. 한 번에 모든 모듈을 통합하는 것이 아니라, 1단계에서 A와 D 간의 Function_A() 호출이 정상 동작하는지 확인한 뒤, 2단계에서 E를 추가하고, 이후 F, G를 순차적으로 붙여가며 검증하는 방식이다. 소프트웨어 내에 수십, 수백 개의 모듈이 존재할 때 한 번에 통합하는 것은 불가능하므로, 이처럼 작은 단위부터 순서대로 통합해 나가는 절차가 반드시 필요하다.

핵심 포인트

테스트 절차 정의는 통합 테스트에만 해당하는 개념이 아니다. 단위 테스트, 인수 테스트 등 모든 테스트 유형에서 수행 순서를 체계적으로 정의하는 것이 필요하다. 또한 V모델 관점에서 테스트 절차 정의는 개발의 왼쪽 활동과 함께 병행하여 수행되어야 한다는 점을 반드시 기억하자.

Test Design Activity ③ Test Environment Preparation

The third major activity in test design is test environment preparation. This involves deciding in which environment each test; unit, integration, system, acceptance - will actually be executed, and building that environment in advance.

Test environment preparation requires not just software, but also hardware, tools, and equipment. Taking an automotive electronic control system as an example, the environment expands incrementally as integration progresses:

Application Software — standalone test
Platform Software — integration and test
Hardware (ECU) — integration and test
Sensor / Actuator — connection and test
Mechanics — full system test including mechanical components

Software development alone does not make testing possible. Sensors that provide input to the controller, actuators that handle actual operation, and mechanical components must all be prepared at each stage well in advance.

For this reason, test environment preparation should not begin after development is complete. It must be systematically planned and built in parallel with the left-side activities of the V-model.

테스트 설계의 주요 활동 ③ 테스트 환경 준비

테스트 설계의 세 번째 주요 활동은 테스트 환경 준비다. 단위, 통합, 시스템, 인수 등 각 테스트를 실제로 어떤 환경에서 수행할지 결정하고, 그에 맞는 환경을 미리 구축하는 활동이다.

테스트 환경 준비에는 소프트웨어뿐만 아니라 하드웨어, 도구, 장비 등이 함께 갖춰져야 한다. 자동차 전자 제어 시스템을 예로 들면, 전체 구조는 다음과 같이 단계적으로 통합되며 테스트 환경도 이에 맞춰 확장된다.

Application Software 단독 테스트
Platform Software 통합 후 테스트
Hardware(ECU) 통합 후 테스트
Sensor / Actuator 연결 후 테스트
Mechanics(기계적 구성 요소) 포함한 전체 시스템 테스트

이처럼 소프트웨어만 개발한다고 테스트가 가능한 것이 아니다. 제어기에 입력을 제공하는 센서, 실제 구동을 담당하는 액추에이터, 그리고 기계적 구성 요소까지 각 단계에서 필요한 환경이 미리미리 준비되어 있어야 한다.

이러한 이유로 테스트 환경 준비는 개발이 완료된 후에 시작하는 것이 아니라, V모델의 왼쪽 활동과 병행하여 사전에 체계적으로 계획하고 구축해 나가야 한다.

2️⃣ Classification of Test Design Techniques

Classification of Test Design Techniques

When creating test cases, systematic test design techniques are applied. Before exploring the types of techniques, let's take a moment to think about it with the V-model in mind.

How can test design techniques be classified?

In the V-model, unit, integration, system, and acceptance tests correspond to each stage of development. Just as the target and purpose of testing differ at each stage, the techniques used to derive test cases can also be classified in various ways depending on the criteria and perspective. The specific classification will be explored in the content that follows.

테스트 설계 기법의 분류

테스트 케이스를 만들 때는 체계적인 테스트 설계 기법을 적용한다. 기법의 종류를 살펴보기 전에, 먼저 V모델을 떠올리며 스스로 생각해보자.

테스트 설계 기법은 어떤 기준으로 분류할 수 있을까?

V모델에서는 개발 단계에 따라 단위, 통합, 시스템, 인수 테스트가 대응된다. 각 단계마다 테스트의 대상과 목적이 다르듯, 테스트 케이스를 도출하는 기법도 그 기준과 관점에 따라 다양하게 분류될 수 있다. 구체적인 분류 방법은 이어지는 내용에서 살펴보도록 하자.

Classification of Test Design Techniques — Black Box vs. White Box

Test design techniques are broadly divided into two categories: Black Box Testing and White Box Testing.

Black Box Testing (Specification-Based Testing)

A "black box" refers to a state where the internal structure and details are not visible. Black box testing focuses solely on whether the output for a given input is appropriate, without considering the software's internal logic. For example, the goal is to verify that inputting 1 and 2 produces 3; not to examine what logic internally calculated that result.

Test cases are derived not from source code, but from the left-side artifacts of the V-model — requirements documents and design specifications. Because it is based on specifications rather than code, it is also known as Specification-Based Testing.

White Box Testing (Structure-Based Testing)

A "white box" refers to a state where the internal structure and details are fully visible. White box testing creates test cases from the source code itself, with the goal of covering as many of the program's internal flows and paths as possible.

Since test cases are derived from the algorithms and logic defined in the source code or detailed design documents, it is also called Structure-Based Testing, as it involves examining the entire structure of the program.

테스트 설계 기법의 분류; 블랙박스 vs 화이트박스

테스트 설계 기법은 크게 블랙박스 테스트와 화이트박스 테스트 두 가지로 나뉜다.

블랙박스 테스트 (명세 기반 테스트)

블랙박스란 내부 구조나 세부 내용이 보이지 않는 상태를 의미한다. 블랙박스 테스트는 소프트웨어의 내부 로직은 고려하지 않고, 입력에 대해 적절한 출력이 나오는지에만 집중한다. 예를 들어 1과 2를 입력했을 때 3이 출력되는지를 확인하는 것이지, 내부적으로 어떤 로직을 통해 3이 계산되었는지는 보지 않는다.

테스트 케이스를 만들 때도 소스코드가 아닌 V모델의 왼쪽 산출물, 즉 요구사항 명세서나 설계 스펙을 기반으로 도출한다. 코드가 아닌 명세를 기반으로 한다는 점에서 명세 기반 테스트(Specification-based Test) 라고도 불린다.

화이트박스 테스트 (구조 기반 테스트)

화이트박스는 내부 구조와 세부 내용이 보이는 상태를 의미한다. 화이트박스 테스트는 소스코드 그 자체를 대상으로 테스트 케이스를 만들며, 프로그램 내부의 다양한 흐름과 경로를 최대한 커버하는 것을 목표로 한다.

소스코드 또는 상세 설계서에 정의된 알고리즘과 로직을 기반으로 테스트 케이스를 도출하기 때문에, 프로그램의 전체 구조를 들여다본다는 의미에서 구조 기반 테스트(Structure-based Test) 라고도 불린다.

Black Box Testing (Specification-Based Testing) — In Depth

Black box testing excludes the software's internal logic entirely and focuses only on whether the output for a given input is correct. Because test cases are derived from specifications such as requirements documents and design documents — rather than source code — it is also referred to as specification-based or spec-based testing.

For example, consider a program that outputs the largest number among several inputs. The internal comparison algorithm used is irrelevant. Based on the specification, the following test cases are derived:

Case where A is the largest
Case where B is the largest
Case where C is the largest
Case where A is entered as a negative number

The essence of black box testing is verifying that the program produces the correct output in each of these situations.

블랙박스 테스트 (명세 기반 테스트) 심화

블랙박스 테스트는 소프트웨어 내부의 로직은 배제하고, 입력에 대한 출력 값에만 초점을 두는 테스트 방법이다. 테스트 케이스는 소스코드가 아닌 요구사항 명세서나 설계서 등의 스펙을 기반으로 도출하기 때문에 명세 기반 테스트, 스펙 기반 테스트라고도 불린다.

예를 들어 여러 숫자 중 가장 큰 수를 출력하는 프로그램이 있다고 하자. 내부적으로 어떤 비교 알고리즘을 사용하는지는 관심 밖이다. 스펙을 기반으로 다음과 같은 테스트 케이스를 도출한다.

A가 가장 큰 경우
B가 가장 큰 경우
C가 가장 큰 경우
A를 음수로 입력하는 경우

이처럼 프로그램이 각 상황에서 올바른 출력을 내는지를 확인하는 것이 블랙박스 테스트의 핵심이다.

White Box Testing (Structure-Based Testing) — In Depth

White box testing aims to test as many independent paths within the source code as possible. Test cases are derived from the source code itself, or equivalent detailed design documents and algorithm logic.

While the form of inputs and outputs is the same as in black box testing, white box testing directly examines the internal logic, creating a test case for each individual execution path within the program. Using the "largest number output" program as an example:

Because it examines the entire internal structure and aims to cover every independent path without omission, this approach is also called Structure-Based Testing.

화이트박스 테스트 (구조 기반 테스트) 심화

화이트박스 테스트는 소스코드 내의 모든 독립적인 경로를 최대한 테스트하는 방법이다. 소스코드, 혹은 이에 준하는 상세 설계서와 알고리즘 로직을 기반으로 테스트 케이스를 도출한다.

입력과 출력의 형태는 블랙박스 테스트와 동일하지만, 내부 로직을 직접 들여다보기 때문에 프로그램 안에 존재하는 다양한 실행 경로 하나하나를 테스트 케이스로 만든다. 앞서 살펴본 큰 수 출력 프로그램을 예로 들면 다음과 같다.

이처럼 내부 구조를 모두 들여다보며 독립적인 경로를 빠짐없이 커버하려 한다는 점에서 구조 기반 테스트(Structure-based Test) 라고도 부른다.

[Reference] Test Approaches by V-Model Stage

The test target, environment, and applicable technique differ at each stage of the V-model.

Unit Testing targets individual modules such as functions and files, and is conducted in the developer's own development environment. Since test cases are derived directly from source code, white box testing is applied.

Integration Testing verifies the interfaces between internal and external modules, and is typically conducted in a laboratory environment. Because both source code and specifications are often referenced together, it can be applied in a gray box form that combines white box and black box approaches.

System Testing covers the entire integrated software, verifying not only functional aspects but also non-functional qualities such as performance, usability, maintainability, and compatibility. It is conducted in an environment similar to the actual operating environment, and can also be applied in gray box form.

Acceptance Testing verifies the final software from the user's perspective in a real-world environment, confirming that it operates according to the requirements defined by the user. Since users have no need to examine internal code, black box testing is applied, using requirements definition documents and similar specs as the Test Basis.

[참고] V모델 단계별 테스트 방안

V모델의 각 테스트 단계별로 테스트 대상, 환경, 적용 기법이 달라진다.

단위 테스트 ㅡ 단위 모듈(함수, 파일 등)을 대상으로 하며, 개발자의 개발 환경에서 진행된다. 소스코드 자체를 기반으로 테스트 케이스를 도출하기 때문에 화이트박스 테스트가 적용된다.

통합 테스트 ㅡ 내부 및 외부 모듈 간의 인터페이스를 검증하는 단계로, 보통 실험실 환경에서 진행된다. 소스코드와 스펙을 함께 참조하는 경우가 많아 화이트박스와 블랙박스를 혼합한 그레이박스 형태로 적용이 가능하다.

시스템 테스트 ㅡ 통합된 소프트웨어 전체를 대상으로 기능적 측면뿐만 아니라 성능, 사용성, 유지보수성, 호환성 등 비기능적인 부분까지 검증하는 단계다. 실제 운영환경과 유사한 환경에서 진행되며, 그레이박스 형태로도 적용 가능하다.

인수 테스트 ㅡ 사용자 관점에서 최종 소프트웨어를 실제 환경에서 검증하는 단계로, 사용자가 제시한 요구사항대로 동작하는지를 확인한다. 사용자가 내부 코드를 볼 필요가 없으므로 요구사항 정의서 등의 스펙을 테스트 베이시스로 삼는 블랙박스 테스트가 적용된다.

3️⃣ Block box testing Preview

Black Box Test Design Techniques

Black box testing is an approach that excludes the internal logic of source code and verifies whether the output for a given input is appropriate. Test cases are derived from specifications such as requirements documents and architecture design documents. ISO/IEC/IEEE 29119 defines a variety of black box test techniques. Representative techniques include:

Syntax Testing
Equivalence Partitioning
Boundary Value Analysis
And many more

In this post, we will use Syntax Testing to understand the fundamental concept of black box test techniques. The remaining techniques will be covered in subsequent posts.

블랙박스 테스트 설계 기법

블랙박스 테스트는 소스코드의 내부 로직은 배제하고, 입력에 대한 출력이 적절한지를 검증하는 방식이다. 테스트 케이스는 요구사항 명세서, 아키텍처 설계서 등의 스펙을 기반으로 도출된다. ISO/IEC/IEEE 29119 표준에서는 다양한 블랙박스 테스트 기법을 정의하고 있다. 대표적인 기법으로는 다음과 같은 것들이 있다.

신택스 테스팅 (Syntax Testing)
동등 분할 (Equivalence Partitioning)
경계값 분석 (Boundary Value Analysis)
그 외 다수

이번에는 신택스 테스팅(Syntax Testing) 을 통해 블랙박스 테스트 기법의 기본 컨셉을 살펴보고, 나머지 기법들은 이어지는 내용에서 다룰 예정이다.

Black Box Test Technique ① Syntax Testing

Syntax Testing is one of the easiest black box test techniques to apply. The core concept is to divide input values into Valid and Invalid categories and create test cases accordingly. Test cases are derived from requirements documents or design specifications.

Example: User Registration ㅡ Username Input

Username condition: Korean characters only, minimum 2 characters, maximum 8 characters

When "홍길동 99" is entered, the expected output is "Number included — invalid," but if the actual result shows "Normal," it means the program failed to properly detect the number, indicating a defect.

In this way, Syntax Testing verifies only the output result — without looking at the internal logic at all — by creating test cases based on the valid/invalid conditions defined in the specification. The same approach can be applied to all input fields such as username, password, and email address.

블랙박스 테스트 기법 ① 신택스 테스팅 (Syntax Testing)

신택스 테스팅은 블랙박스 테스트 기법 중 가장 쉽게 적용할 수 있는 방법이다. 핵심 개념은 입력값을 적합(Valid) 과 부적합(Invalid) 으로 구분하여 테스트 케이스를 만드는 것이다. 테스트 케이스는 요구사항 명세서나 설계 스펙을 기반으로 도출된다.

예시: 회원가입 — 사용자 이름 입력

사용자 이름 조건: 2자리 이상 8자리 이하의 한글만 허용

"홍길동 99" 입력 시 예상 출력값은 "숫자 포함 부적합 경고"이지만 실행 결과가 "정상"으로 출력된다면, 프로그램이 숫자를 제대로 인식하지 못한 것이므로 결함이 존재한다는 것을 알 수 있다.

이처럼 신택스 테스팅은 내부 로직은 전혀 보지 않고, 스펙에 정의된 적합/부적합 조건을 기준으로 테스트 케이스를 만들어 출력 결과만을 검증하는 방식이다. 아이디, 비밀번호, 이메일 주소 등 모든 입력 항목에 동일한 방식으로 적용할 수 있다.

Syntax Testing Applied — Shopping Mall Product Search Feature

Let's apply Syntax Testing to a product search feature in a shopping mall system. This feature allows users to enter a product name or product number as a keyword, and the system searches for and displays the matching product.

All valid/invalid conditions are derived from the specification (requirements documents, design documents, etc.). Test cases can only be accurately created when these conditions are clearly defined.

The Importance of Specifications

There is a critical point that must be emphasized here. From the V-model perspective, test cases are built from the left-side artifacts — requirements definitions and design documents. If these artifacts are not properly written, it becomes difficult to accurately derive test cases. In the worst case, testers may end up creating test cases based on guesswork rather than a solid specification.

Ultimately, good testing starts with good specs. Never forget that the thoroughness with which left-side artifacts are written determines the overall quality of the entire test design activity.

신택스 테스팅 적용 예시; 쇼핑몰 상품 검색 기능

쇼핑몰 시스템의 상품 검색 기능을 예시로 신택스 테스팅을 적용해보자. 이 기능은 사용자가 상품명 또는 상품 번호를 키워드로 입력하면 해당 상품을 검색하여 표시하는 기능이다. 이 적합/부적합 조건은 모두 스펙(요구사항 명세서, 설계서 등) 으로부터 도출된다. 조건이 명확히 정의되어 있어야 테스트 케이스를 정확하게 만들 수 있다.

스펙의 중요성

여기서 반드시 짚고 넘어가야 할 점이 있다. V모델 관점에서 테스트 케이스는 왼쪽의 산출물, 즉 요구사항 정의서나 설계서를 기반으로 만들어진다. 따라서 이 산출물들이 제대로 작성되어 있지 않으면 테스트 케이스를 정확히 도출하기가 어려워진다. 최악의 경우 테스터가 스펙 없이 추측에 의존해 테스트 케이스를 만들게 되는 상황이 발생할 수 있다.

결국 좋은 테스트는 좋은 스펙에서 시작된다. 왼쪽 산출물을 충실히 작성하는 노력이 테스트 설계 활동 전체의 품질을 좌우한다는 점을 반드시 기억하자.

Syntax Testing Applied — Writing Test Cases for the Product Search Feature

Let's write actual test cases based on the valid/invalid conditions defined earlier.

Product Name Test Cases

When "노트북^^" is entered, the special characters are ignored and laptop search results are displayed. Since this differs from the expected output defined in the spec, it is identified as a defect and debugging is required.

When "AA12345678" is entered, the letters are not properly recognized and the result shows "Normal." This is likewise identified as a defect and the code must be corrected.

Key Takeaway

Every step of this process — identifying input variables, defining valid/invalid conditions, and deriving test cases — is based not on source code, but on specifications such as requirements documents and architecture design documents. Ultimately, the quality of test cases depends on the quality of the specifications. Always remember: how thoroughly the left-side artifacts of the V-model are written determines the overall completeness of the entire test effort.

신택스 테스팅 적용 예시; 쇼핑몰 상품 검색 기능의 테스트 케이스 작성

앞서 정의한 적합/부적합 조건을 바탕으로 실제 테스트 케이스를 작성해보자.

"노트북^^" 입력 시 특수 기호를 무시하고 노트북 검색 결과가 출력되었다. 스펙에 명시된 예상 출력값과 다르므로 결함으로 판단하고 디버깅이 필요하다.

"AA12345678" 입력 시 영문을 제대로 인식하지 못하고 정상으로 출력되었다. 마찬가지로 결함으로 판단하고 코드를 수정해야 한다.

핵심 정리

이 모든 과정 ~ 입력 변수를 찾고, 적합/부적합 조건을 정의하고, 테스트 케이스를 도출하는 것 은 소스코드가 아닌 요구사항 명세서, 아키텍처 설계서 등의 스펙을 기반으로 이루어진다. 결국 테스트 케이스의 품질은 스펙의 품질에 달려 있다. V모델의 왼쪽 산출물을 얼마나 충실하게 작성하느냐가 테스트 전체의 완성도를 결정한다는 점을 반드시 기억하자.

Threads: Concepts, Implementation, and Practice

Heesu Noh — Thu, 02 Apr 2026 19:55:14 GMT

1️⃣Concept of Threads
2️⃣ Implementation of Threads
3️⃣ Process / Thread Practice

1️⃣Concept of Threads

Execution Flows Inside a Process — The Story of Threads

When you open a messenger app, multiple things happen at once. You can type a message, notifications pop up, and a file download continues in the background. So are all these tasks handled by a single execution flow?

What if only one flow existed? You'd have to wait for the file download to finish before sending a message, and the screen would freeze until a notification was processed. The reason we rarely experience such inconvenience is that threads are working behind the scenes, dividing and handling each role.

Today's lesson starts from exactly this point — "How does execution flow split inside a single program?"

프로세스 안에서 흐름이 나뉜다; 스레드(Thread) 이야기

메신저 프로그램을 켜면 동시에 여러 일이 일어난다. 메시지를 입력할 수 있고, 알림이 울리고, 파일 다운로드도 진행된다. 그렇다면 이 모든 작업은 하나의 실행 흐름으로 처리되는 걸까?

만약 하나의 흐름만 존재한다면 어떻게 될까? 파일 다운로드가 끝나야 메시지를 보낼 수 있고, 알림이 처리되기 전까지 화면이 멈춰 있을 것이다. 우리가 프로그램을 쓸 때 이런 불편함을 거의 느끼지 못하는 건, 내부에서 스레드가 역할을 나눠 처리하고 있기 때문이다.

오늘 수업은 바로 이 지점, "하나의 프로그램 안에서 실행 흐름이 어떻게 나뉘는가" 라는 질문에서 출발한다.

The Basic Unit of Execution, and the Flows Within It

In the last session, we covered the process as the basic unit through which the OS manages execution. A process is a running program itself, and the OS allocates memory and resources to each process, managing them independently.

So when execution splits into multiple flows inside a process, how does the OS handle it? Does it manage the whole process as one unit, or does it treat each split flow as a separate entity?

The answer is the latter. The OS distinguishes each individual execution flow inside a process as a thread. In other words, when determining scheduling or execution order, the unit the OS actually works with is not the process, but the thread.

If a process is "the space where a program lives," then a thread is "the actual execution flow moving within that space." This session will carefully work through this relationship — how the OS breaks execution down into manageable units.

실행의 기본 단위, 그리고 그 안에서 나뉘는 흐름

지난 시간에는 운영체제가 실행을 관리하는 기본 단위로 프로세스(Process) 를 다뤘다. 프로세스는 실행 중인 프로그램 그 자체이며, 운영체제는 각 프로세스에 메모리와 자원을 할당하고 독립적으로 관리한다.

그렇다면 프로세스 안에서 실행이 여러 갈래로 나뉠 때, 운영체제는 이를 어떻게 다룰까? 하나의 프로세스로 묶어서 통째로 관리할까, 아니면 나뉜 흐름 각각을 별도의 단위로 구분해서 관리할까?

답은 후자다. 운영체제는 프로세스 내부에서 나뉜 각각의 실행 흐름을 스레드(Thread) 라는 단위로 구분하여 관리한다. 즉, 스케줄링이나 실행 순서를 결정할 때 운영체제가 실질적으로 다루는 단위는 프로세스가 아니라 스레드인 셈이다.

프로세스가 "프로그램이 살아있는 공간"이라면, 스레드는 "그 공간 안에서 실제로 움직이는 실행 흐름"이라고 볼 수 있다. 오늘 수업에서는 바로 이 관계를, 즉 운영체제가 실행을 어떤 단위로 쪼개어 다루는지를 차근차근 정리해나갈 것이다.

What Is a Thread? — An Execution Flow Inside a Process

A thread is the unit of execution flow operating inside a process. More precisely, it is the minimum execution unit that actually receives CPU allocation and runs.

Consider an internet banking application. A user can check their balance while simultaneously requesting a transfer or searching transaction history. All of these functions happen within a s ingle program, but each is handled by a different execution flow. The flow handling the balance, the flow handling the transfer, the flow handling the transaction history — each of these is a thread.

Two key points matter here.

First, one or more threads can exist inside a single process. If a process represents the entire execution environment, a thread is the minimal execution unit actually moving within it.

Second, threads share resources but execute independently. Resources like the code region, global data, and heap are shared among threads within the same process. However, each thread's execution flow proceeds independently.

This is the essence of a thread. Share resources, execute separately.

스레드란 무엇인가? 프로세스 안의 실행 흐름

스레드(Thread)는 프로세스 내부에서 동작하는 실행 흐름의 단위다. 좀 더 정확히 말하면, CPU를 실제로 할당받아 동작하는 최소 실행 단위라고 할 수 있다.

인터넷 뱅킹 프로그램을 예로 들어보자. 사용자는 잔액 조회를 하면서 동시에 이체를 요청하거나 거래 내역을 검색할 수 있다. 이 모든 기능이 하나의 프로그램 안에서 이루어지지만, 각 기능은 서로 다른 실행 흐름으로 처리된다. 잔액을 처리하는 흐름, 이체를 처리하는 흐름, 거래 내역을 처리하는 흐름, 이 각각이 바로 스레드다.

여기서 중요한 점은 두 가지다.

첫째, 하나의 프로세스 안에는 하나 이상의 스레드가 존재할 수 있다. 프로세스가 실행 환경 전체를 의미한다면, 스레드는 그 안에서 실제로 움직이는 최소한의 실행 단위다.

둘째, 스레드는 자원을 공유하되, 실행은 독립적으로 이루어진다. 코드 영역, 전역 데이터, 힙(Heap) 영역과 같은 자원은 같은 프로세스 안의 스레드들이 함께 공유한다. 그러나 각 스레드의 실행 흐름 자체는 서로 독립적으로 진행된다.

이것이 스레드의 핵심이다. 자원은 나눠 쓰고, 실행은 따로 간다.

Single-Thread vs Multi-Thread; Four Execution Structures

The number of processes and the number of threads are independent concepts. Their combination determines the execution structure, which can be broken into four forms.

① Single Process + Single Thread

Only one execution flow exists, so tasks proceed sequentially. One task must finish before the next begins. If one task takes a long time, everything else waits. The simplest structure.

② Single Process + Multiple Threads

One process, but execution splits into multiple flows. One thread handles file downloads while another handles user input; multiple tasks appear to run simultaneously within the same program. The messenger and internet banking examples from earlier fall into this category.

③ Multiple Processes + Single Thread

Multiple processes exist, each with only one execution flow. Running a web browser and a music player at the same time is a typical example. The reason execution appears divided here is not threads, but the fact that multiple independent processes are running.

④ Multiple Processes + Multiple Threads

Multiple processes exist, and each contains multiple threads. For example, running a browser and a messenger simultaneously; the browser runs multiple threads per tab, and the messenger has separate threads for message handling and file downloading. This is the most common structure in modern operating systems.

The number of processes and the number of threads operate on different dimensions. How you combine them determines a program's execution structure.

단일 스레드와 다중 스레드: 실행 구조의 네 가지 형태

프로세스의 수와 스레드의 수는 서로 독립적인 개념이다. 이 두 요소의 조합에 따라 실행 구조가 달라지며, 크게 네 가지 형태로 나눠볼 수 있다.

① 단일 프로세스 + 단일 스레드

실행 흐름이 하나뿐이므로 작업이 순차적으로 진행된다. 한 작업이 끝나야 다음 작업으로 넘어가기 때문에, 어떤 작업이 오래 걸리면 나머지는 그동안 기다려야 한다. 가장 단순한 구조다.

② 단일 프로세스 + 다중 스레드

프로세스는 하나지만 실행 흐름이 여러 갈래로 나뉘어 동작한다. 한 스레드는 파일 다운로드를, 다른 스레드는 사용자 입력을 처리하는 식으로, 같은 프로그램 안에서 여러 작업이 동시에 수행되는 것처럼 보인다. 앞서 살펴본 메신저나 인터넷 뱅킹이 이 구조에 해당한다.

③ 다중 프로세스 + 단일 스레드

프로세스 자체가 여러 개 존재하고, 각 프로세스는 하나의 실행 흐름만 가진다. 웹 브라우저와 음악 재생 프로그램을 동시에 실행한 경우가 대표적인 예다. 이때 실행이 나뉘어 보이는 이유는 스레드 때문이 아니라, 서로 독립된 프로세스가 여러 개 동작하고 있기 때문이다.

④ 다중 프로세스 + 다중 스레드

여러 프로세스가 존재하고, 각 프로세스 안에서도 여러 스레드가 동작하는 구조다. 예를 들어 웹 브라우저와 메신저를 동시에 실행했을 때, 브라우저 안에서는 탭마다 여러 스레드가 돌아가고, 메신저 안에서도 메시지 처리와 파일 다운로드를 담당하는 스레드가 각각 존재한다. 이것이 현대 운영체제에서 가장 일반적으로 사용되는 구조다.

결국 프로세스의 개수와 스레드의 개수는 서로 다른 차원의 개념이다. 이 둘을 어떻게 조합하느냐에 따라 프로그램의 실행 구조가 결정된다.

Process Internal Structure; What Threads Share and What They Keep Separate

A process holds common resources: the code region, global data region, and heap region. In a multi-threaded environment, multiple threads within the same process share these resources.

Consider the following code:

int count = 0;

void process() {
    int x = 0;
}

The global variable count is stored in the data region, so all threads within the same process can access it. If two threads simultaneously perform count++, then count becomes a shared resource. The code region, global data, and heap exist at the process level and are shared by all threads.

However, not everything is shared. Each thread independently holds its own stack region. Even if two threads call the same function simultaneously, the local variable x inside that function is created separately on each thread's stack. Thread A's x and Thread B's x occupy different memory locations despite having the same name, so changes in one do not affect the other.

In summary: the code region, global data, and heap are shared between threads, while the stack is independently maintained per thread. This structure allows multiple threads to efficiently share resources while each executes without interfering with the others.

프로세스 내부 구조: 스레드는 무엇을 공유하고 무엇을 따로 가지는가

프로세스는 코드 영역, 전역 데이터 영역, 힙 영역과 같은 공통 자원을 가지고 있다. 다중 스레드 환경에서는 같은 프로세스 안에 있는 여러 스레드가 이 자원들을 함께 사용하게 된다.

예를 들어 아래와 같은 코드가 있다고 해보자.

int count = 0;

void process() {
    int x = 0;
}

여기서 전역 변수 count는 데이터 영역에 저장되기 때문에, 같은 프로세스 안의 모든 스레드가 함께 접근할 수 있다. 두 스레드가 동시에 count++ 연산을 수행한다면, count는 두 스레드가 공유하는 자원이 된다. 코드 영역, 전역 데이터, 힙 영역은 이처럼 프로세스 단위로 존재하며 스레드들이 공유한다.

그러나 모든 자원을 공유하는 것은 아니다. 각 스레드는 실행에 필요한 스택(Stack) 영역을 독립적으로 가진다. 예를 들어 두 스레드가 같은 함수를 동시에 호출하더라도, 그 함수 안의 지역 변수 x는 각 스레드의 스택에 따로 생성된다. 스레드 A의 x와 스레드 B의 x는 이름은 같아도 서로 다른 메모리 공간에 존재하기 때문에, 한쪽의 변경이 다른 쪽에 영향을 주지 않는다.

정리하면 다음과 같다. 코드 영역, 전역 데이터, 힙 영역은 스레드 간에 공유되고, 실행 흐름을 위한 스택은 스레드마다 독립적으로 존재한다. 이 구조 덕분에 여러 스레드가 자원을 효율적으로 나눠 쓰면서도, 각자의 실행은 서로 간섭 없이 독립적으로 이루어질 수 있다.

Thread Address Space; A Structure That Shares Yet Separates

When a process runs, the OS allocates a memory space exclusively for that process. Within this space, the code, data, and heap regions are placed at distinct locations. "Distinct locations" means they sit at different addresses — not that they use separate memory. They all exist within the same memory range belonging to one process, just partitioned.

In a multi-threaded environment, multiple threads share this memory space. The same function in the code region can be executed by multiple threads simultaneously, and global variables are accessible to all threads.

Yet each thread has its own separate stack. This means that even when executing the same function concurrently, each thread independently maintains its own parameter values and local variables. One thread's execution cannot affect another thread's local variables.

To summarize: code, data, and heap are shared between threads; the stack exists independently per thread. The ability to share memory while maintaining independent execution flows is made possible by this structure. This shared-yet-separated design is the defining characteristic of a multi-threaded environment.

스레드의 주소 공간: 공유하되 분리된 구조

프로세스가 실행되면 운영체제는 그 프로세스만의 메모리 공간을 할당한다. 이 공간 안에는 코드 영역, 데이터 영역, 힙 영역이 각각 구분된 위치에 배치된다. 여기서 "각각 다른 위치"란 이 영역들이 서로 다른 주소에 놓인다는 뜻이지, 별개의 메모리를 쓴다는 의미가 아니다. 모두 하나의 프로세스에 속한 동일한 메모리 범위 안에 구분되어 존재한다.

다중 스레드 환경에서는 이 메모리 공간을 여러 스레드가 함께 사용한다. 코드 영역에 있는 같은 함수를 여러 스레드가 동시에 실행할 수 있고, 전역 변수 역시 모든 스레드가 함께 접근할 수 있다.

그러나 각 스레드는 자신만의 스택 영역을 따로 가진다. 덕분에 같은 함수를 동시에 실행하더라도 각 스레드는 서로 다른 매개변수 값과 지역 변수를 독립적으로 유지할 수 있다. 한 스레드의 실행이 다른 스레드의 지역 변수에 영향을 주지 않는 것도 이 때문이다.

정리하면, 코드·데이터·힙은 스레드 간에 공유되고, 스택은 스레드마다 독립적으로 존재한다. 같은 메모리를 나눠 쓰면서도 각자의 실행 흐름을 유지할 수 있는 것은 바로 이 구조 덕분이다. 공유하되 분리된 이 구조가 멀티스레드 환경의 핵심 특징이다.

Thread Characteristics; Resource Sharing and Execution Separation

The core characteristics of threads can be summarized in two points.

① Resource Sharing

Threads within the same process share the code, data, and heap regions. In a shopping mall website, one thread fetches the product list, another calculates the cart total, and another handles user input. Since product information and user data all reside in the same memory, there is no need to copy data when passing it between threads, they are all looking at the same memory. This structure enables efficient cooperation between threads inside a process.

② Separation of Execution Flows

Because multiple execution flows can exist within one process, each thread has its own execution order and operates independently. Think of video streaming: one thread downloads video data, another renders it on screen, and another handles user input. The execution is divided, yet all flows run within a single program.

Ultimately, threads operate on two principles: share resources, execute separately.

스레드의 특징; 자원 공유와 실행 흐름의 분리

스레드의 핵심 특징은 크게 두 가지로 정리할 수 있다.

① 자원 공유

같은 프로세스 안에 있는 스레드들은 코드, 데이터, 힙 영역을 함께 사용한다. 쇼핑몰 웹사이트를 예로 들면, 한 스레드는 상품 목록을 불러오고, 다른 스레드는 장바구니를 계산하고, 또 다른 스레드는 사용자 입력을 처리한다. 이때 상품 정보나 사용자 데이터는 모두 같은 메모리에 존재하기 때문에, 스레드 간에 데이터를 주고받을 때 굳이 복사할 필요가 없다. 같은 메모리를 함께 바라보고 있기 때문이다. 이 구조 덕분에 프로세스 내부에서 스레드 간 협력이 효율적으로 이루어질 수 있다.

② 실행 흐름의 분리

하나의 프로세스 안에 여러 실행 흐름이 존재할 수 있기 때문에, 각 스레드는 자신만의 실행 순서를 가지면서 독립적으로 동작한다. 영상 스트리밍을 생각해보면, 한 스레드는 영상 데이터를 다운로드하고, 다른 스레드는 화면에 출력하고, 또 다른 스레드는 사용자 입력을 받는다. 실행은 나뉘어 있지만 이 모든 흐름이 하나의 프로그램 안에서 돌아가고 있는 것이다.

결국 스레드는 자원은 함께 쓰고, 실행은 따로 간다는 두 가지 원칙 위에서 동작한다.

Process vs Thread — A Comparison

Processes and threads are often mentioned together, but they are distinct concepts.

A process represents the entire running program. Each process has its own independent memory space, so running a web browser and Notepad simultaneously means the two programs operate in separate memory spaces with no direct access to each other's memory.

A thread is an execution flow running inside one process. Threads belonging to the same process share memory, and multiple threads can exist within a single process.

In one sentence: a process is the entire execution environment, and a thread is a single execution flow performing actual work within it.

스레드와 프로세스의 비교

프로세스와 스레드는 자주 함께 언급되지만 서로 다른 개념이다.

프로세스는 실행 중인 프로그램 전체를 의미한다. 각 프로세스는 독립적인 메모리 공간을 가지기 때문에, 웹 브라우저와 메모장을 동시에 실행하면 두 프로그램은 서로 다른 메모리 공간에서 각각 동작하며 서로의 메모리에 직접 접근할 수 없다.

스레드는 하나의 프로세스 안에서 실행되는 흐름이다. 같은 프로세스에 속한 스레드들은 메모리를 공유하며, 하나의 프로세스 안에 여러 스레드가 존재할 수 있다.

한 문장으로 정리하면, 프로세스는 실행 환경 전체이고 스레드는 그 내부에서 실제로 작업을 수행하는 하나의 실행 흐름이다.

Why Use Threads?

① Improved Responsiveness

If user input goes unprocessed while a file is uploading, the program feels frozen. But if one thread handles the upload and another handles user input, the screen continues to respond while uploading proceeds. Whether the tasks truly run simultaneously or not, the user perceives the program as uninterrupted.

② Efficiency

Since threads share the same process memory, creating a new thread is far less costly than creating a new process. A new process requires a separate memory allocation, whereas a thread reuses existing memory. When a browser handles multiple tasks, splitting them into threads within one process is far more resource-efficient than spawning a new process for each task.

③ Structural Separation

Threads make it easy to divide work by role within a single program. In a game, one thread handles user input, another handles physics calculations, and another handles rendering. Dividing execution flows by function makes the program's behavior easier to understand and simplifies maintenance; fixing a specific feature only requires modifying its corresponding thread.

Taken together, these three reasons point to the core of threads: the combination of sharing and separation. The ability to share memory while splitting execution independently is the most fundamental reason to use threads — and their greatest defining trait.

스레드를 사용하는 이유

① 응답성 향상

파일을 업로드하는 동안 사용자 입력이 전혀 처리되지 않는다면, 사용자 입장에서는 프로그램이 멈춘 것처럼 느껴진다. 하지만 업로드 작업을 한 스레드가 맡고, 사용자 입력 처리를 다른 스레드가 맡는다면 업로드가 진행되는 동안에도 화면은 계속 반응하게 된다. 실제로 작업이 동시에 이루어지든 아니든, 사용자 입장에서는 프로그램이 멈추지 않은 것처럼 느껴지는 것이다.

② 효율성

스레드는 같은 프로세스의 메모리를 공유하기 때문에, 새로운 프로세스를 생성하는 것보다 부담이 훨씬 적다. 프로세스를 새로 만들면 독립적인 메모리 공간을 별도로 할당해야 하지만, 스레드는 기존 메모리를 그대로 활용한다. 예를 들어 브라우저에서 여러 작업을 처리할 때, 각 작업마다 프로세스를 새로 만드는 것보다 같은 프로세스 안에서 스레드로 나누어 처리하는 것이 자원 사용 면에서 훨씬 효율적이다.

③ 구조적 분리

세 번째 이유는 구조적인 분리다. 스레드를 활용하면 하나의 프로그램 안에서 역할별로 작업을 나누기가 쉬워진다. 게임 프로그램을 예로 들면, 사용자 입력을 처리하는 스레드, 물리 연산을 담당하는 스레드, 화면 출력을 맡는 스레드로 역할을 구분할 수 있다. 이렇게 기능별로 실행 흐름을 나누면 프로그램의 동작 구조를 이해하기 쉬워지고, 특정 기능을 수정하거나 개선할 때도 해당 스레드만 손보면 되기 때문에 유지보수가 훨씬 수월해진다.

지금까지 살펴본 세 가지 이유를 종합하면, 스레드의 핵심은 공유와 분리의 결합에 있다. 같은 메모리를 공유하면서도 실행은 독립적으로 나누어 처리할 수 있는 이 구조가 스레드를 사용하는 가장 근본적인 이유이자, 스레드의 가장 큰 특징이다.

Real-World Examples of Thread Usage

① Video Player

Smooth playback requires multiple tasks to run simultaneously. Processing screen output, audio playback, and user input (play/pause) sequentially in one flow would cause the video to freeze or the audio to cut out. Assigning each function to a separate thread allows video, audio, and input handling to proceed concurrently, keeping the overall experience seamless.

② Game Program

A game must continuously refresh the screen, respond instantly to user input, and simultaneously calculate character positions and collision detection. Doing this sequentially would cause the screen to stutter or responses to lag. Splitting tasks by role into independent execution flows solves this problem.

③ Why Not Just Use Multiple Processes?

Separating these tasks into processes might seem feasible on the surface. However, separate processes have independent memory spaces, requiring inter-process communication (IPC) to share state and data — adding complexity and overhead. For closely related tasks, splitting into threads within one process is far more appropriate, since they can share memory while keeping execution separate.

스레드의 실제 사용 예시

스레드가 왜 필요한지는 실제 프로그램 사례를 보면 더 명확하게 이해할 수 있다.

① 동영상 재생 프로그램

영상이 끊기지 않으려면 여러 작업이 동시에 이루어져야 한다. 화면 출력, 음성 재생, 사용자의 재생·일시정지 입력 처리를 하나의 흐름으로 순차적으로 실행한다면 영상이 멈추거나 소리가 끊기는 문제가 생긴다. 각 기능을 별도의 스레드로 나누어 처리하면 영상, 음성, 입력 처리가 동시에 이루어지면서 전체 흐름이 자연스럽게 유지된다.

② 게임 프로그램

게임은 화면을 끊임없이 갱신하면서, 사용자 입력에 즉각 반응하고, 동시에 캐릭터 위치 계산이나 충돌 처리도 수행해야 한다. 이를 순차적으로 처리하면 화면이 멈추거나 반응이 느려질 수밖에 없다. 역할별로 스레드를 나누어 각 기능을 독립적인 실행 흐름으로 처리하면 이 문제를 해결할 수 있다.

③ 프로세스로 나누면 안 될까?

같은 작업을 프로세스로 분리하는 것도 겉으로는 가능해 보인다. 그러나 프로세스를 분리하면 각각 독립된 메모리 공간을 가지게 되어, 작업 간에 상태 정보를 주고받거나 데이터를 공유하기 위해 별도의 프로세스 간 통신이 필요해진다. 이는 구조를 복잡하게 만들고 관리 부담도 커진다. 서로 밀접하게 연관된 작업이라면 하나의 프로세스 안에서 스레드로 나누어 처리하는 것이 훨씬 적합하다. 같은 메모리를 공유하면서 실행만 분리할 수 있기 때문이다.

Threads and Asynchronous Behavior; Word Processor Example

Consider a word processor. Typing on the keyboard should instantly display characters on screen, images should load in the background, and auto-save should run at regular intervals. Handling all of this sequentially in one flow would cause input delays or make the screen appear frozen.

Threads solve this. Assigning input handling, screen rendering, and auto-save to separate threads lets all three proceed without waiting for each other. The user can keep typing without waiting for auto-save to complete, and the program runs naturally.

This structure — where multiple tasks proceed independently without waiting for each other — is called asynchronous behavior. More precisely, it is closer to concurrent execution via threads, but it carries asynchronous characteristics in the sense that tasks proceed without blocking one another.

스레드로 구현하는 비동기적 동작; 워드 편집기 예시

워드 편집기를 생각해보자. 사용자가 키보드로 입력하면 즉시 화면에 글자가 나타나야 하고, 동시에 이미지 로딩이 이루어지고, 일정 시간마다 자동 저장도 실행된다. 이 작업들을 하나의 흐름으로 순차적으로 처리한다면 입력이 지연되거나 화면이 멈춘 것처럼 보일 수 있다.

이를 해결하는 방법이 바로 스레드다. 입력 처리, 화면 갱신, 자동 저장을 각각 별도의 스레드로 나누면 세 작업이 서로를 기다리지 않고 독립적으로 진행된다. 사용자는 자동 저장이 끝날 때까지 기다릴 필요 없이 계속 타이핑할 수 있고, 프로그램 전체는 자연스럽게 동작한다.

이처럼 여러 작업이 서로를 기다리지 않고 독립적으로 진행되는 구조를 비동기적 동작이라고 부른다. 정확히는 스레드를 통한 동시적 실행에 가깝지만, 작업들이 서로 블로킹 없이 진행된다는 점에서 비동기적 특성을 띤다고 볼 수 있다.

Web Browser and Threads;Another Example of Asynchronous Behavior

A web browser works the same way. It simultaneously fetches data from the network, loads images, and handles user interaction. Processing all of this sequentially in one flow would make the page appear to freeze. Browsers are internally structured to use multiple threads for each task simultaneously, allowing users to experience pages that respond continuously without interruption.

웹 브라우저와 스레드; 비동기적 동작의 또 다른 예

웹 브라우저도 마찬가지다. 브라우저는 네트워크에서 데이터를 받아오고, 이미지를 로딩하고, 사용자와의 상호작용을 처리하는 작업을 동시에 수행한다. 이 모든 작업을 하나의 흐름으로 순차적으로 처리한다면 페이지가 멈춘 것처럼 보일 수 있다. 브라우저는 내부적으로 여러 스레드를 활용해 각 작업을 동시에 처리하도록 구성되어 있고, 그 결과 사용자는 페이지가 끊김 없이 계속 반응하는 것으로 느끼게 된다.

2️⃣ Implementation of Threads

How Does the OS Manage Threads?

We learned earlier that a thread is the minimum execution unit that receives CPU allocation and runs, and that multiple threads can exist within a single process. This naturally leads to the following question.

If multiple threads exist simultaneously, how does the OS determine their execution order, and what information does it use to manage them?

Just as the OS uses a data structure called the PCB (Process Control Block) to manage processes, it must also systematically maintain information about each thread. This section examines that structure; specifically, how the OS actually tracks and manages threads.

운영체제는 스레드를 어떻게 관리할까?

앞서 스레드는 CPU를 할당받아 실행되는 최소 실행 단위이며, 하나의 프로세스 안에 여러 스레드가 존재할 수 있다는 것을 배웠다. 여기서 자연스럽게 다음 질문이 생긴다.

여러 스레드가 동시에 존재한다면, 운영체제는 어떤 기준으로 실행 순서를 정하고 어떤 정보를 바탕으로 스레드를 관리할까?

프로세스를 관리할 때 운영체제가 PCB(Process Control Block)라는 자료구조를 사용했던 것처럼, 스레드를 관리할 때도 운영체제는 각 스레드에 대한 정보를 체계적으로 유지해야 한다. 이번 시간에는 바로 그 구조, 즉 운영체제가 스레드를 실제로 어떻게 파악하고 관리하는지를 살펴본다.

Thread Execution States; Why the OS Tracks Them

When multiple threads exist within a single process, each thread can be in a different execution state. One thread may currently be running on the CPU, another may be waiting for its turn, and yet another may be paused waiting for a specific event.

Because each thread has a different state, the OS must continuously decide which thread to run next and when to hand the CPU over to another thread. In other words, a thread is not simply a flow of execution, it is something the OS must constantly track and manage.

스레드의 실행 상태: 운영체제가 추적하는 이유

하나의 프로세스 안에 여러 스레드가 존재하면, 각 스레드는 서로 다른 실행 상태를 가질 수 있다. 어떤 스레드는 현재 CPU를 사용해 실행 중이고, 어떤 스레드는 실행 순서를 기다리고 있으며, 또 어떤 스레드는 특정 이벤트를 기다리며 멈춰 있을 수 있다.

스레드마다 상태가 다르기 때문에 운영체제는 지금 어떤 스레드를 실행시킬지, 언제 CPU를 다른 스레드에게 넘길지를 계속해서 판단해야 한다. 즉 스레드는 단순히 실행되는 흐름이 아니라, 운영체제가 끊임없이 추적하고 관리해야 하는 대상이 된다.

TCB - The Data Structure for Managing Threads

To manage threads, the OS must know the following about each one: its current state, where execution should resume next, what values are stored in the CPU registers, and where the stack is located.

To store and manage this information, the OS uses the TCB (Thread Control Block). The TCB holds information about a single thread and serves as the management unit used to track and switch between threads.

Just as the PCB exists to manage processes, the TCB exists to manage threads. The OS uses the TCB to determine which thread to run and when, and to restore the necessary information when switching from one thread to another.

TCB: 스레드를 관리하는 자료구조

운영체제가 스레드를 관리하려면 각 스레드에 대해 다음과 같은 정보를 파악하고 있어야 한다. 현재 어떤 상태인지, 다음에 어디서부터 실행을 이어가야 하는지, CPU 레지스터에는 어떤 값이 저장되어 있는지, 스택은 어디에 위치해 있는지가 그것이다.

이 정보를 저장하고 관리하기 위해 운영체제는 TCB(Thread Control Block, 스레드 제어 블록) 를 사용한다. TCB는 스레드 하나에 대한 정보를 담고 있으며, 스레드를 추적하고 전환하기 위해 사용되는 관리 단위다.

프로세스를 관리하기 위해 PCB가 존재하듯, 스레드 단위의 관리를 위해 TCB가 존재하는 것이다. 운영체제는 이 TCB를 바탕으로 어떤 스레드를 언제 실행할지 판단하고, 실행 중인 스레드를 다른 스레드로 전환할 때 필요한 정보를 복원한다.

The Role of the TCB; A Data Structure That Remembers Execution Flow

A thread is an execution flow that receives CPU allocation and runs. The OS must be able to pause this flow and resume it later.

Consider a scenario where Thread A is running, then pauses and switches to Thread B, and eventually returns to Thread A. For Thread A to resume correctly, it must remember exactly where it stopped. To enable this, the OS stores each thread's execution state and the information needed to resume it in the TCB.

Ultimately, the TCB is a data structure that preserves execution flow so that a thread can be resumed at any time. The key takeaway is this: a thread is an execution flow, and the TCB is the structure that remembers it.

TCB의 역할: 실행 흐름을 기억하는 자료구조

스레드는 CPU를 할당받아 실행되는 흐름이다. 운영체제는 이 실행 흐름을 중단했다가 나중에 다시 이어서 실행할 수 있어야 한다.

예를 들어 스레드 A가 실행되다가 멈추고 스레드 B로 전환되었다가 다시 스레드 A로 돌아온다고 해보자. 이때 스레드 A는 어디까지 실행했는지를 기억하고 있어야 정확한 위치부터 이어서 실행할 수 있다. 이를 위해 운영체제는 각 스레드의 실행 상태와 실행을 재개하는 데 필요한 정보를 TCB(Thread Control Block) 에 저장한다.

결국 TCB는 스레드가 언제든 다시 실행될 수 있도록 실행 흐름을 보존하는 자료구조다. 핵심만 짚으면 이렇다. 스레드는 실행 흐름이고, TCB는 그 흐름을 기억하는 구조다.

The Structure of the TCB

Let's look at what information the TCB contains. A process's memory space includes code, data, and heap regions, all of which are shared by every thread within that process. The stack, however, exists separately for each thread. Since each thread has an independent execution flow, the local variables and return addresses generated during function calls are stored on each thread's own stack.

Each thread's TCB holds three key pieces of information:

Thread ID: A unique identifier that distinguishes this thread from others.
PC (Program Counter): Indicates how far the thread has executed.
SP (Stack Pointer): Indicates where the stack is currently pointing.

When a thread is paused and needs to resume, these values must be accurately restored. The PC must be restored so execution can continue from where it stopped, and the SP must be restored so the local variables and return addresses on the stack can be correctly referenced.

In short, the TCB stores everything the OS must know to pause and resume a thread — who this thread is, how far it has executed, and where its stack is.

스레드 제어 블록(TCB)의 구조

TCB가 어떤 정보를 담고 있는지 구조적으로 살펴보자. 프로세스의 메모리 공간에는 코드, 데이터, 힙 영역이 있고, 이 영역들은 프로세스 안의 모든 스레드가 함께 공유한다. 반면 스택은 스레드마다 별도로 존재한다. 각 스레드가 독립적인 실행 흐름을 가지기 때문에, 함수 호출 시 생성되는 지역 변수와 반환 주소도 각자의 스택에 따로 저장된다.

각 스레드의 TCB에는 크게 세 가지 정보가 담긴다.

스레드 ID: 이 스레드가 누구인지를 식별하는 고유 번호다.
PC (Program Counter): 스레드가 현재 어디까지 실행되었는지를 나타낸다.
SP (Stack Pointer): 스택이 현재 어디를 가리키고 있는지를 나타낸다.

스레드가 중단되었다가 다시 실행되려면 이 값들이 정확히 복원되어야 한다. PC가 복원되어야 중단된 위치부터 이어서 실행할 수 있고, SP가 복원되어야 스택에 저장된 지역 변수와 반환 주소를 올바르게 참조할 수 있다.

결국 TCB는 운영체제가 스레드를 멈췄다가 다시 실행하기 위해 반드시 알아야 할 정보, 즉 이 스레드가 누구인지, 어디까지 실행됐는지, 스택은 어디에 있는지를 저장하는 자료구조다.

The Relationship Between PCB and TCB

Once you understand the relationship between processes and threads, the relationship between PCB and TCB follows naturally.

The PCB manages information about a process's resources and its entire execution environment. The TCB stores information about the execution flow of each thread within that process.

Taking a web browser as an example: information about the entire browser process is stored in the PCB, while information about the threads handling individual tabs or tasks is stored in each thread's TCB.

In other words, PCB and TCB form a structure where one PCB is linked to multiple TCBs. If a process is the container of the execution environment, then each thread moving within it is individually tracked and managed by the OS through its own TCB.

PCB와 TCB의 관계

프로세스와 스레드의 관계를 이해했다면, 이를 관리하는 자료구조인 PCB와 TCB의 관계도 자연스럽게 이해할 수 있다.

PCB(Process Control Block) 는 프로세스의 자원과 실행 환경 전체에 대한 정보를 관리한다. TCB(Thread Control Block) 는 그 프로세스 안에 존재하는 각 스레드의 실행 흐름에 대한 정보를 저장한다.

웹 브라우저를 예로 들면, 브라우저 프로세스 전체에 대한 정보는 PCB에 저장되고, 각 탭이나 개별 작업을 처리하는 스레드의 정보는 각각의 TCB에 저장된다.

즉 PCB와 TCB는 하나의 PCB에 여러 개의 TCB가 연결된 구조다. 프로세스가 실행 환경의 그릇이라면, 그 안에서 움직이는 각 스레드는 자신만의 TCB를 통해 운영체제에 의해 개별적으로 추적되고 관리된다.

Distinguishing the Roles of PCB and TCB

The relationship between PCB and TCB can be summarized in one sentence:

The PCB manages a process's resources and execution environment; the TCB manages the threads executing within it.

Process-level management information is handled by the PCB, and thread-level execution information is handled by the TCB. The OS uses both data structures together to manage processes and threads hierarchically. If the PCB holds the big picture, the TCB tracks the fine details of each individual execution flow moving within it.

PCB와 TCB의 역할 구분

PCB와 TCB의 관계를 한 문장으로 정리하면 다음과 같다.

PCB는 프로세스의 자원과 실행 환경을 관리하고, TCB는 그 안에서 실행되는 스레드를 관리한다.

프로세스 단위의 관리 정보는 PCB가 담당하고, 스레드 단위의 실행 정보는 TCB가 담당한다. 운영체제는 이 두 자료구조를 함께 활용해 프로세스와 스레드를 계층적으로 관리한다. PCB가 큰 그림을 잡아준다면, TCB는 그 안에서 실제로 움직이는 각각의 실행 흐름을 세밀하게 추적하는 역할을 한다.

Thread Implementation; It Depends on Who Manages Them

We established earlier that a thread is an execution flow. This raises an important question: the way threads are implemented varies depending on how far the OS recognizes and manages them.

Based on who manages the threads, implementations fall into three categories:

User-Level Threads
Kernel-Level Threads
Hybrid Threads

Let's examine how each works and what distinguishes them.

스레드의 구현 방식: 누가 관리하느냐에 따라 달라진다

앞서 스레드는 실행 흐름이라는 것을 배웠다. 여기서 중요한 질문이 하나 생긴다. 운영체제가 스레드를 어디까지 인식하고 관리하느냐에 따라 스레드의 구현 방식이 달라진다는 것이다.

스레드를 관리하는 주체가 누구냐에 따라 크게 세 가지로 구분할 수 있다.

사용자 수준 스레드
커널 수준 스레드
혼합형 스레드

각각의 방식이 어떤 구조로 동작하고, 어떤 차이가 있는지 하나씩 살펴보자.

Implementation ① — User-Level Threads

In the user-level thread model, thread creation, scheduling, and management are performed not by the OS, but by a thread library. The OS does not recognize the existence of threads at the kernel level and views the process as a single execution unit. Therefore, even if multiple threads are taking turns executing within a process, the OS sees it as one process using one CPU.

The advantages are threefold. First, no kernel call is needed when switching threads, so context switch overhead is low. Second, thread creation and management is fast. Third, applications can freely design their own scheduling policies without depending on the OS.

The disadvantage is significant. Parallel execution in multi-core environments is not possible. Because the OS treats the entire process as a single execution unit, it allocates only one CPU. No matter how many threads exist internally, they cannot run on multiple cores simultaneously.

스레드의 구현 방식 ① 사용자 수준 스레드

사용자 수준 스레드 방식에서는 스레드의 생성, 스케줄링, 관리가 운영체제가 아닌 스레드 라이브러리에 의해 수행된다. 운영체제는 커널 수준에서 스레드의 존재 자체를 인식하지 못하며, 해당 프로세스를 하나의 실행 단위로만 바라본다. 따라서 프로세스 내부에서 여러 스레드가 번갈아 실행되더라도, 운영체제 입장에서는 하나의 프로세스가 하나의 CPU를 사용하는 것처럼 보인다.

장점은 세 가지로 정리할 수 있다. 첫째, 스레드 전환 시 커널 호출이 필요 없기 때문에 문맥 교환 오버헤드가 적다. 둘째, 스레드의 생성과 관리가 빠르다. 셋째, 운영체제에 의존하지 않고 프로그램 내부에서 자체적인 스케줄링을 자유롭게 설계할 수 있다.

반면 단점도 분명하다. 가장 큰 문제는 멀티코어 환경에서 병렬 실행이 어렵다는 점이다. 운영체제는 이 프로세스를 하나의 실행 단위로만 인식하기 때문에 CPU를 하나만 할당한다. 내부에 여러 스레드가 존재하더라도 실제로는 여러 코어에서 동시에 실행되지 못하는 것이다.

N:1 Mapping in User-Level Threads

The structure of user-level threads can be understood from two perspectives.

From a structural position standpoint: the user space sits at the top, the kernel space below it, and hardware at the bottom. Multiple user-level threads exist inside a process in user space, and they are created and scheduled by a thread library in user space — not by the kernel. Crucially, these threads are invisible in the kernel space. The kernel sees the process as a single execution unit.

From a mapping standpoint: multiple user-level threads (N threads) are connected to a single kernel thread. This is called N:1 mapping. Even if there are multiple threads in user space, the kernel sees only one execution flow. If a program contains five threads, the kernel assigns only one kernel thread to the CPU.

This structure makes the previously mentioned disadvantage clear. Even in a multi-core environment, parallel execution across multiple cores is impossible. No matter how many threads exist internally, the kernel's single-unit view prevents simultaneous use of multiple cores.

사용자 수준 스레드의 다대일(N:1) 매핑 구조

사용자 수준 스레드의 구조는 두 가지 관점에서 살펴볼 수 있다.

구조적 위치 관계를 보면, 위쪽에는 사용자 영역, 아래에는 커널 영역, 맨 아래에는 하드웨어 영역이 위치한다. 사용자 영역 안의 프로세스 내부에 여러 개의 사용자 수준 스레드가 존재하며, 이 스레드들은 커널이 아닌 사용자 영역의 스레드 라이브러리에 의해 생성되고 스케줄링된다. 중요한 점은 커널 영역에서는 이 스레드들이 보이지 않는다는 것이다. 커널은 해당 프로세스를 그저 하나의 실행 단위로만 인식한다.

매핑 관계를 보면, 여러 개의 사용자 수준 스레드(N개)가 하나의 커널 스레드에 연결되는 구조를 취한다. 이를 N:1 매핑이라고 부른다. 사용자 영역에는 스레드가 여러 개 존재하더라도, 커널 입장에서는 하나의 실행 흐름으로만 보인다. 프로그램 안에 스레드가 다섯 개 있더라도 커널은 하나의 커널 스레드만 CPU에 할당한다.

이 구조에서 앞서 언급한 단점이 명확하게 드러난다. 멀티코어 환경에서도 여러 코어에 병렬로 실행되지 못한다는 것이다. 커널이 프로세스 전체를 하나의 실행 단위로만 보기 때문에, 내부의 스레드가 아무리 많아도 동시에 여러 코어를 활용할 수 없다.

Implementation ② Kernel-Level Threads

In the kernel-level thread model, the OS directly recognizes and manages threads. Thread creation, scheduling, and state management all occur inside the kernel, and CPU scheduling is performed at the thread level rather than the process level. Each thread is treated as an independent execution unit from the OS's perspective.

There are three advantages. First, if one thread enters a waiting state, other threads can continue running. For example, if a thread is waiting for file I/O, the OS places that thread in a waiting state and assigns the CPU to another thread. Second, parallel processing is possible in multi-core environments, since each thread is independently scheduled by the kernel and can run on multiple CPU cores simultaneously. Third, because the kernel manages resources at the thread level, scheduling and protection are handled consistently.

The disadvantage is that kernel involvement is required every time a thread is created or switched. This means context switch overhead is higher than with user-level threads, and the overall management cost increases due to the need for kernel mode transitions.

스레드의 구현 방식 ② 커널 수준 스레드

커널 수준 스레드 방식에서는 운영체제가 스레드를 직접 인식하고 관리한다. 스레드의 생성, 스케줄링, 상태 관리가 모두 커널 내부에서 이루어지며, CPU 스케줄링도 프로세스 단위가 아닌 스레드 단위로 수행된다. 운영체제 입장에서 각 스레드는 독립적인 실행 단위로 취급된다.

장점은 세 가지다. 첫째, 한 스레드가 대기 상태가 되더라도 다른 스레드는 계속 실행될 수 있다. 예를 들어 한 스레드가 파일 입출력을 기다리고 있다면, 운영체제는 해당 스레드를 대기 상태로 두고 다른 스레드에 CPU를 할당한다. 둘째, 멀티코어 환경에서 병렬 처리가 가능하다. 각 스레드가 커널에 의해 독립적으로 스케줄링되기 때문에 여러 CPU 코어에서 동시에 실행될 수 있다. 셋째, 커널이 스레드 단위로 자원을 관리하기 때문에 스케줄링과 보호가 일관되게 이루어진다.

단점도 존재한다. 스레드를 생성하거나 전환할 때마다 커널의 개입이 필요하기 때문에, 사용자 수준 스레드에 비해 문맥 교환 오버헤드가 크다. 커널 모드 전환이 필요한 만큼 전반적인 관리 비용도 증가한다.

1:1 Mapping in Kernel-Level Threads

Kernel-level threads use a 1:1 mapping structure, where each user thread maps exactly to one kernel thread.

Unlike N:1 mapping, where multiple user threads are bound to a single kernel thread, 1:1 mapping allows the kernel to recognize each thread individually and schedule them independently. This enables true parallel execution in multi-core environments, where multiple threads can be assigned to different cores simultaneously.

The contrast with N:1 is clear. In N:1, the kernel sees the process as a single unit, making parallel execution impossible. In 1:1, the kernel directly tracks every thread, allowing full utilization of multi-core hardware.

커널 수준 스레드의 일대일(1:1) 매핑 구조

커널 수준 스레드는 1:1 매핑 구조를 취한다. 사용자 스레드 하나가 커널 스레드 하나와 정확히 대응되는 구조다.

N:1 매핑에서는 여러 사용자 스레드가 하나의 커널 스레드에 묶여 있었던 것과 달리, 1:1 매핑에서는 커널이 각 스레드를 개별적으로 인식하고 독립적으로 스케줄링할 수 있다. 덕분에 멀티코어 환경에서 여러 스레드가 서로 다른 코어에 동시에 할당되어 진정한 병렬 실행이 가능해진다.

앞서 살펴본 사용자 수준 스레드의 N:1 구조와 비교하면 차이가 명확하다. N:1에서는 커널이 프로세스를 하나의 실행 단위로만 보기 때문에 병렬 실행이 불가능했지만, 1:1 구조에서는 커널이 스레드 하나하나를 직접 파악하고 있기 때문에 멀티코어의 이점을 온전히 활용할 수 있다.

Why Do Modern OSes Use Kernel-Level Threads?; Multi-core and Kernel Threads

Most modern computers use multi-core CPUs. Kernel-level threads demonstrate their true value in this environment. Because the kernel independently recognizes and schedules each thread, multiple threads can be distributed across different CPU cores. This effectively improves parallel processing performance.

This is the most decisive difference between user-level and kernel-level threads. User-level threads cannot take advantage of multi-core hardware because the kernel sees the process as a single unit. Kernel-level threads, on the other hand, can run as many threads simultaneously as there are cores, a far more suitable structure for modern hardware.

왜 현대 운영체제는 커널 수준 스레드를 사용하는가? 멀티코어 환경과 커널 수준 스레드

현대 컴퓨터는 대부분 멀티코어 CPU를 사용한다. 커널 수준 스레드는 이 환경에서 진가를 발휘한다. 커널이 각 스레드를 독립적으로 인식하고 스케줄링하기 때문에, 여러 스레드를 서로 다른 CPU 코어에 분산하여 실행할 수 있다. 그 결과 병렬 처리 성능을 효과적으로 끌어올릴 수 있다.

이것이 사용자 수준 스레드와 커널 수준 스레드의 가장 결정적인 차이다. 사용자 수준 스레드는 커널이 프로세스를 하나의 실행 단위로만 보기 때문에 멀티코어의 이점을 살리지 못하지만, 커널 수준 스레드는 코어 수만큼 스레드를 동시에 실행할 수 있어 현대 하드웨어 환경에 훨씬 적합한 구조다.

Why Do Modern OSes Use Kernel-Level Threads? Additional Reasons

Beyond parallel processing, there are further reasons to use kernel-level threads.

First, if one thread enters a waiting state due to I/O, other threads can keep running. Because the kernel manages state at the thread level, one thread stalling does not freeze the entire program.

Second, because the kernel manages resources and performs CPU scheduling at the thread level, resource protection and execution control are handled consistently.

Third, if an error occurs in a specific thread, the impact on the overall system is reduced, making it much easier to isolate and manage problems.

For these reasons, modern operating systems adopt kernel-level threads as their default structure. From parallel processing performance to stable resource management and fault isolation, kernel-level threads meet all the demands of the modern computing environment.

왜 현대 운영체제는 커널 수준 스레드를 사용하는가? 커널 수준 스레드를 사용하는 이유

앞서 살펴본 병렬 처리 외에도, 커널 수준 스레드를 사용하는 이유는 더 있다.

첫째, 한 스레드가 입출력 때문에 대기 상태가 되더라도 다른 스레드는 계속 실행될 수 있다. 커널이 스레드 단위로 상태를 관리하기 때문에, 하나의 스레드가 멈춰도 프로그램 전체가 멈추지 않는다.

둘째, 커널이 스레드 단위로 자원을 관리하고 CPU 스케줄링을 직접 수행하기 때문에 자원 보호와 실행 제어가 일관되게 이루어진다.

셋째, 특정 스레드에서 오류가 발생하더라도 시스템 전체에 미치는 영향이 줄어들어 문제를 통제하고 관리하기가 훨씬 수월하다.

이러한 이유들로 인해 현대 운영체제는 커널 수준 스레드를 기본 구조로 채택하고 있다. 병렬 처리 성능, 안정적인 자원 관리, 오류 격리까지, 커널 수준 스레드는 현대 컴퓨팅 환경이 요구하는 조건을 고루 충족하는 구조다.

When Are User-Level Threads Still Useful?

If kernel-level threads are the standard in modern OSes, when are user-level threads still worth using? The key is when you want to control threads quickly in user space without kernel involvement.

① When thread creation and switching happen very frequently

Repeated kernel calls increase overhead. Since user-level threads handle switching without a kernel mode transition, they can operate relatively faster in such scenarios.

② When an application wants direct control over thread behavior

When you want to implement a custom scheduling policy tailored to specific task characteristics, a library-level design offers much greater flexibility — without being constrained by the kernel's scheduling approach.

③ When OS-independent implementation is needed

Kernel thread implementations vary across operating systems. User-level threads, being library-based, can maintain relatively consistent behavior across different environments.

In summary, user-level threads are a suitable choice when performance optimization and scheduling flexibility are priorities.

사용자 수준 스레드는 언제 사용할까?

커널 수준 스레드가 현대 운영체제의 기본 구조라면, 사용자 수준 스레드는 언제 유용할까? 핵심은 커널의 개입 없이 사용자 영역에서 빠르게 스레드를 제어하고 싶을 때다.

① 스레드 생성과 전환이 매우 자주 발생하는 경우

커널 호출이 반복될수록 오버헤드가 커진다. 사용자 수준 스레드는 커널 모드 전환 없이 처리되기 때문에 이런 상황에서 상대적으로 빠르게 동작할 수 있다.

② 응용 프로그램이 스레드 동작을 직접 제어하고 싶은 경우

특정 작업 특성에 맞는 스케줄링 정책을 직접 구현하고자 할 때, 라이브러리 수준에서 보다 유연하게 설계할 수 있다. 커널의 스케줄링 방식에 얽매이지 않아도 된다는 것이 장점이다.

③ 운영체제에 독립적인 구현이 필요한 경우

운영체제마다 커널 스레드의 구현 방식이 다를 수 있다. 반면 사용자 수준 스레드는 라이브러리 기반으로 동작하기 때문에 여러 환경에서 비교적 일관된 동작을 유지할 수 있다.

결론적으로 성능 최적화와 제어 유연성이 중요한 상황에서는 사용자 수준 스레드가 적합한 선택이 될 수 있다.

Thread Implementation in Modern OSes Linux and Windows

So what approach do real operating systems take? Looking at two representative OSes, both Linux and Windows operate fundamentally on kernel-level threads.

Linux manages threads as the basic unit of execution, with the kernel directly performing scheduling per thread.

Windows likewise treats threads as the basic unit of execution, clearly distinguishing between processes as resource management units and threads as execution units.

Both operating systems operate on the same principle. Modern OSes clearly separate processes (resource management units) from threads (execution units) and adopt a structure where the kernel directly manages threads. The advantages of kernel-level threads covered earlier — parallel processing performance, stable resource management, and consistent execution control; are directly reflected in the design of real operating systems.

현대 운영체제의 스레드 구현 방식; 리눅스와 윈도우

그렇다면 실제 운영체제는 어떤 방식을 사용할까? 대표적인 두 운영체제인 리눅스와 윈도우를 살펴보면, 둘 다 기본적으로 커널 수준 스레드를 기반으로 동작한다.

리눅스는 스레드를 기본 실행 단위로 관리하며, 스레드마다 커널이 직접 스케줄링을 수행한다.

윈도우 역시 스레드를 기본 실행 단위로 관리한다. 프로세스는 자원 관리 단위, 스레드는 실행 단위로 명확히 구분하여 운영한다.

두 운영체제 모두 같은 원칙 위에서 동작하는 셈이다. 현대 운영체제는 프로세스는 자원 관리 단위, 스레드는 실행 단위로 명확히 구분하고, 커널 수준에서 스레드를 직접 관리하는 구조를 채택하고 있다. 앞서 배운 커널 수준 스레드의 장점, 즉 병렬 처리 성능, 안정적인 자원 관리, 일관된 실행 제어가 실제 운영체제 설계에도 그대로 반영된 것이다.

Why Hybrid (M:N) Threads Are Rarely Used

Hybrid threads combine user-level and kernel-level threads. The goal is to map M user threads to N kernel threads, gaining both the fast switching of user-level threads and the parallel execution of kernel-level threads.

In theory this sounds attractive, but in practice it is rarely used. There are two main reasons.

First, complex coordination between the user-level library and the kernel is required. Second, since scheduling responsibility is split between user space and the kernel, implementation and debugging become extremely difficult.

The attempt to combine the advantages of both approaches ends up making the structure overly complex. And since modern OSes have already adopted kernel-level threads as their default, adequately solving the parallel processing problem; there is little reason to accept the added complexity of a hybrid model.

혼합형(M:N) 스레드가 잘 사용되지 않는 이유

혼합형 스레드는 사용자 수준 스레드와 커널 수준 스레드를 함께 사용하는 방식이다. M개의 사용자 스레드를 N개의 커널 스레드에 연결하여, 사용자 수준의 빠른 전환과 커널 수준의 병렬 실행이라는 두 가지 장점을 동시에 얻는 것이 목표다.

이론적으로는 매력적인 구조처럼 보이지만, 실제로는 잘 사용되지 않는다. 이유는 크게 두 가지다.

첫째, 사용자 수준 라이브러리와 커널 사이의 복잡한 연동이 필요하다. 둘째, 스케줄링 책임이 사용자 영역과 커널에 이중으로 분산되기 때문에 구현과 디버깅이 매우 어려워진다.

결국 두 방식의 장점을 합치려다 오히려 구조가 지나치게 복잡해지는 결과를 낳는다. 현대 운영체제가 커널 수준 스레드를 기본으로 채택하면서 병렬 처리 문제가 충분히 해결된 만큼, 굳이 복잡한 혼합형 구조를 감수할 필요가 없어진 것도 한 이유다.

Comparing the Three Thread Implementation Models

Here is a summary of the three models covered:

User-Level Threads (N:1) managed by a user library. Context switch cost is low and performance is fast, but the entire process can stall during I/O waits, and multi-core utilization is impossible. Rarely used today.

Kernel-Level Threads (1:1) managed directly by the OS kernel. Context switch cost is relatively higher, but other threads can continue running during I/O waits, and multi-core utilization is fully supported. The model adopted by most modern operating systems.

Hybrid Threads (M:N) managed jointly by user space and the kernel. Context switch cost is moderate and multi-core utilization is possible, but implementation complexity is very high. Rarely used in practice.

In conclusion, user-level threads are fast but limited in scalability, while kernel-level threads accept modest overhead in exchange for multi-core utilization and stability. Modern operating systems use the 1:1 kernel-level thread model as their default — the practical sweet spot between performance and reliability.

세 가지 스레드 구현 방식 비교

지금까지 살펴본 세 가지 스레드 구현 방식을 한눈에 정리해보자.

사용자 수준 스레드(N:1) 는 사용자 라이브러리가 스레드를 관리한다. 문맥 교환 비용이 낮아 빠르게 동작하지만, I/O 대기 시 전체 스레드가 정지될 수 있고 멀티코어 활용이 불가능하다. 현재는 거의 사용되지 않는다.

커널 수준 스레드(1:1) 는 운영체제 커널이 직접 스레드를 관리한다. 문맥 교환 비용이 상대적으로 높지만, I/O 대기 시에도 다른 스레드가 계속 실행될 수 있고 멀티코어 활용이 가능하다. 현재 대부분의 운영체제가 채택하고 있는 방식이다.

혼합형 스레드(M:N) 는 사용자와 커널이 함께 스레드를 관리한다. 문맥 교환 비용은 중간 수준이며 멀티코어 활용도 가능하지만, 구현 복잡도가 매우 높아 실제로는 거의 사용되지 않는다.

결론적으로 사용자 수준 스레드는 빠르지만 확장성이 제한적이고, 커널 수준 스레드는 약간의 오버헤드를 감수하는 대신 멀티코어 활용과 안정성을 확보할 수 있다. 현대 운영체제는 이 현실적인 균형점으로 1:1 커널 수준 스레드 모델을 기본 구조로 사용하고 있다.

3️⃣ Process / Thread Practice

Working with Threads Directly in Linux

So far we have examined the concepts and implementation of threads from a theoretical perspective. Now let's directly observe thread creation and execution flow in an Ubuntu environment.

In Linux, threads can be handled using the Pthread (POSIX Thread) library. Through this library, we will confirm the process of creating threads, controlling execution, and waiting for termination through hands-on practice.

Before diving into the exercises, let's first review the core Pthread functions and related concepts that will be used throughout.

리눅스에서 스레드를 직접 다뤄보자

지금까지 스레드의 개념과 구현 방식을 이론적으로 살펴봤다. 이번에는 우분투 환경에서 스레드의 생성과 실행 흐름을 직접 확인해본다.

리눅스에서는 Pthread(POSIX Thread) 라이브러리를 사용해 스레드를 다룰 수 있다. 이 라이브러리를 통해 스레드를 생성하고, 실행을 제어하고, 종료를 대기하는 과정을 실습으로 확인할 것이다.

본격적인 실습에 앞서, 실습 과정에서 사용하게 될 Pthread의 핵심 함수와 관련 개념을 먼저 정리해보자.

Preparation — What Is Pthread?

Pthread (POSIX Thread) is a thread library that follows the POSIX standard, used for handling threads in Linux and Unix environments.

An important point here: the C language itself does not natively include syntax or keywords for threads. Rather than supporting threads directly at the language level, C uses a structure where thread functionality provided by the OS is brought in as a library.

This is exactly why the Pthread library is used when implementing threads in C on Linux. The thread functionality itself is provided by the OS kernel, and Pthread serves as an interface that wraps that functionality so it can be conveniently called from a C program.

실습 준비 - Pthread란 무엇인가?

Pthread(POSIX Thread) 는 POSIX 표준을 따르는 스레드 라이브러리로, 리눅스와 유닉스 환경에서 스레드를 다룰 때 사용하는 방식이다.

여기서 중요한 점이 있다. C 언어 자체에는 스레드를 위한 문법이나 키워드가 기본적으로 존재하지 않는다. 즉 C 언어 차원에서 스레드를 직접 지원하는 것이 아니라, 운영체제가 제공하는 스레드 기능을 라이브러리 형태로 가져와서 사용하는 구조다.

리눅스 환경에서 C로 스레드를 구현할 때 Pthread 라이브러리를 사용하는 것도 바로 이 때문이다. 스레드 기능 자체는 운영체제 커널이 제공하고, Pthread는 그 기능을 C 프로그램에서 편리하게 호출할 수 있도록 감싸놓은 인터페이스인 셈이다.

Pthread Core Function ① — pthread_create()

pthread_create() is the function for creating a new thread. It creates an additional execution flow separate from the existing main flow.

An important point here is that when creating a thread, you also specify the function that thread will execute. In other words, calling pthread_create() does not merely create a thread, the specified function immediately begins running as a new execution flow.

Another key thing to remember is that the main function itself is also a thread. Therefore, the moment pthread_create() is called, both the main thread and the newly created thread simultaneously have their own execution flows. This is the moment when two or more execution flows come into existence within a single process.

Pthread 핵심 함수 ① pthread_create()

pthread_create()는 새로운 스레드를 생성하는 함수다. 기존의 메인 실행 흐름과는 별도로 새로운 실행 흐름을 하나 더 만드는 역할을 한다.

여기서 중요한 점은 스레드를 생성할 때 그 스레드가 실행할 함수를 함께 지정한다는 것이다. 즉 pthread_create()를 호출하면 단순히 스레드만 만드는 것이 아니라, 지정한 함수가 새로운 실행 흐름으로 즉시 실행되기 시작한다.

또 한 가지 기억해야 할 점은 메인 함수 자체도 하나의 스레드라는 것이다. 따라서 pthread_create()를 호출하는 순간, 메인 스레드와 새로 생성된 스레드가 동시에 각자의 실행 흐름을 가지게 된다. 하나의 프로세스 안에서 두 개 이상의 실행 흐름이 만들어지는 순간이 바로 이 시점이다.

Pthread Core Function ② — pthread_exit()

pthread_exit() is the function that terminates the currently running thread. The important point is that only that one thread is terminated. Calling pthread_exit() does not terminate the entire process; termination can be controlled at the thread level.

For example, you can terminate only the thread that has finished its task while letting the remaining threads continue running.

This is clearly distinct from the regular exit() function. While exit() terminates the entire process, pthread_exit() terminates only the calling thread. Think of it as the function to use when fine-grained, per-thread termination control is needed.

Pthread 핵심 함수 ② pthread_exit()

pthread_exit()은 현재 실행 중인 스레드를 종료하는 함수다. 여기서 중요한 점은 스레드 하나만 종료된다는 것이다. pthread_exit()을 호출해도 프로세스 전체가 종료되지 않으며, 스레드 단위로 종료 처리가 가능하다.

예를 들어 특정 작업을 마친 스레드만 종료하고, 나머지 스레드는 계속 실행되도록 만들 수 있다.

이는 일반적인 exit() 함수와 명확히 구분된다. exit()은 프로세스 전체를 종료하지만, pthread_exit()은 호출한 스레드만 종료한다. 스레드 단위의 세밀한 종료 제어가 필요할 때 사용하는 함수라고 이해하면 된다.

Pthread Core Function ③ pthread_join()

pthread_join() is the function that makes the current thread wait until a specified thread has terminated. It is most commonly used when the main thread needs to wait for worker threads to finish.

The reason this function matters is that without calling pthread_join(), the main thread may terminate first, which can force-terminate other threads that are still working. Creating a thread is important, but so is properly waiting for its work to complete.

To summarize, pthread_join() serves two roles: it controls the execution order of threads, and it ensures all tasks complete normally. Think of it as a function that must always be paired with thread creation.

Pthread 핵심 함수 ③ pthread_join()

pthread_join()은 특정 스레드가 종료될 때까지 현재 스레드가 대기하도록 만드는 함수다. 주로 메인 스레드가 작업 스레드의 종료를 기다릴 때 사용한다.

이 함수가 중요한 이유는 pthread_join()을 호출하지 않으면 메인 스레드가 먼저 종료될 수 있고, 그 경우 아직 작업 중인 다른 스레드들이 강제로 종료될 수 있기 때문이다. 스레드를 생성하는 것만큼, 작업이 끝날 때까지 제대로 기다려주는 과정도 중요하다.

정리하면 pthread_join()은 두 가지 역할을 한다. 스레드의 실행 순서를 제어하고, 모든 작업이 정상적으로 마무리될 수 있도록 보장한다. 스레드를 생성했다면 반드시 짝을 이루어 호출해야 하는 함수라고 이해하면 된다.

Pthread Core Concept ④ pthread_t (Thread Identifier)

pthread_t is a unique identifier for distinguishing threads; a thread ID. When a thread is created, the OS assigns it a unique ID, through which a specific thread can be waited on, controlled, or have its state checked.

pthread_t serves as the most fundamental identifier for managing threads in the Pthread library. Functions like pthread_join() and pthread_create() covered earlier all rely on this identifier to specify and control particular threads.

Internally, this identifier is one of the key pieces of information stored in the TCB (Thread Control Block). It is the starting point from which the OS tracks and manages threads.

Pthread 핵심 개념 ④ pthread_t (스레드 식별자)

pthread_t는 스레드를 구분하기 위한 고유 식별자, 즉 스레드 ID다. 스레드를 생성하면 운영체제는 각 스레드를 관리하기 위해 고유한 ID를 부여하며, 이 ID를 통해 특정 스레드를 기다리거나 제어하거나 상태를 확인할 수 있다.

pthread_t는 Pthread 라이브러리에서 스레드를 관리하기 위한 가장 기본적인 식별자 역할을 한다. 앞서 살펴본 pthread_join()이나 pthread_create() 같은 함수들도 모두 이 식별자를 기반으로 특정 스레드를 지정하고 제어한다.

내부적으로 보면 이 식별자는 TCB(스레드 제어 블록)에 저장되는 핵심 정보 중 하나다. 운영체제가 스레드를 추적하고 관리하는 출발점이 바로 이 ID라고 이해하면 된다.

Pthread Core Concept ⑤ Passing Data Between Threads

In Pthread, thread functions always receive an argument of type void *. This means data is passed by address, not by value. Whether passing an integer, a struct, or a string, the memory address where the data is stored is what gets handed to the thread.

Why is it designed this way? To unify the form of all thread functions. Regardless of what type of data is passed, the function receives a single address and reinterprets it as the needed type internally. Threads also return their results as void * upon termination, and when multiple values need to be passed, it is common practice to bundle them into a struct and pass the address of that struct.

There is a critical warning here. Suppose you create a local variable inside a function and pass its address to a thread. When that function returns, the local variable disappears from memory. But the thread may still be running. In this case, the thread ends up using an address pointing to memory that no longer exists, causing an invalid memory access.

Therefore, whenever passing data to a thread, always ask: "Will this memory still be alive when the thread finishes?" The core principle of Pthread data passing is that it works by address rather than by value, and the lifetime and scope of that memory must always be considered.

Pthread 핵심 개념: 스레드 간 데이터 전달

Pthread에서 스레드 함수는 항상 void * 형태의 인자를 받는다. 이는 값 자체를 넘기는 구조가 아닌 주소를 넘기는 구조라는 뜻이다. 정수든 구조체든 문자열이든, 스레드에 데이터를 전달할 때는 그 데이터가 저장된 메모리의 주소를 넘겨주게 된다.

왜 이렇게 설계했을까? 스레드 함수의 형태를 하나로 통일하기 위해서다. 어떤 타입의 데이터가 오더라도 일단 주소 하나를 받고, 함수 내부에서 필요한 타입으로 다시 해석하도록 만든 구조다. 스레드가 종료할 때도 마찬가지로 void * 형태로 결과를 반환하며, 여러 값을 전달할 경우에는 구조체로 묶어서 넘기는 방식을 많이 사용한다.

여기서 반드시 주의해야 할 점이 있다. 어떤 함수 안에서 지역 변수를 만들고 그 주소를 스레드에 넘겼다고 가정해보자. 함수가 끝나면 그 지역 변수는 메모리에서 사라진다. 하지만 스레드는 아직 실행 중일 수 있다. 이 경우 스레드는 이미 사라진 메모리를 가리키는 주소를 사용하게 되고, 결국 잘못된 메모리를 참조하는 문제가 발생한다.

따라서 스레드에 데이터를 넘길 때는 항상 "이 메모리가 스레드가 끝날 때까지 살아있는가?" 를 확인해야 한다. Pthread는 데이터를 값이 아닌 주소로 주고받는 구조이며, 그 메모리의 수명과 범위를 반드시 고려해야 한다는 점이 핵심이다.

Example Code ① pthread_create()

Let's start with the code structure. Since C has no built-in thread syntax, #include must be included to use thread functionality. thread_func is the function the newly created thread will execute. In Pthread, thread functions are defined with a void * signature. This function simply prints "새로운 스레드 실행 중" (New thread running) and terminates with return NULL. Looking at the main function: pthread_t tid is a variable that stores the identifier for the created thread. After printing "main 스레드 시작" (main thread start), calling pthread_create(&tid, NULL, thread_func, NULL) creates a new execution flow, and the new thread begins executing thread_func. The key here is that this code has no pthread_join(). The main thread creates the new thread and immediately moves to the next line without waiting, printing "main 스레드 종료" (main thread end). This leads to two possible outcomes. First, the main thread may hold the CPU and terminate before the new thread gets a chance to run, in which case only "main 스레드 시작 → main 스레드 종료" is printed. Second, if the OS allocates the CPU to the new thread first, the output becomes "main 스레드 시작 → 새로운 스레드 실행 중 → main 스레드 종료." Which thread runs first is determined by the OS's scheduling. This example demonstrates exactly that. pthread_create() creates a new execution flow, but without pthread_join(), the main thread does not wait — so the output can differ every time the program runs.

Through this hands-on example, you can observe that the order of output results varies between runs.

예제 코드 ① pthread_create()

코드 구조부터 살펴보자. C 언어에는 기본적으로 스레드 문법이 없기 때문에 #include 를 반드시 포함해야 스레드 기능을 사용할 수 있다. thread_func은 새로 생성된 스레드가 실행할 함수다. Pthread에서는 스레드 함수의 형태가 void *로 정해져 있다. 이 함수 안에서는 단순히 "새로운 스레드 실행 중"을 출력하고 return NULL로 종료한다. 메인 함수를 살펴보자. pthread_t tid는 생성된 스레드를 구분하기 위한 식별자를 저장하는 변수다. "main 스레드 시작"을 출력한 뒤, pthread_create(&tid, NULL, thread_func, NULL)을 호출하는 순간 새로운 실행 흐름이 만들어지고, 생성된 스레드는 thread_func을 실행하게 된다. 여기서 핵심은 이 코드에 pthread_join()이 없다는 점이다. 메인 스레드는 새 스레드를 만들기만 하고 끝날 때까지 기다려주지 않은 채 바로 다음 줄로 내려가 "main 스레드 종료"를 출력한다. 이 때문에 실행 결과가 두 가지로 나타날 수 있다. 첫째, 메인 스레드가 계속 CPU를 점유하다가 새 스레드가 실행되기 전에 종료해버리는 경우로, "main 스레드 시작 → main 스레드 종료"만 출력된다. 둘째, 운영체제가 새 스레드에 CPU를 먼저 할당하는 경우로, "main 스레드 시작 → 새로운 스레드 실행 중 → main 스레드 종료" 순으로 출력된다. 어떤 스레드가 먼저 실행될지는 운영체제의 스케줄링이 결정한다. 이 예제는 바로 그 점을 보여준다. pthread_create()는 새로운 실행 흐름을 만들어주지만, pthread_join()이 없으면 메인 스레드는 새 스레드를 기다려주지 않기 때문에 실행할 때마다 출력 결과가 달라질 수 있다.

위의 실습 예제를 통해 실행 결과값의 순서가 달라지는 것을 확인할 수 있다.

Example Code ② pthread_join()

In the previous example, using only pthread_create() meant execution order was not guaranteed. This example adds pthread_join() to clearly show the difference. Looking at thread_func: it loops from 1 to 3, printing "스레드 작업 1, 2, 3" (Thread task 1, 2, 3), pausing for 1 second with sleep(1) between each iteration. In the main function, after creating the thread with pthread_create(), pthread_join(tid, NULL) is called immediately. pthread_join() is the key. This function puts the current thread into a waiting state until the specified thread has fully terminated. The main thread does not move to the next line until the new thread has printed 1, 2, 3 and terminated with return NULL. Only after that is "main 종료" (main end) printed. Therefore, the output is always fixed: "스레드 작업 1 → 스레드 작업 2 → 스레드 작업 3 → main 종료." The difference from the previous example is clear. Without pthread_join(), execution order varies by scheduling; with it, the main thread is guaranteed to wait for the new thread to finish. To summarize: pthread_create() adds an execution flow, and pthread_join() synchronizes that flow. Join means "wait", it is the core function that guarantees execution order between threads.

Compared to the previous example, you can observe the clear difference between waiting and not waiting.

예제 코드 ② pthread_join()

앞선 예제에서는 pthread_create()만 사용했기 때문에 실행 순서가 보장되지 않았다. 이번 예제는 여기에 pthread_join()을 추가해 그 차이를 명확히 보여준다. thread_func을 보면 1부터 3까지 반복하며 "스레드 작업 1, 2, 3"을 출력하고, 각 반복마다 sleep(1)으로 1초씩 잠시 멈춘다. 메인 함수에서는 pthread_create()로 스레드를 생성한 뒤 곧바로 pthread_join(tid, NULL)을 호출한다. pthread_join()이 핵심이다. 이 함수는 지정한 스레드가 완전히 종료될 때까지 현재 스레드를 대기 상태로 만든다. 즉 메인 스레드는 새 스레드가 1, 2, 3을 모두 출력하고 return NULL로 종료될 때까지 다음 줄로 내려가지 않는다. 그 이후에야 비로소 "main 종료"가 출력된다. 따라서 실행 결과는 항상 "스레드 작업 1 → 스레드 작업 2 → 스레드 작업 3 → main 종료" 순서로 고정된다. 앞 예제와 비교하면 차이가 분명하다. pthread_join()이 없으면 실행 순서가 스케줄링에 따라 달라지지만, pthread_join()이 있으면 메인 스레드가 반드시 새 스레드의 종료를 기다린다. 정리하면, pthread_create()는 실행 흐름을 추가하고, pthread_join()은 그 흐름을 동기화한다. join은 "기다려라"는 의미이며, 스레드 간 실행 순서를 보장하는 핵심 함수다.

앞의 예제와 비교했을때 기다림이 있는 경우와 없는 경우의 차이를 확인할 수 있다.

Example Code ③ pthread_join() Not Used

This example is nearly identical to the previous one, but the critical difference is the absence of pthread_join(). The main thread creates the new thread and immediately drops to the next line without waiting, printing "main 종료."

In this situation, the result depends on which thread the OS allocates the CPU to first. If the main thread finishes execution first, only "main 종료" is printed, and the process terminates before the worker thread completes its work. Even if the new thread runs first, the main may terminate in the middle of printing "스레드 작업 1," and only if lucky will all three outputs appear. The result differs every time.

Two reasons explain this behavior. First, the main thread does not wait for the new thread. Second, when the main thread terminates, the process itself may terminate, forcibly ending any still-running threads along with it.

In summary: without pthread_join(), execution order is subject to scheduling, and thread tasks may not complete fully. This example demonstrates exactly why pthread_join() is essential.

Worker thread tasks may not run to completion. This confirms that threads are not automatically synchronized just because they were created.

예제 코드 ③ pthread_join() 미사용

이번 예제는 앞선 코드와 거의 동일하지만 pthread_join()이 없다는 점이 결정적인 차이다. 메인 스레드는 새 스레드를 생성한 뒤 기다리지 않고 곧바로 다음 줄로 내려가 "main 종료"를 출력한다.

이 상황에서는 운영체제가 어떤 스레드에 CPU를 먼저 할당하느냐에 따라 결과가 달라진다. 메인 스레드가 먼저 실행을 마쳐버리면 "main 종료"만 출력되고 프로세스가 종료되면서 작업 스레드가 끝까지 실행되지 못한다. 새 스레드가 먼저 실행되더라도 "스레드 작업 1" 출력 도중 메인이 끝나버릴 수 있고, 운이으면 1, 2, 3이 모두 출력될 수도 있다. 실행할 때마다 결과가 달라지는 것이다.

이런 차이가 생기는 이유는 두 가지다. 첫째, 메인 스레드가 새 스레드를 기다려주지 않기 때문이다. 둘째, 메인 스레드가 종료되면 프로세스 자체가 종료될 수 있어 아직 실행 중인 스레드도 함께 강제 종료될 수 있기 때문이다.

정리하면, pthread_join()이 없으면 실행 순서는 스케줄링에 따라 달라지고, 경우에 따라 스레드 작업이 끝까지 수행되지 못할 수 있다. 이 예제는 바로 그 이유로 pthread_join()이 반드시 필요하다는 것을 보여주는 코드다.

woker작업이 끝까지 실행되지 않을수도 있다. 스레드는 만들었다고해서 자동으로 동기화되진 않는다는 점을 확인할 수 있다.

Example Code ④ pthread_exit()

The key of this example is pthread_exit(NULL). This function terminates only the current thread, not the entire process. The main thread terminates at this point, but the worker thread continues running. Therefore, the printf("이 문장은 출력되지 않음\n") below pthread_exit() is never printed; the main thread has already terminated. Looking at the output, it appears that "main에서 pthread_exit 호출" (pthread_exit called from main) prints first, followed by "worker 스레드 작업 1, 2, 3." But strictly speaking, this order is not completely fixed. After pthread_create(), the worker thread could receive the CPU first. The order of the first line of output can vary by scheduling. What matters, however, is that the moment the main thread calls pthread_exit(), the process as a whole remains alive - so the worker thread is guaranteed to complete tasks 1, 2, and 3 in full. This is the biggest difference from the previous example without pthread_join(). Ending with return 0 can terminate the entire process when main exits, but using pthread_exit() terminates only the main thread while other threads continue running. To summarize: return 0 can terminate the entire process; pthread_exit() terminates only the current thread. It is the function that allows termination to be controlled at the thread level, ensuring other threads complete their work even after main exits first.

Even after main terminates, the worker thread continues printing. Note that the order of the first output line is not guaranteed.

예제 코드 ④ pthread_exit()

이번 예제의 핵심은 pthread_exit(NULL)이다. 이 함수는 프로세스 전체가 아닌 현재 스레드만 종료시킨다. 메인 스레드는 이 시점에서 종료되지만, 워커 스레드는 계속 실행된다. 따라서 pthread_exit() 아래의 printf("이 문장은 출력되지 않음\n")은 실제로 출력되지 않는다. 메인 스레드가 이미 종료되었기 때문이다. 실행 결과를 보면 "main에서 pthread_exit 호출"이 먼저 출력되고 이후 "worker 스레드 작업 1, 2, 3"이 이어지는 것처럼 보이지만, 엄밀히 말하면 이 순서가 완전히 고정된 것은 아니다. pthread_create() 이후 워커 스레드가 먼저 CPU를 할당받을 수도 있기 때문이다. 첫 줄의 출력 순서는 스케줄링에 따라 달라질 수 있다. 그러나 중요한 점은 메인 스레드가 pthread_exit()을 호출하는 순간 프로세스 전체는 유지되기 때문에, 워커 스레드는 반드시 작업 1, 2, 3을 끝까지 수행하게 된다는 것이다. 이것이 앞선 pthread_join() 미사용 예제와의 가장 큰 차이다. join 없이 return 0으로 끝내면 메인 종료 시 프로세스 전체가 종료될 수 있지만, pthread_exit()을 사용하면 메인 스레드만 종료되고 다른 스레드는 계속 실행된다. 정리하면 return 0은 프로세스 전체 종료 가능, pthread_exit()은 현재 스레드만 종료다. pthread_exit()은 스레드 단위로 종료를 제어할 수 있는 함수이며, 메인이 먼저 끝나더라도 다른 스레드의 작업을 끝까지 보장하고 싶을 때 사용한다.

메인이 종료되도 worker 스레드는 계속 출력이 된다. 첫 출력 순서는 보장되지 않는다.

Example Code ⑤ pthread_self()

Earlier examples covered how to create, terminate, and wait for threads. This one focuses on how to identify a thread.

pthread_self() is a function that returns the ID of the currently running thread. Calling it inside the main function prints the ID of the main thread. After creating a worker thread with pthread_create(), calling pthread_self() inside the worker thread prints that thread's own ID.

The output shows that the main thread's ID and the worker thread's ID are different. This means each thread holds an independent identifier, and the OS manages threads based on these IDs.

The reason for casting to unsigned long is that pthread_t may not be internally identical to a standard integer type, so the cast is needed to print it in a readable numeric format.

One more important point: the main function is also a thread. We tend to think of main simply as the program's entry point, but from Pthread's perspective, main is a thread with its own unique ID.

To summarize, pthread_self() returns the ID of the currently running thread, and this example confirms that every thread — including main — holds a distinct unique ID.

예제 코드 ⑤ 분석 pthread_self()

앞에서는 스레드를 생성하고, 종료하고, 기다리는 방법을 살펴봤다. 이번에는 스레드를 식별하는 방법을 확인해보자. pthread_self()는 현재 실행 중인 스레드의 ID를 반환하는 함수다. 예제에서 메인 함수 안에서 이를 호출하면 현재 실행 중인 메인 스레드의 ID가 출력된다. 이후 pthread_create()로 워커 스레드를 생성하면, 워커 스레드 안에서도 pthread_self()를 통해 자신의 ID를 출력한다. 실행 결과를 보면 메인 스레드의 ID와 워커 스레드의 ID가 서로 다른 것을 확인할 수 있다. 이것은 각 스레드가 독립적인 식별자를 가지고 있으며, 운영체제가 이 ID를 기준으로 스레드를 관리한다는 것을 의미한다. 여기서 unsigned long으로 형변환을 하는 이유는, pthread_t가 내부적으로 정수형과 완전히 동일하지 않을 수 있기 때문에 출력을 위해 정수형으로 변환해주는 것이다. 또 한 가지 중요한 점은 메인도 하나의 스레드라는 것이다. 우리는 보통 메인을 프로그램의 시작점으로만 생각하지만, Pthread의 관점에서 보면 메인도 고유한 ID를 가진 하나의 스레드다. 정리하면, pthread_self()는 현재 실행 중인 스레드의 ID를 반환하는 함수이며, 메인을 포함한 각 스레드는 서로 다른 고유한 ID를 가진다는 점을 확인할 수 있는 예제다.

Linux Basics : Files and Filesystems

Heesu Noh — Wed, 01 Apr 2026 14:43:39 GMT

1️⃣ Everything is a file
2️⃣ Using various file systems

1️⃣ Everything is a file

1. File

What is a File in Linux?

In Linux/Unix, everything is considered a file. Documents, executables, and even hardware devices are all treated as files. This is because unifying everything under the single concept of a "file" makes management and usage much more convenient. Files are broadly divided into three categories: Directory, Regular File, and Special File.

A directory is similar to a folder in Windows, often translated as "list." It does not contain actual data itself, but is composed of the names and location information of the files within it. Since files are distinguished by name, two files with the same name cannot exist within the same directory.

A regular file is the kind of file most people are familiar with, such as Word documents or PowerPoint files. It is divided into text files, which are encoded in a human-readable format, and binary files, which are closer to machine language.

Among special files, there is the Device File. Similar to a driver in Windows that connects hardware to the operating system, Linux treats the device (driver) itself as a file. The file command can be used to check a file's attributes. For example, entering file /dev/input/event0 outputs character special (13/64). Here, character special indicates that it is a character device file, 13 is the Major number representing the type of driver managing the device, and 64 is the Minor number representing which device it is within the same driver.

리눅스에서 파일이란 무엇인가?

리눅스/유닉스에서는 모든 것을 파일로 간주한다. . 문서, 실행 파일은 물론이고 하드웨어 장치까지도 파일로 간주한다. 이렇게 모든 것을 파일이라는 하나의 개념으로 통일하면 관리와 사용이 훨씬 편리해지기 때문이다. 파일은 크게 디렉토리, 일반 파일(Regular File), 특별한 파일(Special File) 세 가지로 나뉜다.

디렉토리는 윈도우의 폴더와 비슷한 개념으로, 흔히 '목록'이라고 번역한다. 실제 데이터가 담겨있는 것이 아니라 파일의 이름과 위치 정보로 구성된다. 파일은 이름으로 구분되기 때문에 같은 디렉토리 안에 동일한 이름의 파일이 두 개 이상 존재할 수 없다.

일반 파일은 워드, PPT처럼 흔히 접하는 파일이다. 사람이 읽을 수 있는 형태로 인코딩된 텍스트 파일과, 기계어에 가까운 바이너리 파일로 구분된다.

특별한 파일에는 장치 파일(Device File) 이 있다. 윈도우에서 하드웨어와 운영체제를 연결하는 드라이버와 비슷한 개념으로, 리눅스에서는 이 장치(드라이버)도 파일로 간주한다. file 명령어로 파일의 속성을 확인할 수 있는데, 예를 들어 file /dev/input/event0을 입력하면 character special (13/64)와 같이 출력된다. 여기서 character special은 문자 장치 파일임을 의미하고, 13은 이 장치를 관리하는 드라이버의 종류(Major 번호), 64는 같은 드라이버 내에서의 장치 번호(Minor 번호)를 나타낸다.

Handling Files - System Call

Files can be handled through system calls. To understand system calls, one must first understand the structure of Linux/Unix.

Linux/Unix is composed of a Kernel and a Shell. The kernel is located at the innermost layer, like the core of the Earth, while the shell wraps around it like a outer layer. Because the kernel is covered by the shell, it cannot be easily accessed from the outside. The means to access the kernel is the System Call.

Representative system calls are as follows. open/close literally opens and closes a file, marking the start and end of file handling. It is possible to prepare to handle an existing file or to create a new one. read is used to retrieve data from a successfully opened file, and write is used to record data to a successfully opened file. lseek is used to change the position at which read or write is performed within a successfully opened file. In addition, there are many other system calls such as access, chdir, chmod, and chown.

파일을 다루는 방법 - 시스템 콜(System Call)

파일은 시스템 콜을 통해 다룰 수 있다. 시스템 콜을 이해하려면 먼저 리눅스/유닉스의 구조를 알아야 한다.

리눅스/유닉스는 커널(Kernel) 과 쉘(Shell) 로 구성된다. 커널은 지구의 핵처럼 가장 안쪽에 위치하고, 쉘은 껍질처럼 커널을 감싸고 있다. 커널은 쉘로 덮여있기 때문에 외부에서 쉽게 접근할 수 없으며, 이 커널에 접근하기 위한 수단이 바로 시스템 콜(System Call) 이다.

대표적인 시스템 콜은 다음과 같다.

open / close는 말 그대로 파일을 열고 닫는 것으로, 파일 다루기의 시작과 종료를 의미한다. 이미 존재하는 파일을 다루기 위한 준비를 하거나, 새로운 파일을 생성하는 것도 가능하다.

read는 open에 성공한 파일의 데이터를 읽어올 때 사용하고, write는 open에 성공한 파일에 데이터를 기록할 때 사용한다.

lseek는 open에 성공한 파일에서 read 또는 write를 수행할 위치를 변경할 때 사용한다.

이 외에도 access, chdir, chmod, chown 등 다양한 시스템 콜이 존재한다.

Filesystem Hierarchy Standard (FHS)

FHS, or the Filesystem Hierarchy Standard, is a standard that defines the locations of files and directories. The standard was created for two main reasons: to allow users and software to predict the location of files.

From a user's perspective, one must know where a desired file is located in order to use it. Not knowing the location means having to search for it manually, which leads to a waste of resources. The standard emerged to solve this problem.

The same applies from a software perspective. Software needs to know the location of files in order to use them flexibly. When installing software, there is a process of specifying the installation location, and for the software to correctly find and use the files it needs, the file locations must be predictable.

Filesystem Hierarchy Standard (FHS)

FHS, 즉 파일시스템 계층 표준은 파일과 디렉토리의 위치를 규정하는 표준이다. 이 표준이 만들어진 이유는 크게 두 가지로, 사용자와 소프트웨어가 파일의 위치를 예측할 수 있도록 하기 위해서다.

사용자 입장에서는 찾고자 하는 파일이 어디에 있는지 알아야 사용할 수 있다. 위치를 모르면 일일이 찾아야 하고, 이는 곧 자원의 낭비로 이어진다. 이러한 문제를 해결하기 위해 표준이 등장한 것이다.

소프트웨어 입장에서도 마찬가지다. 파일의 위치를 알아야 소프트웨어가 해당 파일을 유연하게 활용할 수 있다. 소프트웨어를 설치할 때 설치 위치를 지정하는 과정이 있는데, 이때 소프트웨어가 필요한 파일을 올바르게 찾아 사용하려면 파일이 어디에 위치하는지 예측 가능해야 한다.

Root Filesystem

The root filesystem has a defined basic structure. It must support system boot, reverting to a previous state, recovering removed or lost data, and repairing damaged components.

The root filesystem starts with / and is organized in a tree structure, consisting of many directories such as /bin, /boot, /dev, /etc, and /lib. The name of each directory alone gives a basic understanding of what files it contains. Looking at the main directories: /bin contains essential command binaries, /boot contains static files used by the boot loader, /dev contains device files, and /etc contains system configuration files.

루트 파일시스템(Root Filesystem)

루트 파일시스템은 기본 틀이 정해져 있다. 시스템 부트, 이전 상태로 되돌리기, 제거되거나 손실된 것을 복구하기, 손상된 것을 수리하기 등이 가능해야 한다.

루트 파일시스템은 /로 시작하여 트리 구조로 이루어져 있으며, /bin, /boot, /dev, /etc, /lib 등 많은 디렉토리로 구성된다. 디렉토리 이름만 보아도 해당 디렉토리가 어떤 파일로 구성되어 있는지 기본적으로 파악할 수 있다. 주요 디렉토리를 살펴보면, /bin에는 필수 명령어 이진 파일이, /boot에는 부트 로더가 사용하는 변화없는 파일이, /dev에는 장치 파일이, /etc에는 시스템 설정 파일이 포함된다.

File Redirection : Output to File

In Linux, the result of a command is by default printed to the screen. However, in Linux, the screen (standard output) is also considered a file. Therefore, the direction of output can be redirected to a file instead of the screen, and this is called Redirection.

For example, running ls -l /etc/adduser.conf prints the file information to the screen. However, using the > symbol as in ls -l /etc/adduser.conf > redirect saves the output to a file called redirect instead. Running ls -l redirect confirms that the file has been created, and cat redirect shows that the output that would have appeared on screen has been saved to the file.

Redirection is critically important in practice. When operating a server, there are many situations where program outputs or error messages need to be saved to a file, and redirection solves this with a simple > filename. Since Linux servers often run tasks automatically without anyone at the monitor, saving results to a file for later review is very useful. Redirection is also a prime example of the Linux philosophy "everything is a file" in action.

파일 방향 변경 : 파일로 출력

리눅스에서 명령어의 실행 결과는 기본적으로 화면에 출력된다. 그런데 리눅스에서는 이 화면(표준 출력)도 파일로 간주한다. 따라서 출력의 방향을 화면이 아닌 다른 파일로 돌릴 수 있는데, 이것이 리다이렉션(Redirection) 이다.

위 예시를 보면, ls -l /etc/adduser.conf를 실행하면 해당 파일의 정보가 화면에 출력된다. 그런데 ls -l /etc/adduser.conf > redirect와 같이 > 기호를 사용하면 화면에 출력되어야 할 결과가 redirect라는 파일로 저장된다. 실제로 ls -l redirect로 확인해보면 redirect 파일이 생성된 것을 볼 수 있고, cat redirect로 파일의 내용을 확인하면 원래 화면에 출력되었어야 할 결과가 그대로 저장되어 있는 것을 확인할 수 있다.

리다이렉션은 실무에서 매우 중요하게 활용된다. 서버를 운영하다 보면 프로그램의 실행 결과나 오류 메시지를 파일로 저장해야 할 일이 많은데, 리다이렉션을 활용하면 > 파일명 하나로 간단히 해결된다. 또한 리눅스 서버는 사람이 항상 모니터 앞에 있지 않아도 자동으로 작업이 실행되는 경우가 많기 때문에, 실행 결과를 파일로 저장해두면 나중에 확인할 수 있어 유용하다. 이처럼 리다이렉션은 "모든 것은 파일이=다"라는 리눅스 철학이 실제로 적용되는 대표적인 예시이기도 하다.

File Redirection : Input from File, Output to File

While > redirects output to a file, < does the opposite — it receives input from a file.

Running ls -l /dev > dev_file saves the contents of the /dev directory to a file called dev_file without displaying anything on screen. Then, running cat -v < dev_file > cat_dev_file takes dev_file as input via <, processes it with cat -v, and saves the result to cat_dev_file via >. Since both input and output are directed to files, nothing appears on screen. Note that cat -v, unlike regular cat, displays special and control characters in a human-readable form.

Using < and > together allows an entire workflow of reading from a file, processing it, and saving the result to another file, all in a single command line. For example, when processing large log files on a server, one can use < to receive the log file as input, process it as needed, and save the result to a new file with >. In this way, < and > are important concepts that enable powerful automation in Linux.

파일 방향 변경 : 파일에서 입력, 파일로 출력

앞서 배운 >가 출력의 방향을 파일로 돌리는 것이었다면, <는 반대로 입력의 방향을 파일에서 받아오는 것이다.

먼저 ls -l /dev > dev_file을 실행하면 /dev 디렉토리의 내용이 화면에 출력되지 않고 dev_file이라는 파일로 저장된다. 이후 cat -v < dev_file > cat_dev_file을 실행하면 <를 통해 dev_file의 내용을 입력으로 받아 cat -v로 처리한 뒤, 그 결과를 >를 통해 cat_dev_file이라는 파일로 저장한다. 입력과 출력 모두 파일로 방향이 지정되어 있기 때문에 화면에는 아무것도 출력되지 않는다. 참고로 cat -v는 일반 cat과 달리 특수문자나 제어문자도 눈에 보이는 형태로 출력해주는 옵션이다.

<와 >를 함께 활용하면 파일을 입력으로 받아 처리한 뒤 결과를 다시 파일로 저장하는 흐름을 명령어 한 줄로 처리할 수 있다. 예를 들어 서버에서 대용량 로그 파일을 처리할 때, <로 로그 파일을 입력받아 필요한 처리를 한 뒤 >로 결과를 새로운 파일로 저장하는 식으로 활용할 수 있다. 이처럼 <와 >는 리눅스의 강력한 자동화 처리를 가능하게 하는 중요한 개념이다.

Counting Lines, Words, and Characters in a File (wc)

Counting Lines, Words, and Characters in a File (wc) wc stands for word count. Running man wc confirms that it is a command that outputs "print newline, word, and byte counts for each file." Running wc cat_dev_file outputs 195 1935 10668 cat_dev_file, representing the line count (195), word count (1935), and byte count (10668) in that order. The same result can be obtained in three ways: specifying the filename directly with wc cat_dev_file, passing the file as input with wc < cat_dev_file, or using a pipe with cat cat_dev_file | wc. All three methods produce the same result of 195 1935 10668. The pipe (|) passes the output of the preceding command directly as input to the following command. This allows complex tasks to be handled in a single line by combining multiple commands. For example, extracting only lines containing a specific word from a large log file, sorting them, and removing duplicates can all be done at once with cat filename | grep word | sort | uniq. In this way, the pipe, along with < and >, is a core feature that enables automation and efficient data processing in Linux.

파일에서 라인 수, 단어 수, 문자 수 확인하기

(wc) wc는 word count의 약자로, man wc 명령을 통해 확인하면 "print newline, word, and byte counts for each file", 즉 파일의 라인 수, 단어 수, 바이트 수를 출력하는 명령어임을 알 수 있다. wc cat_dev_file을 실행하면 195 1935 10668 cat_dev_file이 출력되는데, 순서대로 라인 수(195), 단어 수(1935), 바이트 수(10668) 를 의미한다. 위 예시에서 세 가지 방법으로 동일한 결과를 얻는 것을 볼 수 있다. wc cat_dev_file처럼 파일명을 직접 지정하거나, wc < cat_dev_file처럼 <를 통해 파일을 입력으로 넘겨주거나, cat cat_dev_file | wc처럼 |(파이프)를 사용하는 방법이다. 세 방법 모두 결과는 195 1935 10668로 동일하다. 파이프(|)는 앞 명령어의 출력 결과를 뒤 명령어의 입력으로 바로 넘겨주는 역할을 한다. 이를 활용하면 여러 명령어를 조합해 복잡한 작업을 한 줄로 처리할 수 있다. 예를 들어 대용량 로그 파일에서 특정 단어가 포함된 줄만 골라내고, 정렬하고, 중복을 제거하는 작업을 cat 파일명 | grep 단어 | sort | uniq처럼 명령어를 연결하는 것만으로 한 번에 처리할 수 있다. 이처럼 파이프는 앞서 배운 <, >와 함께 리눅스의 자동화와 효율적인 데이터 처리를 가능하게 하는 핵심 기능이다.

Searching File Contents by Pattern (grep)

grep stands for global regular expression print, and is a command used to search for specific patterns within file contents. For example, when looking for words starting with 'g' and ending with 'p', or characters meeting specific conditions between two letters, grep allows for convenient pattern-based searching without having to type out every possible case.

Running grep kvm cat_dev_file finds and outputs all lines containing kvm within cat_dev_file. Adding the -n flag also outputs the line numbers of matching lines. In the example, kvm is found on lines 162, 192, and 193.

The main grep flags are as follows. -n outputs the line numbers of matching lines, -r searches recursively through directories, -c outputs only the count of matching lines, and -l outputs only the names of matching files. -i searches without case sensitivity — keep in mind that Linux is case-sensitive by default.

파일 내용에서 패턴으로 검색하기 (grep)

grep은 global regular expression print의 약자로, 파일 내용에서 특정 패턴을 검색할 때 사용하는 명령어다. 예를 들어 'g'로 시작하고 'p'로 끝나는 단어, 또는 두 글자 사이에 특정 조건을 만족하는 문자가 있는 것을 찾고 싶을 때, 일일이 모든 경우를 입력하지 않고 패턴으로 간편하게 검색할 수 있다.

grep kvm cat_dev_file을 실행하면 cat_dev_file 안에서 kvm이 포함된 줄을 찾아 출력한다. 여기에 -n 플래그를 추가하면 일치하는 줄의 번호도 함께 출력된다. 예시에서 162번째 줄, 192번째 줄, 193번째 줄에 kvm이 포함되어 있음을 확인할 수 있다.

grep의 주요 플래그는 다음과 같다. -n은 일치하는 줄의 번호를 출력하고, -r은 디렉토리 안을 반복해서 검색하며, -c는 일치하는 줄의 개수만 출력하고, -l은 일치하는 파일 이름만 출력한다. -i는 대소문자를 구분하지 않고 검색하는 플래그인데, 리눅스는 기본적으로 대소문자를 구분한다는 점을 기억해두자.

Wildcard

A wildcard is a feature that allows multiple targets to be specified at once using a pattern when searching for files or directories. Detailed information can be checked with the man 7 glob command.

* matches anything regardless of the number of characters. For example, running echo /dev/v* outputs all files in the /dev directory that start with v. ls /dev/v* similarly lists all files starting with v.

? substitutes for exactly one character. For example, searching /dev/vcs? returns only files where exactly one character follows vcs. Using ?? means exactly two characters, and the number of characters is determined by the number of question marks used.

In this way, wildcards make it very convenient to handle multiple files matching a pattern at once without having to type each filename individually.

와일드 카드 (Wildcard)

와일드 카드는 파일이나 디렉토리를 검색할 때 패턴을 사용해 여러 대상을 한 번에 지정할 수 있는 기능이다. man 7 glob 명령으로 자세한 내용을 확인할 수 있다.

*는 아무 문자나 몇 글자든 상관없이 일치하는 것을 모두 찾는다. 예를 들어 echo /dev/v*를 실행하면 /dev 디렉토리에서 v로 시작하는 모든 파일을 출력한다. ls /dev/v*도 마찬가지로 v로 시작하는 모든 파일을 나열한다.

?는 딱 한 글자만 대체한다. 예를 들어 /dev/vcs?를 검색하면 vcs 뒤에 한 글자만 오는 파일만 검색된다. ??처럼 물음표를 두 개 쓰면 두 글자를 의미하며, 물음표 개수만큼 글자 수가 정해진다.

이처럼 와일드 카드를 활용하면 일일이 파일 이름을 입력하지 않고도 패턴에 맞는 여러 파일을 한 번에 다룰 수 있어 매우 편리하다.

2️⃣ Using various file systems

Using Various File Systems : Mount

We learned that in Linux, everything is considered a file. A concept closely connected to this is Mount.

In Windows, plugging a USB into a PC automatically assigns a drive letter such as C:\, D:\, or E:\. In Linux, however, a mount process is required instead. Recalling the root filesystem covered earlier, the USB device must be mounted somewhere within the tree structure that starts from /. In other words, if a device such as /dev/usb has a filesystem, mounting means adding that filesystem to the existing file hierarchy.

Mounting requires administrator (root) privileges. When checking file information with the ls command, permissions are displayed in a format such as rwx rwx. Only users with administrator privileges can execute the mount command; those without cannot.

다양한 파일시스템 사용 : 마운트(Mount)

리눅스에서는 모든 것을 파일로 간주한다고 배웠다. 이와 연결되는 개념으로 마운트(Mount) 에 대해 알아보자.

윈도우에서는 USB를 PC에 꽂으면 C:\, D:\, E:\ 와 같이 알파벳 드라이브 문자가 자동으로 할당된다. 그러나 리눅스에서는 이와 다르게 마운트 과정이 필요하다. 앞서 배운 루트 파일시스템을 떠올려보면, /로 시작하는 트리 구조 안의 어딘가에 USB 장치를 마운트해주어야 한다. 즉, /dev/usb와 같은 장치에 파일시스템이 있다면 그 파일시스템을 기존의 파일 계층 구조에 추가해주는 것이 마운트다.

마운트를 하기 위해서는 관리자(root) 권한이 필요하다. ls 명령으로 파일 정보를 확인하면 rwx rwx 와 같은 형태로 권한이 표시되는데, 관리자 권한을 가진 사용자만 마운트 명령을 실행할 수 있으며 그렇지 않은 사용자는 사용할 수 없다.

How to Use the Mount Command

The basic form of the mount command is mount -t filesystem device_name mount_point.

Looking at each element: a filesystem defines how data is stored and managed on a storage device. As the number of files grows, they need to be managed systematically, and that management method is the filesystem. There are many types of filesystems, and they differ by operating system. Windows primarily uses NTFS, while Linux primarily uses ext. Therefore, when mounting, one must know what filesystem the device uses. The device name is the path of the device to be mounted, specified in the form /dev/device_name. The mount point is the path that will be accessed after mounting, specified as a location within the root filesystem such as /home or /linux/usb.

Unmounting is performed with the umount mount_point command, which detaches the device from the added file hierarchy. However, a device cannot be unmounted while it is in use, and a busy warning message will be displayed. This is similar to what happens in Windows when a USB is pulled out without clicking the eject button, causing files to become inaccessible. Removing a device before synchronization is complete can corrupt files, so it is essential to unmount the device only after finishing its use.

마운트 명령어 사용법

마운트 명령어의 기본 형태는 mount -t 파일시스템 장치이름 사용위치이다.

각 항목을 살펴보면, 먼저 파일시스템이란 저장장치에 데이터를 어떻게 저장하고 관리하느냐를 정의하는 방식이다. 파일이 많아지면 이를 체계적으로 관리해야 하는데, 그 관리 방식이 바로 파일시스템이다. 파일시스템의 종류는 매우 다양하며 운영체제마다 다르다. 윈도우는 NTFS를, 리눅스는 ext를 주로 사용한다. 따라서 마운트를 할 때는 해당 장치가 어떤 파일시스템을 사용하는지 반드시 알아야 한다. 장치이름은 마운트할 장치의 경로로, /dev/장치이름의 형태로 지정한다. 사용위치는 마운트 후 실제로 접근하게 될 경로로, /home 이나 /linux/usb와 같이 루트 파일시스템 안의 위치를 지정한다.

마운트 해제는 umount 사용위치 명령으로 수행하며, 추가된 파일 계층 구조에서 해당 장치를 떼어내는 것이다. 단, 해당 장치가 사용 중일 때는 해제할 수 없으며 busy라는 경고 메시지가 출력된다. 이는 윈도우에서 USB를 꺼내기 버튼을 누르지 않고 바로 뽑았을 때 파일에 접근이 안 되는 경우와 비슷한 개념이다. 동기화가 완료되지 않은 상태에서 장치를 제거하면 파일이 손상될 수 있기 때문에, 반드시 사용이 끝난 후 마운트를 해제해야 한다.

SMB (Server Message Block)

SMB is a client/server protocol used in mounting.

The mounting covered so far has been local, meaning it takes place within a single machine. However, to use storage located remotely, one would have to copy and retrieve the data files, which is a cumbersome and complex process. SMB resolves this inconvenience.

In SMB, the server provides a filesystem to be shared, and the client uses the server's files over the network. Not only files but also resources such as printers can be provided by the server, and from the client's perspective, remote files can be used just as if they were local.

SMB (Server Message Block)

SMB는 마운트에서 사용되는 클라이언트/서버 방식의 프로토콜이다.

지금까지 배운 마운트는 로컬, 즉 하나의 기계 안에서 이루어지는 것이었다. 그런데 원격에 있는 스토리지를 사용하려면 데이터 파일을 복사해서 가져와야 하는데, 이 과정이 번거롭고 복잡하다. SMB는 이러한 불편함을 해결해준다.

SMB에서 서버는 공유할 파일시스템을 제공하고, 클라이언트는 네트워크를 통해 서버의 파일을 사용한다. 파일뿐만 아니라 프린터와 같은 자원도 서버에서 제공받을 수 있으며, 클라이언트 입장에서는 로컬에 있는 파일을 사용하는 것과 동일하게 사용할 수 있다.

NFS (Network File System)

NFS is a distributed filesystem protocol developed by Sun Microsystems.

When filesystems were discussed earlier, the focus was on how data is stored and managed on physical storage. However, despite having "filesystem" in its name, NFS does not serve that role. NFS is a network protocol that defines how files are exchanged and shared between a server and a client.

Like SMB, it is a client/server protocol where the server provides a filesystem to be shared and the client uses it over the network. One notable characteristic of NFS is that it can be used across multiple operating systems.NFS (Network File System)

NFS는 Sun Microsystems에서 개발한 분산 파일시스템 프로토콜이다.

앞서 파일시스템을 설명할 때는 물리적인 스토리지에 데이터를 어떻게 저장하고 관리할 것인가에 관한 것이었다. 그러나 NFS는 이름에 파일시스템이 들어가 있더라도 그 역할을 하지 않는다. NFS는 서버와 클라이언트 사이에서 파일을 어떻게 주고받으며 공유할 것인지를 정의하는 네트워크 프로토콜이다.

SMB와 마찬가지로 클라이언트/서버 방식의 프로토콜로, 서버는 공유할 파일시스템을 제공하고 클라이언트는 네트워크를 통해 이를 사용한다. NFS의 특징 중 하나는 여러 운영체제에서 사용 가능하다는 점이다.

Software Test Planning and Risk Management

Heesu Noh — Sat, 28 Mar 2026 15:19:24 GMT

1️⃣ Core Concepts of Test Planning
2️⃣ Risk Management Overview
3️⃣ Risk-based testing strategy

1️⃣ Core Concepts of Test Planning

What is a Test Plan?

To perform software testing systematically, prior planning is essential. Regardless of what is being managed, proceeding without a plan leads to losing direction, and testing is no exception. This is why we have consistently emphasized from previous weeks that a Test Plan must be established before testing begins.

A concept studied alongside this is the PDCA cycle. A continuous loop of Plan → Do → Check → Act. The test plan corresponds to the Plan phase, which is the starting point of this cycle. In this phase, we clearly define what to test, how far to test, and what goals we aim to achieve through testing at a high level.

The reason a test plan goes beyond mere preparation is that all subsequent activities; test design, execution, evaluation, and improvement; are carried out based on this plan. In other words, the test plan serves as the foundation that sets the direction and criteria for all downstream activities.

In summary, the test plan is the compass of the entire test process. The clearer the plan, the more consistently and efficiently all subsequent testing activities can be carried out.

테스트 계획(Test Plan)이란?

소프트웨어 테스트를 체계적으로 수행하기 위해서는 반드시 사전 계획이 필요하다. 어떤 대상을 관리하든 계획 없이 진행하면 방향을 잃기 쉽고, 테스트도 마찬가지다. 그래서 우리는 테스트를 시작하기 전에 반드시 테스트 계획(Test Plan) 을 세워야 한다는 점을 이전 주차부터 꾸준히 강조해왔다.

이와 관련하여 함께 학습한 개념이 바로 PDCA 사이클이다. PDCA란 Plan(계획) → Do(실행) → Check(평가) → Act(개선)의 순환 구조를 말하며, 테스트 계획은 이 사이클의 출발점인 Plan 단계에 해당한다. 이 단계에서는 무엇을 테스트할 것인지, 어디까지 테스트할 것인지, 그리고 상위 수준에서 테스트를 통해 달성하고자 하는 목표가 무엇인지를 명확하게 정의한다.

테스트 계획이 단순한 준비 작업에 그치지 않는 이유는, 이후에 이루어지는 테스트 설계, 수행, 평가 및 개선의 모든 활동이 바로 이 계획을 기준으로 전개되기 때문이다. 즉, 테스트 계획은 다음 작업들의 방향과 기준을 잡아주는 토대 역할을 한다.

정리하자면, 테스트 계획은 테스트 전체 프로세스의 나침반이다. 계획이 명확할수록 이후의 모든 테스트 활동이 더 일관성 있고 효율적으로 이루어질 수 있다.

Test Planning in the ISO/IEC/IEEE 29119 Standard

ISO/IEC/IEEE 29119 is an international standard covering software testing as a whole, structured into three layers. At the top is the Organizational Test Process, which establishes test policies and strategies applicable across the entire organization. Below that is the Test Management Process, responsible for planning and managing testing at the individual project level. Finally, the Dynamic Test Process is where actual test design, execution, and result processing take place. The test plan corresponds to the first activity within the Test Management Process.

There is one important principle to note here. A test plan is not created independently without context ; it must reflect the test policies and strategies established in the Organizational Test Process. Only when the direction of the higher layer is naturally embedded into the lower-level plan can consistent testing be achieved throughout the project.

So what exactly should a test plan document contain? First, the scope and test items must be clearly identified, defining what and how far to test. Next, the test objectives must be set, defining what the testing aims to achieve. Finally, a test strategy must be developed based on the Organizational Test Process, outlining how to achieve those objectives.

Throughout this process, the word strategy appears frequently. This refers not simply to "what to do," but to the specific methodology for "how to proceed systematically." Without a strategy, testing can easily lose its direction. Therefore, strategy is a core component that must be included in the test plan document.

Ultimately, the 29119 standard positions the test plan within a hierarchical flow of Organizational Policy → Test Strategy → Execution, and the test plan document is the tangible output that formalizes this flow.

ISO/IEC/IEEE 29119 표준에서의 테스트 계획

ISO/IEC/IEEE 29119는 소프트웨어 테스트 전반을 다루는 국제 표준으로, 크게 세 개의 계층으로 구성되어 있다. 가장 상위에는 조직 전체에 적용되는 테스트 정책과 전략을 수립하는 조직 차원의 테스트 프로세스가 있고, 그 아래에는 개별 프로젝트 수준에서 테스트를 계획하고 관리하는 테스트 관리 프로세스, 그리고 실제 테스트 설계와 실행, 결과 처리가 이루어지는 동적 테스트 프로세스가 순서대로 위치한다. 테스트 계획은 이 중 테스트 관리 프로세스의 첫 번째 활동에 해당한다.

여기서 한 가지 중요한 원칙이 있다. 테스트 계획은 아무런 맥락 없이 독립적으로 세워지는 것이 아니라, 반드시 상위 계층인 조직 차원의 테스트 프로세스에서 만들어진 테스트 정책과 전략을 반영해야 한다는 점이다. 상위 계층의 방향성이 하위 계획에 자연스럽게 녹아들어야만 프로젝트 전반에 걸쳐 일관성 있는 테스트가 가능하기 때문이다.

그렇다면 테스트 계획서에는 구체적으로 무엇이 담겨야 할까. 먼저 무엇을 어디까지 테스트할 것인지 대상과 범위를 명확히 식별해야 하고, 이번 테스트를 통해 달성하고자 하는 바를 테스트 목표로 설정해야 한다. 그리고 조직 차원의 테스트 프로세스를 기반으로 그 목표를 달성하기 위한 테스트 전략을 수립해야 한다.

이 과정에서 유독 전략(Strategy) 이라는 단어가 자주 등장하는데, 이는 단순히 "무엇을 할 것인가"에 머무르지 않고 "어떻게 체계적으로 해나갈 것인가"에 대한 구체적인 방법론을 의미한다. 목표만 있고 전략이 없다면 테스트는 쉽게 방향을 잃을 수 있기 때문에, 전략은 테스트 계획서의 핵심 구성 요소로 반드시 포함되어야 한다.

결국 29119 표준은 테스트 계획을 조직의 정책 → 테스트 전략 → 실행으로 이어지는 계층적 흐름 속에 위치시키고 있으며, 테스트 계획서는 바로 이 흐름을 하나의 문서로 구체화한 결과물이라고 할 수 있다.

Detailed Process of Test Planning in 29119

To develop a test plan, the process begins with understanding the project context. Only by grasping the overall picture; including the development scope and overall schedule - can the test scope be defined and a high-level test concept be formed. This concept then shapes the rough outline of the test plan and development schedule.

Once the schedule takes shape, the next step is risk identification and analysis; identifying and analyzing potential risk factors across the project. The methods identified to mitigate these risks are then incorporated into the test strategy design. Afterward, specific human resources and schedules are determined; who will perform the testing, what resources will be used, when and how; and these are documented in the test plan document. The completed document goes through consensus with stakeholders and is shared, completing the test planning process.

The most critical part of this entire flow is the three-step sequence of risk identification and analysis → risk mitigation identification → test strategy design. Planning is fundamentally an act of preparing for future events in advance. Therefore, risks that could affect test execution must always be included in the test plan. Ultimately, the core principle emphasized by the 29119 standard is that test planning must be risk-based.

29119 테스트 계획의 세부 프로세스

테스트 계획을 수립하기 위해서는 가장 먼저 프로젝트의 컨텍스트를 이해하는 것에서 출발한다. 개발하고자 하는 범위나 전체 일정 등 프로젝트의 전반적인 맥락을 파악해야 비로소 테스트의 범위가 정해지고, 테스트에 대한 전체적인 구상이 가능해진다. 이 구상을 바탕으로 테스트 계획과 개발 일정의 큰 틀이 만들어진다.

일정의 윤곽이 잡히면 그 다음으로는 위험 식별 및 분석 단계가 이어진다. 프로젝트 전반에 걸쳐 어떤 위험 요소가 존재하는지 찾아내고 이를 분석하는 과정이다. 이렇게 분석된 위험을 완화하기 위한 방법을 도출하고, 그 방법을 반영하여 테스트 전략을 설계하게 된다. 이후 누가 수행할 것인지, 어떤 자원을 사용할 것인지, 언제 어떻게 진행할 것인지와 같은 구체적인 인적 자원과 일정을 결정하여 테스트 계획서로 문서화한다. 완성된 계획서는 관련자들과 합의를 거쳐 공유되며 테스트 계획 수립이 마무리된다.

이 전체 흐름에서 가장 중요하게 짚어야 할 부분은 위험 식별 및 분석, 위험 완화 방법 식별, 테스트 전략 설계로 이어지는 세 단계다. 계획이란 본질적으로 미래에 일어날 일들을 미리 대비하는 행위다. 그렇기 때문에 테스트 계획을 세울 때도 앞으로의 테스트 수행에 영향을 미칠 수 있는 리스크를 반드시 계획 안에 포함시켜야 한다. 결국 테스트 계획은 위험을 기반으로 수립된다는 것이 29119 표준이 강조하는 핵심 원칙이다.

그렇다면 구체적으로 어떤 기준으로 위험을 식별하고 분석하는 것인지가 자연스러운 다음 질문이 된다. 다음 시간에는 바로 이 위험을 식별하고 분석하는 기준과 방법에 대해 자세히 살펴볼 예정이다.

Structure of the Test Plan Document; Master Test Plan and Level Test Plans

Once test planning is complete, the resulting artifact is the test plan document, which is divided into two types: the Master Test Plan and Level Test Plans.

The Master Test Plan is a comprehensive document that consolidates and manages all the subordinate level test plans. Its purpose is to oversee and control multiple test levels and non-functional testing from a holistic perspective — it is essentially the top-level plan that coordinates the entire testing effort.

Beneath the Master Test Plan are individual Level Test Plans, each specific to a particular test level. These detail the test strategy, specific activities, detailed schedule, test owners, execution methods, and tools for each test level. The test types covered correspond to the right side of the V-model; unit testing, integration testing, system testing, acceptance testing — as well as non-functional testing such as load and performance testing.

In summary, if the Master Test Plan is the big picture that provides an overview of all testing, then the Level Test Plans are the detailed blueprints that specify exactly how each test will be conducted within that big picture.

테스트 계획서의 구성; 총괄 테스트 계획과 단계별 테스트 계획

테스트 계획 수립이 완료되면 그 결과물로 테스트 계획서가 만들어진다. 테스트 계획서는 크게 총괄 테스트 계획과 단계별 테스트 계획 두 가지로 나뉜다.

먼저 총괄 테스트 계획은 하위에 존재하는 여러 단계별 테스트 계획들을 하나로 묶어 종합적으로 관리하는 계획서다. 여러 테스트 단계와 비기능 테스트 등을 전체적인 시각에서 관리하고 통제하는 것이 목적이며, 말 그대로 테스트 전반을 조율하는 최상위 계획서라고 볼 수 있다.

그리고 이 총괄 테스트 계획 아래에는 각 단계별로 세분화된 단계별 테스트 계획이 존재한다. 단계별 테스트 계획에서는 각 테스트 단계에서 수행할 테스트 전략, 구체적인 활동, 세부 일정, 테스트 담당자, 수행 방법, 사용 도구 등을 상세하게 계획한다. 대상이 되는 테스트 유형은 V 모델의 오른쪽에 해당하는 단위 테스트, 통합 테스트, 시스템 테스트, 인수 테스트, 그리고 사용량이나 성능 등을 검증하는 비기능 테스트까지 포함된다.

정리하자면, 총괄 테스트 계획이 전체 테스트를 조망하는 큰 그림이라면, 단계별 테스트 계획은 그 큰 그림 안에서 각 테스트를 어떻게 실제로 수행할 것인지를 구체적으로 담아낸 세부 설계도라고 할 수 있다.

Contents of the Master Test Plan and Level Test Plans

Since the test plan document is divided into master and level plans, the depth and specificity of content in each naturally differs.

The Master Test Plan contains high-level content: the test purpose and scope explaining why testing is being conducted; the test item definition clarifying what will be tested; the test strategy and approach describing how testing will proceed; the overall schedule; the organizational structure and roles defining who is responsible for what; and assumptions and constraints that may affect test execution. In short, the Master Test Plan is a document capturing the big picture of all testing.

Based on this master plan, individual Level Test Plans are created — separate documents for each test level such as unit, integration, system, and acceptance testing. These contain much more specific content: the test scope and strategy for that level, the activities and objectives to be performed, the characteristics of the test items, test design methods, constraints during execution, input/output work products, test tools to be used, the detailed schedule, and the final deliverables.

Ultimately, while the Master Test Plan provides the overall direction and criteria, the Level Test Plans serve as detailed execution guides explaining exactly how each test level will be carried out. Only when the two documents are organically connected can systematic and consistent test execution be achieved.

[예시] 총괄 및 단계별 테스트 계획서의 구성 항목

테스트 계획서는 총괄과 단계별로 나뉘는 만큼, 각각에 담기는 내용의 수준과 세부성도 자연스럽게 달라진다.

총괄 테스트 계획서에는 상위 수준의 내용들이 담긴다. 테스트를 왜 수행하는지에 대한 테스트 목적 및 범위, 무엇을 테스트할 것인지를 정의하는 테스트 대상 시스템 정의, 테스트를 어떻게 진행할 것인지에 대한 테스트 전략과 수행 절차, 전체적인 일정, 누가 어떤 역할을 맡을 것인지에 대한 조직 구성 및 역할, 그리고 테스트 수행에 영향을 줄 수 있는 가정 및 제약사항 등이 포함된다. 즉, 총괄 테스트 계획서는 테스트 전반을 조망하는 큰 그림을 담은 문서라고 할 수 있다.

이 총괄 계획서를 기반으로 단계별 테스트 계획서가 만들어진다. 단위 테스트, 통합 테스트, 시스템 테스트, 인수 테스트와 같이 각 테스트 단계별로 세분화된 계획서가 따로 작성되며, 여기에는 훨씬 구체적인 내용들이 담긴다. 해당 단계에서의 테스트 범위 및 전략, 수행해야 할 활동과 목적, 테스트 대상의 특성, 테스트 설계 방법, 수행 시의 제약사항, 테스트의 입출력 산출물, 활용할 테스트 도구, 세부 일정, 그리고 최종적으로 만들어지는 산출물 등이 해당된다.

결국 총괄 테스트 계획서가 전체 방향과 기준을 제시하는 문서라면, 단계별 테스트 계획서는 그 기준 아래에서 각 테스트 단계를 실제로 어떻게 수행할 것인지를 상세하게 풀어낸 실행 지침서라고 볼 수 있다. 두 문서가 유기적으로 연결될 때 비로소 체계적이고 일관성 있는 테스트 수행이 가능해진다.

Input/Output Work Products and Exit Criteria for Unit and System Test Plans

Let us examine how a Level Test Plan is structured in practice, using unit testing and system testing as examples.

Unit Test Plan

To conduct unit testing, the input work products required are the unit test plan and unit test cases. After executing the test cases, the output work product; the test results report; is produced. Tools such as JUnit, a Java-based testing framework, are used in this process.

A key aspect of unit testing is the exit criteria. Since unit testing involves directly examining the source code, coverage - a measure of how much code has been tested - is used as the exit criterion. The two most common types are statement coverage (the ratio of executed statements to total statements) and branch coverage (the ratio of tested branches or decision points). A target value is set based on these metrics, and the test is considered complete when the target is met.

System Test Plan

System testing is performed based on the results of requirements analysis in the V-model. Like unit testing, the input work products are the system test plan and test cases, and the output work product after execution is the test results report. Tools such as JMeter may be used at this level.

The exit criteria for system testing are based on requirements coverage — the percentage of total requirements for which testing has been performed. A target percentage is set and serves as the completion benchmark.

While detailed schedules, test owners, and specific execution methods are also included in the plan, even the input/output work products and exit criteria alone illustrate how concretely and measurably a Level Test Plan must be written.

[예시] 단위 테스트 및 시스템 테스트 계획서의 입/출력 산출물과 완료 기준

단계별 테스트 계획서가 실제로 어떻게 구성되는지를 단위 테스트와 시스템 테스트를 예시로 살펴보자.

단위 테스트 계획서

단위 테스트를 수행하기 위해서는 먼저 입력 산출물로 단위 테스트 계획서와 단위 테스트 케이스가 필요하다. 테스트 케이스를 기반으로 테스트를 수행하고 나면 출력 산출물인 테스트 결과서가 만들어진다. 이 과정에서는 Java 기반의 JUnit과 같은 테스트 도구가 활용된다.

단위 테스트에서 주목해야 할 부분은 완료 기준이다. 단위 테스트는 실제 코드를 직접 들여다보며 수행하기 때문에, 얼마나 많은 코드를 테스트했는지를 나타내는 커버리지(Coverage) 를 완료 기준으로 활용한다. 대표적으로 문장 커버리지와 분기 커버리지가 있는데, 문장 커버리지는 전체 문장 수 대비 테스트가 수행된 문장 수의 비율로, 분기 커버리지는 조건문이나 분기문이 얼마나 테스트되었는지의 비율로 산출된다. 이 수치를 기반으로 목표치를 설정하고, 그 목표를 달성했을 때 테스트가 완료된 것으로 판단한다.

시스템 테스트 계획서

시스템 테스트는 V 모델 기준으로 요구사항 분석 결과를 바탕으로 수행된다. 단위 테스트와 마찬가지로 테스트를 실제로 수행하기 위한 입력 산출물로 시스템 테스트 계획서와 테스트 케이스가 필요하며, 수행 후에는 출력 산출물인 테스트 결과서가 생성된다. 이 단계에서는 JMeter와 같은 도구가 활용될 수 있다.

시스템 테스트의 완료 기준은 요구사항 커버리지를 기반으로 한다. 전체 요구사항 중 실제로 테스트가 수행된 요구사항이 몇 퍼센트인지를 산출하여 목표치를 설정하고, 이를 완료 기준으로 삼는 것이다.

물론 이 외에도 테스트 수행을 위한 세부 일정, 담당자, 구체적인 수행 방법 등 다양한 요소들이 계획서에 포함되지만, 입출력 산출물과 완료 기준만 보더라도 단계별 테스트 계획서가 얼마나 구체적이고 측정 가능한 형태로 작성되어야 하는지를 잘 알 수 있다.

Why the Test Plan Matters

There is a clear reason why the test plan is developed so meticulously. A test plan is not merely about creating a document. It becomes the baseline for all test design and execution that follows. Testing is designed and executed based on the scope, strategy, and exit criteria defined in the plan, and the test plan also serves as the foundation for monitoring whether testing is proceeding as planned.

Without a plan, it is nearly impossible to judge whether testing is heading in the right direction. No matter how skilled the testers are, their efforts are unlikely to yield proper results without a clear plan. This is precisely why the importance of the Plan phase is repeatedly emphasized in software testing. The test plan is both the starting point and the backbone of successful testing.

테스트 계획, 왜 중요한가

테스트 계획을 이렇게 꼼꼼하게 수립하는 데는 분명한 이유가 있다. 테스트 계획은 단순히 문서를 만드는 작업에 그치는 것이 아니라, 이후에 이루어지는 테스트 설계와 수행 전반의 기준점이 되기 때문이다. 계획서에 정의된 범위, 전략, 완료 기준 등을 토대로 테스트가 설계되고 실행되며, 테스트가 계획대로 제대로 진행되고 있는지를 모니터링하는 기반 역할도 바로 테스트 계획이 담당한다.

결국 계획 없이는 테스트가 올바른 방향으로 가고 있는지조차 판단하기 어렵다. 아무리 뛰어난 테스터가 있더라도 명확한 계획이 없다면 그 노력이 제대로 된 결과로 이어지기 힘들다. 이것이 바로 소프트웨어 테스트에서 Plan의 중요성을 거듭 강조하는 이유이며, 테스트 계획은 성공적인 테스트의 시작이자 전체를 관통하는 근간이라고 할 수 있다.

2️⃣ Risk Management Overview

What is Risk?

Earlier, we learned that the ISO/IEC/IEEE 29119 standard emphasizes a risk-based test strategy in the test planning process. Let us now examine what risk management is and how its process unfolds.

First, what is Risk? It is actually a concept we encounter in everyday life ; "there is a risk of rain today," "this investment carries a high risk," "there are safety risks on a construction site." As these examples show, risk is not a special concept; it refers to an uncertain event or situation that may occur in the future.

The same applies to software testing. Unexpected events can occur during test execution, and such uncertainties can affect the quality and outcomes of testing. This is why identifying and managing risks in advance is a core element of test planning.

위험(Risk)이란 무엇인가

앞서 테스트 계획을 수립하는 과정에서 ISO/IEC/IEEE 29119 표준이 위험 기반의 테스트 전략을 강조한다는 것을 배웠다. 그렇다면 본격적으로 위험 관리란 무엇인지, 그리고 그 프로세스는 어떻게 진행되는지 살펴보자.

먼저 위험(Risk) 이란 무엇일까. 사실 위험이라는 단어는 우리가 일상생활에서도 흔하게 접하는 개념이다. 예를 들어 "오늘 비가 올 위험이 있다", "이 투자는 위험 부담이 크다", "공사 현장에는 안전 위험이 존재한다"와 같이, 우리는 이미 다양한 맥락에서 위험이라는 개념을 자연스럽게 사용하고 있다. 이처럼 위험은 특별한 개념이 아니라, 미래에 발생할 수 있는 불확실한 사건이나 상황을 가리키는 말이다.

소프트웨어 테스트에서도 마찬가지다. 테스트를 수행하는 과정에서도 예상치 못한 일들이 발생할 수 있고, 이러한 불확실성이 테스트의 품질과 결과에 영향을 미칠 수 있다. 그렇기 때문에 위험을 미리 파악하고 관리하는 것이 테스트 계획의 핵심이 되는 것이다. 이어서 위험 관리의 구체적인 프로세스를 살펴보도록 하자.

Risk vs. Issue. Potential Problem vs. Actual Problem

To understand risk more clearly, let us compare two contrasting concepts: Potential Problem and Actual Problem.

A Potential Problem is something that has not yet occurred but may happen in the future; this is Risk. An Actual Problem, on the other hand, is a problem that has already occurred; this is an Issue. Schedule delays, budget overruns, major scope changes, quality defects discovered in production, and customer complaints are all examples of Issues that have already materialized.

In practice, many organizations and individuals operate primarily in Issue mode; reacting to problems as they arise, only to be consumed by the next issue in an endless cycle. We see this pattern repeatedly in real-world disasters and accidents: countermeasures are developed only after the incident has occurred, after lives have been lost and economic damage has been done. This is the hallmark of Issue-driven work.

Working more systematically means operating in Risk mode; anticipating potential problems before they occur and preparing countermeasures in advance. As discussed, the act of planning is inherently about preparing for future events that have not yet happened, which means plans must always incorporate Risk.

Ultimately, risk-based planning is not a concept limited to testing. In any project or work environment, the key to systematic management is shifting from reacting to Issues after they erupt to proactively identifying and preparing for Risks.

Risk vs Issue, 잠재적 문제와 실제 문제

위험(Risk)을 좀 더 명확하게 이해하기 위해 상반되는 두 개념을 비교해보자. 바로 Potential Problem(잠재적 문제) 과 Actual Problem(실제 문제) 이다.

Potential Problem은 지금 당장 발생하지는 않았지만, 미래에 발생할 수도 있는 문제를 의미한다. 이것이 바로 Risk다. 반면 Actual Problem은 이미 발생한 문제, 즉 Issue를 가리킨다. 일정 지연, 예산 초과, 프로젝트의 대규모 변경, 품질 문제 발생, 고객 클레임 접수 등이 모두 이미 터진 Issue에 해당한다.

많은 조직과 개인이 실제로는 Issue 중심으로 일을 한다. 문제가 터지고 나서야 부랴부랴 대응하고, 또 다른 문제가 터지면 다시 그것을 처리하느라 바쁜 악순환이 반복되는 것이다. 우리 주변에서 일어나는 각종 재난이나 사고를 돌아봐도 마찬가지다. 사고가 발생하고 나서야 대책을 마련하고, 이미 소중한 생명과 막대한 경제적 손실이 발생한 뒤에야 제도가 바뀌는 모습을 우리는 너무나 자주 목격한다. 이것이 전형적인 Issue 중심의 일 처리 방식이다.

반면 보다 체계적으로 일을 한다는 것은 Risk 중심으로 일을 한다는 것을 의미한다. 문제가 터지기 전에 미리 잠재적인 위험을 예측하고, 그에 대한 대책을 사전에 마련하는 것이다. 앞서 배운 것처럼 계획을 세운다는 행위 자체가 아직 발생하지 않은 미래의 일들을 대비하는 것이기 때문에, 계획 안에는 반드시 Risk가 포함되어 있어야 한다.

결국 위험 기반으로 계획을 세우는 것은 단순히 테스트에만 국한된 이야기가 아니다. 어떤 프로젝트든, 어떤 업무든 체계적으로 관리하고자 한다면 Issue가 터난 후에 반응하는 방식에서 벗어나, Risk를 미리 식별하고 대비하는 방식으로 일하는 것이 핵심임을 반드시 기억하자.

프로젝트에서의 위험(Risk) 정의

프로젝트가 성공적으로 완료되기 위해서는 내외부 참여자, 예산, 시스템, 기술, 고객 등 수많은 구성 요소들이 유기적으로 잘 맞물려 돌아가야 한다. 그리고 일반적으로 프로젝트의 성공 여부는 품질(Quality), 비용(Cost), 납기(Delivery) 세 가지를 모두 만족했는지를 기준으로 판단한다.

여기서 위험(Risk)의 정의가 자연스럽게 도출된다. 이 세 가지 요소 중 적어도 하나에라도 영향을 줄 수 있는 잠재적인(Potential) 이벤트 또는 상태를 바로 위험, 즉 Risk라고 정의한다.

중요한 것은 "줄 수 있는"이라는 표현에 담긴 잠재성이다. 실제로 영향을 준 것이 아니라, 영향을 줄 수도 있는 가능성만으로도 Risk로 간주한다는 점이다. 예를 들어 핵심 개발자의 이탈 가능성, 기술적 난이도로 인한 일정 지연 가능성, 요구사항의 잦은 변경 가능성 등이 모두 Risk에 해당한다. 아직 아무 일도 일어나지 않았지만, 그것이 현실이 되었을 때 프로젝트의 품질, 비용, 납기에 영향을 미칠 수 있다면 그 자체로 Risk인 것이다.

결국 프로젝트에서 위험을 관리한다는 것은, 이처럼 프로젝트의 성공을 위협할 수 있는 잠재적 요소들을 미리 식별하고 대비하는 활동이라고 할 수 있다.

2. Risk Management Process

Definition of Risk in a Project Context

For a project to be completed successfully, numerous components, internal and external stakeholders, budget, systems, technology, and customers - must work together in harmony. Generally, the success of a project is judged by whether it satisfies all three of the following criteria: Quality (Q), Cost (C), and Delivery (D).

위험 관리 프로세스

위험 관리 프로세스란 소프트웨어 프로젝트의 목표인 품질, 비용, 일정을 성공적으로 만족시키기 위해, 프로젝트에 존재하는 위험을 미리 식별하고 분석하여 대비해나가는 일련의 활동을 말한다.

이 프로세스는 크게 세 단계로 진행된다.

Risk Identification

Risk identification is the activity of finding potential problems that could affect the achievement of project goals — quality, cost, and schedule. It involves continuously asking, "What risks could arise when performing this activity?" as the project plan is developed.

This is easier said than done, because without it being a habit, it is easy to skip over. Many people are accustomed to working through a to-do list and lack the practice of proactively looking for potential problems. However, failing to identify risks allows small risks to grow into Issues — what could have been contained at 1 grows into 10 and then 100.

Methods for Identifying Risks

There are three commonly used methods for systematically identifying risks.

The first is using a Risk Database. Organizations that manage risks well maintain a risk database containing records of past risks and how they were resolved. Referencing this database makes it easier to identify risks that could arise in the current project.

The second is using an Issue Database. Even organizations without a risk database typically have records of past issues — in the form of spreadsheets or meeting minutes. These past issues can recur in the next project, making them potential risks. An issue database alone can be sufficient for deriving current project risks.

The third is using a Risk Checklist. By reviewing a checklist item by item with Yes/No responses, potential risks for the project can be identified. Items marked "No" represent risk factors that could materialize.

Risk Examples in a Testing Project

To illustrate how risk identification works in practice, consider the following examples from a testing project.

The first is a sudden change in customer priorities. A situation may arise where, after a test strategy and test cases have already been developed, a customer suddenly requests that a specific feature be tested first. Although it has not happened yet, it is entirely plausible, and a response strategy should be prepared in advance.

The second is the risk associated with an external test outsourcing vendor. When an external vendor is contracted because internal resources are insufficient for testing, the vendor may go bankrupt or fail to fulfill the contract. This can lead to schedule delays and cost overruns, so contingency plans must be prepared.

The third is insufficient tester competency. A test team may include both experienced professionals and junior employees with limited testing experience. Testing performed without adequate competency can directly affect quality and must therefore be identified as a significant risk factor.

Risk Identification Must Become a Habit

Ultimately, the starting point of risk identification is simple: habitually asking "What risks could arise as I carry out this work?" whenever planning. By repeatedly leveraging past issue databases and checklists to uncover as many risks as possible, the capability to execute projects on a risk-based foundation will naturally develop over time.

위험 식별(Risk Identification)

위험 식별이란 프로젝트의 목표인 품질, 비용, 일정의 달성에 영향을 줄 수 있는 잠재적인 문제를 찾아내는 활동이다. 프로젝트 계획을 수립해나가면서 "이 활동을 할 때는 어떤 위험이 있을 수 있을까?"를 끊임없이 자문하는 과정이 바로 위험 식별이다.

이것이 말처럼 쉽지 않은 이유는 습관이 되어있지 않으면 자연스럽게 넘어가기 쉽기 때문이다. 많은 사람들이 해야 할 일의 목록만 생각하며 일을 처리하는 데 익숙하다 보니, 잠재적인 문제를 미리 찾는 연습이 부족한 경우가 많다. 하지만 위험 식별을 놓치면 작은 위험이 Issue로 번져 1로 막을 수 있었던 것이 10이 되고 100이 되는 상황을 맞이하게 된다.

위험을 찾아내는 방법

위험을 체계적으로 식별하기 위해 일반적으로 활용하는 방법은 크게 세 가지다.

첫 번째는 위험 DB 활용이다. 위험 관리를 잘 하는 조직이라면 과거에 발생했던 위험과 그 해결 방법이 기록된 위험 DB를 보유하고 있다. 이 DB를 참고하면 현재 프로젝트에서 발생할 수 있는 위험을 보다 수월하게 찾아낼 수 있다.

두 번째는 이슈 DB 활용이다. 위험 DB가 없는 조직이라도 과거에 발생한 문제들을 기록한 이슈 DB는 갖고 있는 경우가 많다. 엑셀 파일이나 회의록 형태로 남아있는 과거의 이슈들은 다음 프로젝트에서도 똑같이 발생할 수 있는 잠재적 위험이 된다. 즉, 이슈 DB만으로도 현재 프로젝트의 위험을 충분히 도출해낼 수 있다.

세 번째는 위험 체크리스트 활용이다. 체크리스트를 항목별로 Yes/No로 점검하며 이 프로젝트에서 발생 가능한 위험을 확인하는 방식이다. No로 표시된 항목들은 곧 발생할 수 있는 위험 요소가 된다.

실제 테스트 프로젝트에서의 위험 예시

위험 식별이 실제로 어떻게 이루어지는지 테스트 프로젝트를 예시로 살펴보자. 먼저 고객 우선순위의 갑작스러운 변경이다. 테스트 전략과 테스트 케이스를 이미 만들어 놓은 상황에서 고객이 갑자기 특정 기능을 먼저 테스트해달라고 요청하는 경우가 생길 수 있다. 아직 발생하지는 않았지만 충분히 일어날 수 있는 위험이며, 이에 대한 대응 전략을 미리 마련해두어야 한다.

다음으로 외부 테스트 아웃소싱 업체의 리스크다. 내부 인력만으로 테스트를 수행하기 어려워 외부 업체와 계약했을 때, 그 업체가 파산하거나 계약을 이행하지 못하는 상황이 발생할 수 있다. 이 경우 일정 지연과 비용 낭비로 이어질 수 있기 때문에 사전에 대처 방안을 준비해야 한다.

마지막으로 테스터의 역량 부족이다. 테스트 팀 안에는 전문가도 있지만 경험이 부족한 신입 직원도 있을 수 있다. 역량이 충분히 갖추어지지 않은 상태에서 테스트를 수행하면 품질에 직접적인 영향을 미칠 수 있으므로, 이 역시 중요한 위험 요소로 식별해야 한다.

위험 식별, 습관이 되어야 한다

결국 위험 식별의 출발점은 단순하다. 계획을 세울 때 "이 일을 수행하다 보면 어떤 위험이 생길 수 있을까?"라는 질문을 습관적으로 던지는 것이다. 과거의 이슈 DB나 체크리스트를 적극 활용하여 최대한 많은 위험을 찾아내려는 노력을 반복하다 보면, 자연스럽게 위험에 기반하여 프로젝트를 수행하는 역량이 갖추어지게 된다.

Risk Analysis

Once risk identification has produced a large number of risk factors, the next step is risk analysis. It is not feasible to prepare countermeasures for all 20–30 identified risks, given the constraints of cost, schedule, and resources that every project faces. The core of risk analysis is therefore evaluating the magnitude of each identified risk to determine which risks to prioritize for management.

Two Evaluation Criteria for Risk Analysis

Risk magnitude is measured using two factors: probability of occurrence and impact.

Probability of occurrence refers to the likelihood that the risk will actually materialize. It is typically scored as 1 (Low) for below 30%, 2 (Medium) for 30%–80%, and 3 (High) for above 80%. Impact refers to how significantly the risk would affect Quality, Cost, and Delivery if it occurred, and is similarly scored as 1 (Low) for below 10%, 2 (Medium) for 10%–20%, and 3 (High) for above 20%.

Deriving Risk Levels via a Risk Matrix

By quantifying both criteria, they can be represented in a Risk Matrix. Risk levels are derived by multiplying or combining the probability and impact scores. For instance, a probability of 1 (Low) and impact of 1 (Low) results in a low-level risk, while a probability of 2 (Medium) and impact of 2 (Medium) results in a level 4 (Medium) risk. The example from the risk identification phase — a sudden change in customer priorities — would be evaluated as probability 2 (Medium) and impact 3 (High), resulting in a level 6 (High) risk.

Why Risk Analysis is Essential

Once the risk level hierarchy is established, it becomes clear which of the 20 identified risks require priority management. High-level risks are managed first, followed by medium-level risks. While it would be ideal to address every risk if time, cost, and resources were unlimited, projects always operate within constraints. Risk analysis is therefore an essential activity in systematic project management, ensuring that limited resources are focused on the most critical risks.

위험 분석(Risk Analysis)

위험 식별을 통해 수많은 위험 요소들이 도출되고 나면, 그 다음 단계는 위험 분석이다. 20~30개에 달하는 위험 요소 모두에 대해 대책을 마련하는 것은 현실적으로 불가능하다. 프로젝트에는 항상 비용, 일정, 자원과 같은 제약이 존재하기 때문이다. 따라서 식별된 위험들의 크기를 평가하여 어떤 위험을 중점적으로 관리할 것인지 우선순위를 정하는 것이 위험 분석의 핵심이다.

위험 분석의 두 가지 평가 기준

위험의 크기를 측정하기 위해서는 두 가지 요소를 평가한다. 바로 발생 확률과 영향도다.

발생 확률은 해당 위험이 실제로 일어날 가능성을 의미하며, 일반적으로 30% 이하는 1(하), 30%80%는 2(중), 80% 이상은 3(상)과 같이 점수로 구분한다. 영향도는 위험이 발생했을 때 품질(Quality), 비용(Cost), 납기(Delivery)에 얼마나 큰 영향을 미치는지를 나타내며, 10% 이하는 1(하), 10%20%는 2(중), 20% 이상은 3(상)으로 동일하게 점수화한다.

위험 매트릭스를 통한 등급 산출

이렇게 두 가지 기준을 점수화하면 이를 매트릭스 형태로 표현할 수 있다. 발생 확률과 영향도를 곱하거나 조합하여 위험의 등급을 산출하는 방식이다. 예를 들어 발생 확률이 1(하)이고 영향도가 1(하)이면 전체적으로 낮은 수준의 위험이 되고, 발생 확률이 2(중)이고 영향도가 2(중)이면 4(중) 수준의 위험이 된다. 앞서 위험 식별 단계에서 예시로 들었던 고객 우선순위의 갑작스러운 변경의 경우, 발생 확률 2(중)에 영향도 3(상)으로 평가되어 6(상) 수준의 고위험에 해당한다.

위험 분석이 필수적인 이유

이와 같이 위험 등급 체계를 정하고 나면 20개의 위험 요소 중 어떤 것을 중점적으로 관리해야 할지가 명확해진다. 고수준으로 분류된 위험들을 최우선으로 관리하고, 그 다음 순위로 중간 수준의 위험들을 관리하는 방식이다. 시간과 비용, 자원이 충분하다면 모든 위험에 대응할 수 있겠지만, 현실적으로 프로젝트는 항상 한정된 자원 안에서 운영된다. 그렇기 때문에 위험 분석을 통해 우선순위를 정하고 한정된 자원을 가장 중요한 위험에 집중적으로 투입하는 이 과정은 체계적인 프로젝트 관리에 있어 필수적인 활동이라고 할 수 있다.

Risk Mitigation Identification

Having identified and prioritized risks, the final step is developing countermeasures. Risk mitigation means preparing responses that either reduce the probability of a risk occurring or minimize its impact if it does occur, bringing it down to an acceptable level. There are four categories of risk mitigation strategies.

1) Risk Avoidance

This approach completely eliminates the possibility of a risk occurring. For example, performing integration testing using the Big Bang approach; integrating dozens or hundreds of modules all at once; makes it extremely difficult to trace which module caused a defect. To eliminate this risk from the outset, adopting an incremental integration test strategy instead of Big Bang is a classic example of risk avoidance. The key is creating an environment where the risk cannot occur in the first place.

2) Risk Mitigation

Rather than eliminating a risk entirely, this approach reduces it to an acceptable level. For example, when there is a risk that customer requirement priorities may suddenly change, the goal is to reduce the probability of that change from 50% to 10–20%. This can be achieved by engaging the customer actively from the early stages of development to lock down requirements as much as possible, thereby reducing the probability of change at the source.

3) Risk Transference

This approach does not solve the risk directly but instead transfers the responsibility or impact to another party. It is most commonly used when the probability is low but the potential damage is high. For example, to guard against the possibility of a test outsourcing vendor failing to fulfill the contract, requiring the vendor to purchase performance bond insurance, even at additional cost; transfers the financial risk to a third party. In practice, many organizations mandate performance bond insurance for inter-agency contracts. This is not an evasion of responsibility but a rational strategy for reducing the impact of risks that are difficult to manage internally.

4) Risk Acceptance

This approach accepts risks that fall within a tolerable level. For example, if testers lack sufficient competency but must be included in order to complete testing successfully, the risk can be accepted by investing in test training to build their skills over time. If it is judged that they will be able to perform adequately once trained, then the risk is accepted and managed accordingly.

These four mitigation strategies must be selected appropriately based on the magnitude and nature of each risk identified and analyzed. Most importantly, the key is to understand what risk is, practice the three-phase risk management process of Identification → Analysis → Mitigation systematically, and use it to prevent small risks from becoming large Issues. The risk management concepts learned here will serve as a critical foundation when developing the risk-based test strategy covered in future sessions.

위험 완화 방안 식별(Risk Mitigation Identification)

위험을 식별하고 우선순위를 분석했다면, 이제 그 위험들에 대한 대책을 마련하는 단계가 남아있다. 위험 완화란 위험이 발생할 확률을 낮추거나, 발생하더라도 그 영향도를 최소화하여 허용 가능한 수준으로 낮추기 위한 대책을 마련하는 것이다. 위험 완화 방안은 크게 네 가지로 구분된다.

1) 위험 회피 (Avoidance)

위험이 발생할 가능성을 애초에 완전히 제거하는 방법이다. 예를 들어 통합 테스트를 빅뱅(Big Bang) 방식으로 진행하면, 수십~수백 개의 모듈을 한꺼번에 통합하기 때문에 문제가 발생했을 때 어느 모듈에서 비롯된 것인지 추적하기가 매우 어렵다. 이러한 위험을 처음부터 없애기 위해 빅뱅 방식 대신 점진적 통합 테스트 전략을 채택하는 것이 위험 회피의 대표적인 예다. 처음부터 위험이 발생할 수 없는 환경을 만드는 것이 핵심이다.

2) 위험 완화 (Mitigation)

위험을 완전히 없애는 것이 아니라, 허용 가능한 수준 이하로 낮추는 방법이다. 예를 들어 고객의 요구사항 우선순위가 갑자기 변경될 위험이 있을 때, 그 변경 가능성을 50%에서 10~20%로 줄이는 것을 목표로 한다. 이를 위해 개발 후반부가 아닌 초기 단계부터 고객을 적극적으로 참여시켜 요구사항을 최대한 확정짓는 방식으로 위험의 발생 확률 자체를 낮출 수 있다.

3) 위험 전이 (Transference)

위험을 내가 직접 해결하는 것이 아니라, 외부의 힘을 빌려 위험의 책임이나 영향을 다른 주체에게 이전하는 방법이다. 발생 확률은 낮지만 피해가 클 경우에 주로 활용된다. 예를 들어 테스트 아웃소싱 업체가 계약을 이행하지 못하는 상황에 대비하여, 비용이 다소 들더라도 업체가 이행보증보험에 가입하도록 하여 피해를 최소화하는 것이 대표적인 사례다. 실제 현업에서도 기관 간 계약 시 이행보증보험 가입을 의무화하는 경우가 많다. 이는 책임 회피가 아니라, 감당하기 어려운 위험의 영향도를 줄이기 위한 합리적인 선택이다.

4) 위험 수용 (Acceptance)

허용 가능한 수준의 위험을 그대로 받아들이는 방법이다. 예를 들어 테스터의 역량이 부족하더라도, 해당 인력을 포함하여 테스트를 성공적으로 수행해야 하는 상황이라면 테스트 교육을 통해 역량을 키우는 것을 선택할 수 있다. 당장은 부족하더라도 시간이 지나면 충분히 수행 가능하다고 판단한다면, 그 위험을 감내하고 수용하는 것이다.

이 네 가지 위험 완화 방안은 앞서 식별하고 분석한 위험의 크기와 성격에 따라 적절하게 선택되어야 한다. 무엇보다 중요한 것은 위험이 무엇인지 이해하고, 식별 → 분석 → 완화의 세 단계로 이루어진 위험 관리 프로세스를 체계적으로 실천하는 것이다. 이를 통해 나중에 큰 Issue로 번질 수 있는 문제들을 사전에 차단하고, 프로젝트를 안정적으로 이끌어 나갈 수 있다. 앞으로 학습할 위험 기반의 테스트 전략 수립에서도 오늘 배운 위험 관리의 개념과 프로세스가 핵심적인 토대로 활용될 것이다.

3️⃣ Risk-based testing strategy

What is Risk-Based Testing?

Risk-Based Testing is a strategy that identifies potential problems that could arise in a project and applies the risk management process; Risk Identification → Analysis → Mitigation; integrated into the testing effort.

Just as resources are constrained in a project, testing resources - cost, tools, equipment, and personnel — are always limited. In this environment, the essence of the risk-based test strategy is to invest available resources most efficiently based on risk, in order to achieve the testing objective of discovering as many defects as possible.

위험 기반 테스트(Risk-Based Test)란?

위험 기반 테스트란 프로젝트에서 발생할 수 있는 잠재적인 문제들을 찾아내고, 위험 관리 프로세스인 위험 식별 → 분석 → 완화의 과정을 테스트에 융합하여 적용하는 전략이다.

프로젝트에서 자원이 제한되어 있듯이, 테스트에서도 비용, 도구, 장비, 인원은 항상 제한되어 있다. 이러한 환경에서 가지고 있는 자원을 가장 효율적으로 활용하기 위해, 다양한 결함을 발견한다는 테스트의 목표를 달성하기 위해 위험에 기반하여 자원을 집중적으로 투입하는 것이 바로 위험 기반 테스트 전략의 본질이다.

Goals Achievable Through Risk-Based Testing

There are three key benefits expected from risk-based testing.

The first is improved software product quality. By identifying and analyzing risks and establishing priorities, the factors most likely to contribute to product defects are naturally surfaced. Focusing testing efforts on these factors naturally elevates product quality.

The second is improved overall test coverage. Concentrating on higher-priority areas means testing the features that customers and stakeholders care about most first. This ensures meaningful test coverage where it matters most.

The third is improved test efficiency. By directing limited resources toward high-risk areas, greater test effectiveness is achieved with the same resources. Risk-based testing is ultimately the most rational test strategy for achieving maximum quality impact with limited resources.

위험 기반 테스트를 통해 얻을 수 있는 목표

위험 기반 테스트를 통해 기대할 수 있는 효과는 크게 세 가지다.

첫 번째는 소프트웨어 제품 품질 향상이다. 위험을 식별하고 분석하여 우선순위를 정하다 보면 자연스럽게 제품의 결함에 가장 큰 영향을 미칠 수 있는 요소를 찾게 된다. 이 요소들을 중점적으로 테스트함으로써 제품의 품질이 자연스럽게 높아진다.

두 번째는 전체 테스트 커버리지 개선이다. 우선순위가 낮은 부분보다 높은 부분에 집중한다는 것은, 고객과 이해관계자들이 가장 중요하게 생각하는 영역을 먼저 충분히 테스트한다는 의미다. 이를 통해 실질적으로 의미 있는 테스트 커버리지를 확보할 수 있다.

세 번째는 테스트 효율성 향상이다. 한정된 자원을 위험 수준이 높은 영역에 집중 투입함으로써, 같은 자원으로 더 큰 테스트 효과를 얻을 수 있다. 결국 위험 기반 테스트는 "적은 자원으로 최대의 품질 효과를 내기 위한" 가장 합리적인 테스트 전략이라고 할 수 있다.

Risk Analysis Criteria from a Testing Perspective

Risk analysis involves evaluating the probability of occurrence and impact of identified risks to establish priorities. Let me examine how these two criteria are applied specifically from a testing perspective.

Probability of Occurrence; Likelihood of Defects

In a testing context, probability of occurrence refers to the likelihood that defects will arise during testing. The following factors contribute to this likelihood.

First is source code complexity. Code with nested loops inside conditionals, or so-called spaghetti code, is structurally complex and more prone to defects. Inter-module coupling complexity is also significant — the more tightly interdependent the modules, the more likely unexpected defects are to emerge. Implementation technology difficulty is another contributing factor. Lines of Code (LOC) also matter — the more lines of code, the greater the likelihood of defects. Finally, developer competency cannot be overlooked; the difference in skill between junior, mid-level, and senior developers directly affects defect probability.

Impact ; Effect on Business When a Defect Occurs

Impact refers to how significantly a functional failure would affect the business. The factors that compose impact include the following.

First is user criticality — how important the function is to the user determines the degree of impact. Economic and safety damage is also a key criterion; if a failure leads to financial loss or safety incidents, the impact is correspondingly high. Damage to the organization's reputation must also be considered — incidents involving hacking, payment failures, or authentication issues can severely undermine organizational credibility. Finally, frequency of use contributes to impact; the more frequently a function is used, the more people are affected when a defect occurs.

Risk Exposure

By comprehensively evaluating the factors composing probability and impact, the magnitude of each risk is classified as Low, Medium, or High. This is referred to as Risk Exposure, and it determines the priority of which areas to focus testing on. By concentrating test resources on areas with high Risk Exposure, the most effective testing possible is achieved within the constraints of available resources.

테스트 관점에서의 위험 분석 기준

위험 분석은 식별된 위험들에 대해 발생 확률과 영향도를 평가하여 우선순위를 정하는 과정이다. 테스트 측면에서 이 두 가지 기준을 어떻게 적용하는지 구체적으로 살펴보자.

발생 확률; 결함이 생길 가능성

테스트에서의 발생 확률이란 테스트를 수행할 때 결함이 발생할 가능성을 의미한다. 즉, 어떤 프로그램이나 모듈에서 결함이 많이 생길 것인가를 따져보는 것이다. 이에 영향을 미치는 요소들은 다음과 같다.

먼저 소스코드의 복잡성(Complexity) 이다. 분기문 안에 반복문이 중첩되어 있거나 이른바 스파게티 코드처럼 구조가 복잡할수록 결함이 발생할 가능성이 높아진다. 또한 모듈 간 상호 관계의 복잡성도 중요한 요소인데, 여러 모듈이 얽히고 설킨 구조일수록 예상치 못한 결함이 생기기 쉽다. 구현 기술의 난이도가 높은 경우도 마찬가지다. 그리고 코드의 규모(Lines of Code) 도 중요한데, 코드 라인이 많아질수록 그만큼 결함이 발생할 가능성도 함께 커진다. 마지막으로 개발자의 역량도 빼놓을 수 없다. 초급, 중급, 고급 개발자의 실력 차이는 결함 발생 확률에 직접적인 영향을 미친다.

영향도; 결함 발생 시 비즈니스에 미치는 영향

영향도란 기능 장애가 발생했을 때 비즈니스 전반에 얼마나 큰 영향을 미치는가를 평가하는 기준이다. 영향도를 구성하는 요소들도 다양하다.

먼저 사용자의 취급 중요도다. 사용자가 해당 기능을 얼마나 중요하게 여기는지에 따라 영향도가 달라진다. 경제적·안전적 피해 또한 중요한 기준이다. 기능 고장이 발생했을 때 금전적 손실이나 안전 문제로 이어진다면 그만큼 영향도가 높다. 조직의 대외 이미지 피해도 빼놓을 수 없는데, 해킹이나 결제·인증 문제가 발생하면 조직의 신뢰도에 큰 타격을 줄 수 있다. 마지막으로 기능 사용 빈도도 영향도에 기여한다. 사용자가 자주 쓰는 기능일수록 결함이 발생했을 때 더 많은 사람들에게 영향을 미치기 때문이다.

위험 노출도(Risk Exposure)

이처럼 발생 확률과 영향도를 구성하는 요소들을 종합적으로 평가하면, 각 위험 요소의 크기가 저, 중, 고 수준으로 산출된다. 이를 위험 노출도(Risk Exposure) 라고 하며, 이 수치를 기반으로 어떤 영역을 중점적으로 테스트할 것인지 우선순위가 결정된다. 결국 테스트 자원을 위험 노출도가 높은 영역에 집중 투입함으로써, 한정된 자원 안에서 가장 효과적인 테스트를 수행할 수 있게 되는 것이다.

Example: Risk Analysis; Vaccine Appointment System

Let me examine how risk analysis works in practice using a vaccine appointment booking system.

The main requirements of this system include functions such as member registration, member withdrawal, login, logout, and vaccine appointment booking. For each function, the probability of defects occurring and the impact on the business if they do are evaluated on a scale of 1–5, and the two values are multiplied to derive the risk magnitude. It is important to note that this evaluation must be performed by domain experts familiar with the system.

For example, the member registration function is evaluated with a probability of 3 and impact of 4, resulting in a risk magnitude of 12. The member withdrawal function scores a probability of 2 and impact of 1, yielding a risk magnitude of only 2. The login function, on the other hand, scores a probability of 4 and impact of 5, resulting in the highest risk magnitude of 20. This is because if login is unavailable, all core system functions — appointment booking, inquiry, and cancellation — become completely inaccessible, warranting the highest impact rating.

By deriving risk magnitudes for all requirements in this way, it becomes clearly apparent which functions are high-risk and which are low-risk. This enables the development of a differentiated test strategy — concentrating more test resources on high-risk functions and allocating relatively fewer resources to low-risk ones. Risk analysis is ultimately a rational decision-making tool for distributing limited test resources most effectively.

[예시] 위험 분석 - 백신 접종 예약 시스템

위험 분석이 실제로 어떻게 이루어지는지 백신 접종 예약 시스템을 예시로 살펴보자.

이 시스템의 주요 요구사항으로는 회원 가입, 회원 탈퇴, 로그인, 로그아웃, 접종 예약 등의 기능이 있다. 각 기능에 대해 결함이 발생할 확률과 발생 시 비즈니스에 미치는 영향도를 1~5점 척도로 평가하고, 두 값을 곱하여 위험의 크기를 산출한다. 이 평가는 반드시 해당 시스템과 관련된 전문가가 수행해야 한다는 점이 중요하다.

예를 들어 회원 가입 기능은 발생 확률 3, 영향도 4로 평가되어 위험의 크기가 12가 된다. 회원 탈퇴 기능은 발생 확률 2, 영향도 1로 위험의 크기가 2에 그친다. 반면 로그인 기능은 발생 확률 4, 영향도 5로 위험의 크기가 20으로 가장 높게 산출된다. 로그인이 불가능하면 예약, 조회, 취소 등 시스템의 모든 핵심 기능을 아예 사용할 수 없게 되기 때문에 영향도가 최고 수준으로 평가된 것이다.

이처럼 모든 요구사항 항목에 대해 동일한 방식으로 위험의 크기를 산출하면, 어떤 기능이 고위험이고 어떤 기능이 저위험인지가 명확하게 드러난다. 이를 토대로 위험의 크기가 큰 기능에는 더 많은 테스트 자원을 집중하고, 위험의 크기가 작은 기능에는 상대적으로 적은 자원을 투입하는 차별화된 테스트 전략을 수립할 수 있다. 결국 위험 분석은 한정된 테스트 자원을 가장 효과적으로 배분하기 위한 합리적인 의사결정 도구인 셈이다.

Example: Unit Test Planning Based on Risk Analysis

Let me examine how risk analysis results are applied to actual test planning, again using the vaccine appointment booking system.

Risk Level Classification

Requirements are classified into four risk levels based on the scores derived from risk analysis. Level 1 represents the highest risk area with both high probability and high impact. Level 2 has low probability but high impact. Level 3 has high probability but low impact. Level 4 has both low probability and low impact.

Applying this to the vaccine appointment system: login, vaccine appointment, member registration, and appointment inquiry are classified as Level 1 (High Risk) with both high probability and high impact. Appointment cancellation falls under Level 2, while member withdrawal and logout are classified as Level 3 due to their relatively low risk magnitude.

Exit Criteria by Risk Level

These risk classifications are directly used to differentiate the exit criteria and test methods when developing the test plan.

For Level 1 high-risk items, the strictest criteria are applied. To minimize the likelihood of defects and reduce their impact if they do occur, the most rigorous coverage criterion — MC/DC (Modified Condition/Decision Coverage) at 100% completion — is set as the exit criterion. The goal is to test as wide a range of code as possible, leaving no room for hidden defects.

For Level 2 items, while not as strict as MC/DC, branch and condition coverage at 100% completion is set as the exit criterion. Sufficient testing is performed given the high impact, but with somewhat more flexibility than Level 1.

For Level 3 low-risk items, since the impact is relatively limited, statement coverage at 100% completion is set as the exit criterion, allowing for efficient allocation of test resources.

Significance of Risk-Based Test Planning

By incorporating risk analysis results into the test plan, strict criteria are applied to critical functions while appropriate criteria are applied to less critical ones, enabling the most effective use of limited resources. For instance, if the login function has a defect, all core system functions are paralyzed, customers stop using the service, and the company faces significant losses. In contrast, defects in Level 3 functions such as member withdrawal or logout have relatively limited impact. Ultimately, a risk-based test plan is a rational approach that addresses the greatest risks first and most thoroughly, effectively enhancing software product quality.

Risk Factor-Based Risk Analysis Example

Going one step further from the risk analysis method discussed earlier, let me examine an approach that breaks down risk factors by requirement for more granular analysis.

The basic structure remains the same; probability and impact are evaluated per requirement ; but here, risk is broken down into the following sub-categories for more systematic analysis:

LOP (Loss of Power)
CFD (Corrupted File Data)
UUA (Unauthorised User Access)
DNS (Database Not Synchronized)
UUD (Unclear User Documentation)
ST (Slow Throughput)

Each requirement is assessed against these risk factors, scores are assigned, and all scores are summed to derive the final risk magnitude for that requirement. For example, comparing Requirement 1 and Requirement 2, Requirement 2 scores 67 points, indicating a higher overall risk. This leads to the conclusion that more test resources should be concentrated on functions related to Requirement 2.

This approach goes beyond simply evaluating probability and impact, by itemizing specific risk factors for systematic review, it enables a more rigorous and precise risk analysis.

This concludes our examination of what risk is, how the risk management process is structured, and why this analysis is indispensable for developing a test strategy. Ultimately, a risk-based test strategy is an approach that incorporates all of these processes into the test plan, focusing on the most critical risks within limited resources to effectively improve software quality. The risk management concepts learned here will continue to serve as a critical foundation in the upcoming phases of test design and execution.

[예시]위험 분석 기반의 단위 테스트 계획 수립 예시

위험 분석 결과를 실제 테스트 계획에 어떻게 반영하는지 백신 접종 예약 시스템을 통해 살펴보자.

위험 등급 분류

위험 분석을 통해 산출된 점수를 기반으로 요구사항들을 네 가지 등급으로 분류한다. 1등급은 발생 확률과 영향도가 모두 높은 최고위험 영역이고, 2등급은 발생 확률은 낮지만 영향도가 높은 영역이다. 3등급은 발생 확률은 높지만 영향도가 낮은 영역이며, 4등급은 발생 확률과 영향도가 모두 낮은 저위험 영역이다.

백신 접종 예약 시스템에 적용하면, 로그인, 접종 예약, 회원 가입, 예약 조회는 발생 확률과 영향도가 모두 높은 1등급 고위험 항목에 해당한다. 예약 취소는 2등급에 해당하며, 회원 탈퇴와 로그아웃은 위험의 크기가 상대적으로 작아 3등급으로 분류된다.

등급별 테스트 완료 기준

이렇게 분류된 위험 등급은 테스트 계획을 수립할 때 완료 기준과 테스트 방법을 차별화하는 데 직접적으로 활용된다.

1등급 고위험 항목에 대해서는 가장 엄격한 기준을 적용한다. 결함의 발생 가능성을 최대한 줄이고 결함이 발생하더라도 그 영향도를 낮추어야 하기 때문에, 가장 높은 수준의 커버리지 기준인 MCDC 커버리지 100% 완료를 완료 기준으로 삼는다. 최대한 넓은 범위의 코드를 빠짐없이 테스트하여 결함이 숨어있을 가능성을 철저히 차단하는 것이다.

2등급 항목에 대해서는 MCDC만큼 엄격하지는 않더라도, 분기 및 조건 커버리지 100% 완료를 완료 기준으로 설정한다. 영향도가 높은 만큼 충분한 수준의 테스트를 수행하되, 1등급보다는 다소 유연한 기준을 적용하는 것이다.

3등급 저위험 항목은 상대적으로 영향도가 작기 때문에 문장 커버리지 100% 완료 정도를 완료 기준으로 삼아 테스트 자원을 효율적으로 배분한다.

위험 기반 테스트 계획의 의의

이처럼 위험 분석 결과를 테스트 계획에 반영하면, 중요한 기능에는 타이트한 기준을 적용하고 그렇지 않은 기능에는 적절한 수준의 기준을 적용하여 한정된 자원을 가장 효과적으로 활용할 수 있다. 예를 들어 로그인 기능에 결함이 발생한다면 예약, 조회, 취소 등 시스템의 모든 핵심 기능이 마비되어 고객은 서비스 이용을 멈추게 되고, 회사 입장에서는 막대한 손실로 이어질 수 있다. 반면 회원 탈퇴나 로그아웃과 같은 3등급 기능에 결함이 발생하더라도 그 영향은 상대적으로 제한적이다. 결국 위험 기반 테스트 계획은 가장 큰 위험을 가장 먼저, 가장 철저하게 다루어 소프트웨어 제품의 품질을 효과적으로 높이는 합리적인 접근 방식이라고 할 수 있다.

위험 요인 기반의 위험 분석 예시

앞서 살펴본 위험 분석 방법에서 한 단계 더 나아가, 요구사항별 위험 요인(Risk Factor)을 세분화하여 분석하는 방법을 살펴보자.

기본적인 구조는 동일하다. 요구사항 항목별로 발생 확률과 영향도를 평가하되, 여기서는 위험을 보다 체계적으로 분석하기 위해 위험 요인을 아래와 같이 소분류로 세분화한다.

LOP (Loss of Power) — 전력 손실
CFD (Corrupted File Data) — 파일 데이터 손상
UUA (Unauthorised User Access) — 비인가 사용자 접근
DNS (Database Not Synchronized) — 데이터베이스 미동기화
UUD (Unclear User Documentation) — 불명확한 사용자 문서
ST (Slow Throughput) — 느린 처리 속도

각 요구사항에 대해 이 위험 요인들을 하나씩 점검하고 점수를 부여한 뒤, 모든 점수를 합산하여 해당 요구사항의 최종 위험 크기를 산출한다. 예를 들어 요구사항 1번과 2번을 비교했을 때, 요구사항 2번이 67점으로 더 높은 위험 크기를 가지고 있어 더 큰 위험을 내포하고 있음을 알 수 있다. 이를 통해 요구사항 2번과 관련된 기능에 더 많은 테스트 자원을 집중해야 한다는 판단을 내릴 수 있다.

이 방식은 단순히 발생 확률과 영향도만을 평가하는 것에서 벗어나, 구체적인 위험 요인을 항목화하여 빠짐없이 점검할 수 있다는 점에서 보다 체계적이고 정밀한 위험 분석이 가능하다.

여기까지 위험이 무엇인지, 위험 관리 프로세스가 어떻게 구성되는지, 그리고 왜 이 분석이 테스트 전략 수립에 반드시 필요한지를 살펴보았다. 결국 위험 기반 테스트 전략이란 이 모든 과정을 테스트 계획에 녹여내어, 한정된 자원 안에서 가장 중요한 위험에 집중함으로써 소프트웨어의 품질을 효과적으로 높이는 접근 방식이다. 앞으로 배울 테스트 설계와 수행 단계에서도 오늘 학습한 위험 관리의 개념이 핵심적인 기반으로 계속 활용될 것이다.

Processes in Operating Systems

Heesu Noh — Fri, 27 Mar 2026 06:57:17 GMT

1️⃣ Concept and Creation of Processors
2️⃣ Process State and Management
3️⃣ Process Execution and Control

1️⃣ Concept and Creation of Processors

How Does an Operating System Manage Running Tasks?

When using a computer, it's perfectly natural to have a web browser, messenger, and music player all open at the same time. From the user's perspective, it looks like several programs are simply running all at once; but the operating system is examining this situation in far greater detail.

So how exactly does the operating system distinguish and manage these executions? Rather than jumping straight to the answer, let's build up the concepts one by one so the answer emerges naturally. Getting a firm grasp of this big picture first will make topics like scheduling and synchronization - which come later, much easier to understand.

운영체제는 실행 중인 작업을 어떤 단위로 관리할까?

컴퓨터를 사용하다 보면 웹브라우저, 메신저, 음악 재생 프로그램을 동시에 켜두는 일이 자연스럽다. 사용자 입장에서는 여러 프로그램이 그냥 한꺼번에 돌아가는 것처럼 보이지만, 운영체제 입장에서는 이 상황을 훨씬 세밀하게 들여다보고 있다.

그렇다면 운영체제는 이 실행들을 도대체 어떤 단위로 구분하고 관리하는 걸까? 이 질문에 바로 답을 내놓기보다는, 답이 자연스럽게 나올 수 있도록 개념을 하나씩 쌓아가 보자. 이 큰 흐름을 먼저 잡아두면, 나중에 배우게 될 스케줄링이나 동기화 같은 주제들도 훨씬 수월하게 이해할 수 있을 것이다.

1. The Basic Concept of a Process

What's the Difference Between a Program and a Process?

In everyday conversation, "program" and "process" are often used interchangeably. In operating systems, however, the two concepts are clearly distinct.

A program is a file stored on a storage device - a static state that has not yet been executed. Simply put, it's a chunk of code that's ready to run but isn't doing anything yet.

A process, on the other hand, is the state in which that program has been loaded into memory and is actually running. From the moment something becomes a process, the operating system allocates system resources such as CPU and memory to that execution and begins treating it as a single, independent unit to manage.

To summarize: in terms of execution status, a program is in a non-running state, while a process is in a state where it has been allocated CPU time and is actively running. There's also a difference in resource usage. A program uses no resources at all, but a process actively consumes system resources like CPU and memory as it executes.

1. 프로세스 기본 개념

프로그램과 프로세스, 무엇이 다를까?

일상적인 대화에서는 '프로그램'과 '프로세스'를 크게 구분하지 않고 쓰는 경우가 많다. 하지만 운영체제에서는 이 두 개념이 명확하게 갈린다.

프로그램은 저장장치에 저장된 파일, 즉 아직 실행되지 않은 정적인 상태를 말한다. 쉽게 말해 실행될 준비는 됐지만 아직 아무것도 하고 있지 않은 코드 덩어리다.

반면 프로세스는 그 프로그램이 메모리에 올라가 실제로 실행되고 있는 상태를 말한다. 프로세스가 되는 순간부터 운영체제는 CPU와 메모리 같은 시스템 자원을 해당 실행에 할당하고, 이를 하나의 독립적인 관리 대상으로 다루기 시작한다.

정리하면 이렇다. 실행 여부를 기준으로 보면, 프로그램은 실행되지 않은 상태이고 프로세스는 CPU를 할당받아 실제로 실행 중인 상태다. 자원 사용 면에서도 차이가 있다. 프로그램은 자원을 전혀 사용하지 않지만, 프로세스는 실행되면서 CPU와 메모리 같은 시스템 자원을 실제로 소비한다.

One Program, Multiple Processes

There's an important point worth highlighting here. Even if it's the same program, running it multiple times creates a different process for each execution.

For example, suppose you launch the same web browser twice. There is only one program file stored on disk, but the operating system recognizes each execution as a separate process and manages them independently. The file is one, but the running instances are two.

This matters because, going forward, you need to remember that the basic unit the operating system manages is not the "program" but the "process." Scheduling, memory allocation, and synchronization all revolve around processes.

하나의 프로그램, 여러 개의 프로세스

여기서 한 가지 중요한 포인트가 있다. 하나의 프로그램이라도 여러 번 실행되면, 그 실행마다 서로 다른 프로세스가 만들어진다는 것이다.

예를 들어 같은 웹브라우저를 두 번 실행했다고 하자. 디스크에 저장된 프로그램 파일은 하나지만, 운영체제는 각각의 실행을 별개의 프로세스로 인식하고 따로따로 관리한다. 파일은 하나여도 실행 중인 인스턴스는 둘인 셈이다.

이 개념이 중요한 이유는, 앞으로 운영체제가 관리하는 기본 단위가 '프로그램'이 아니라 바로 '프로세스'라는 점을 기억해야 하기 때문이다. 스케줄링도, 메모리 할당도, 동기화도 모두 프로세스를 중심으로 돌아간다.

Seeing Processes in Action Through Task Manager

There's a way to make the concept of the operating system managing execution in process units feel more concrete. That's Windows Task Manager.

When you open Task Manager, you see a list of all the processes currently running on your computer. Every single item in that list is an execution unit - " a process " managed by the operating system. Some were launched directly by you; others are tasks the operating system runs internally, invisible to the user.

Here's something interesting: even if you only have a few windows open on screen, opening Task Manager reveals that dozens of processes are actually running simultaneously. This shows that the range of execution the user perceives and the range the operating system actually manages are entirely different things.

Task Manager is a window that lets you see this reality with your own eyes. Through this screen, you can intuitively understand that the operating system's execution is not a single flow, but is divided into multiple execution units that are managed concurrently.

작업 관리자로 확인하는 프로세스의 실체

운영체제가 프로세스 단위로 실행을 관리한다는 개념을 조금 더 실감 나게 확인할 수 있는 방법이 있다. 바로 윈도우의 작업 관리자다.

작업 관리자를 열면 현재 컴퓨터에서 실행 중인 프로세스들이 목록 형태로 쭉 나열된다. 이 목록의 항목 하나하나가 바로 운영체제가 관리하는 실행 단위, 즉 프로세스다. 내가 직접 실행한 프로그램도 있고, 사용자 눈에는 보이지 않지만 운영체제가 내부적으로 돌리고 있는 작업들도 함께 포함되어 있다.

여기서 흥미로운 점이 있다. 화면에 띄워 놓은 창이 몇 개 없더라도, 작업 관리자를 열어보면 실제로는 수십 개의 프로세스가 동시에 동작하고 있다는 걸 확인할 수 있다. 사용자가 인식하는 실행의 범위와 운영체제가 실제로 관리하는 실행의 범위가 전혀 다르다는 뜻이다.

작업 관리자는 이 사실을 눈으로 직접 확인시켜 주는 창구다. 운영체제의 실행은 단순히 하나의 흐름이 아니라, 여러 개의 실행 단위로 나뉘어 동시에 관리되고 있다는 것을 이 화면을 통해 직관적으로 이해할 수 있다.

The Moment a Program Becomes a Process

To understand the relationship between a program and a process, it helps to trace the flow of execution step by step.

It starts on disk. A program is stored there as a file; an executable made up of code and static data. In this state, nothing is running yet. It's not using the CPU, it's not occupying memory; it's simply a file sitting quietly on disk.

When the user runs the program, the situation changes. The program on disk is loaded into memory, and once it is loaded and in an executable state, we call it a process. That process then receives CPU time, and actual computation begins.

To recap: a program is an executable file stored on disk; a process is the state in which that program has been loaded into memory and is running. Even the same program, when executed multiple times, can produce that many different processes in memory.

The core takeaway from this flow is one thing: the very moment a program is loaded into memory, it transforms into a process - a managed entity under the operating system's control. Before that moment it's merely a stored file, but once it's in memory, the operating system recognizes it as an independent execution unit and begins allocating and managing resources like CPU and memory.

프로그램이 프로세스가 되는 순간

프로그램과 프로세스의 관계를 이해하려면 실행이 일어나는 흐름을 단계별로 따라가 보는 것이 좋다.

시작은 디스크다. 디스크에는 프로그램이 파일의 형태로 저장되어 있다. 이 프로그램은 코드와 정적인 데이터로 이루어진 실행 파일로, 이 상태에서는 아직 아무것도 실행되고 있지 않다. CPU를 사용하는 것도 아니고, 메모리를 점유하는 것도 아닌, 그냥 디스크 위에 가만히 놓인 파일일 뿐이다.

여기서 사용자가 프로그램을 실행하면 상황이 바뀐다. 디스크에 있던 프로그램이 메모리로 올라오게 되고, 메모리에 적재되어 실행 가능한 상태가 된 것을 바로 프로세스라고 부른다. 그리고 이 프로세스가 CPU를 할당받으면서 비로소 실제 연산이 시작된다.

다시 한번 정리하면 이렇다. 프로그램은 디스크에 저장된 실행 파일이고, 프로세스는 그 프로그램이 메모리에 적재되어 실행 중인 상태다. 같은 프로그램이라도 여러 번 실행하면 메모리에는 그만큼 서로 다른 프로세스가 생성될 수 있다.

이 흐름에서 핵심은 딱 하나다. 프로그램이 메모리에 적재되는 바로 그 순간, 운영체제의 관리 대상인 프로세스로 전환된다는 것이다. 그 전까지는 그저 저장된 파일에 불과하지만, 메모리에 올라오는 순간부터 운영체제는 이를 독립적인 실행 단위로 인식하고 CPU와 메모리 같은 자원을 할당하며 관리하기 시작한다.

2. Process Structure and Creation

The Memory Structure of a Process

When we say a process is loaded into memory, it's not simply a matter of one chunk of program code being dumped in. The various roles needed for execution are separated and each occupies its own region in a structured layout. Understanding this structure makes it much clearer how the operating system systematically manages running processes.

2.프로세스의 구성과 생성

프로세스의 메모리 구조

프로세스가 메모리에 올라간다고 했을 때, 단순히 프로그램 코드 하나가 통째로 올라가는 것이 아니다. 실행에 필요한 여러 역할들이 구분되어 각자의 영역을 차지하는 구조로 배치된다. 이 구조를 이해하면 운영체제가 실행 중인 프로세스를 어떻게 체계적으로 관리하는지 훨씬 명확하게 보인다.

Code Region

First is the code region. As the name implies, this is where the program's instructions - the commands the CPU reads and executes one by one - are stored. It's the region that defines the program's behavior itself.

Data Region

Next is the data region. This is where data used throughout the entire program is stored - things like global variables and static variables. Rather than being used only within a specific function, these are data that are referenced broadly throughout the program's execution.

Heap Region

Then there's the heap region. The heap is a memory area that is dynamically allocated while the program runs. For example, creating a new object during execution or requesting additional memory on the fly falls into this category. Its defining characteristic is that its size isn't fixed in advance - it's used flexibly according to the flow of execution.

Stack Region

Finally, there's the stack region. This is where information related to function calls, along with local variables and parameters, is stored. Every time a function is called, the necessary information is pushed onto the stack, and when the function returns, it is popped off — so the amount of space used changes continuously with the flow of execution.

Each Process Has an Independent Memory Space

Through this structure ; code, data, heap, and stack - the operating system manages the memory a process needs during execution in a role-based, systematic way.

There's one point that must be emphasized here: each process holds this memory structure independently. Process A's memory space and Process B's memory space do not directly access or mix with each other. As a result, even if a problem occurs in one process, it does not affect others. This independence is a critical foundation that allows the operating system to manage multiple processes simultaneously and stably.

코드 영역

가장 먼저 코드 영역이다. 이름 그대로 실행할 프로그램의 명령어들이 저장되는 공간이다. CPU는 이 영역에 저장된 명령어를 하나씩 읽어가며 실행한다. 프로그램의 동작 자체를 정의하는 영역이라고 볼 수 있다.

데이터 영역

다음은 데이터 영역이다. 전역 변수나 정적 변수처럼 프로그램 전체에 걸쳐 사용되는 데이터들이 이곳에 저장된다. 특정 함수 안에서만 쓰이는 것이 아니라, 프로그램이 실행되는 동안 전반적으로 참조되는 데이터들의 자리다.

힙 영역

그다음은 힙 영역이다. 힙은 프로그램이 실행되는 도중에 동적으로 할당되는 메모리 공간이다. 예를 들어 실행 중에 새로운 객체를 생성하거나, 필요에 따라 메모리를 추가로 요청하는 경우가 여기에 해당한다. 미리 크기를 정해두는 것이 아니라 실행 흐름에 따라 유동적으로 사용된다는 점이 특징이다.

스택 영역

마지막으로 스택 영역이다. 함수 호출과 관련된 정보, 그리고 지역 변수와 매개변수가 저장되는 공간이다. 함수가 호출될 때마다 필요한 정보가 쌓이고, 함수가 종료되면 다시 걷혀나가는 방식으로 동작하기 때문에 사용량이 실행 흐름에 따라 계속해서 변한다.

각 프로세스는 독립된 메모리 공간을 가진다

코드, 데이터, 힙, 스택으로 나뉜 이 구조를 통해 운영체제는 프로세스가 실행되는 데 필요한 메모리를 역할별로 체계적으로 관리한다.

여기서 반드시 짚고 넘어가야 할 점이 하나 있다. 각 프로세스는 이 메모리 구조를 서로 독립적으로 가진다는 것이다. 프로세스 A의 메모리 공간과 프로세스 B의 메모리 공간은 서로 직접 접근하거나 섞이지 않는다. 덕분에 한 프로세스에서 문제가 생기더라도 다른 프로세스에 영향을 주지 않도록 보호된다. 이 독립성이 운영체제가 여러 프로세스를 안정적으로 동시에 관리할 수 있는 중요한 기반이 된다.

The Memory Layout of a Process

When a process is loaded into memory, the data isn't all jumbled together. The data needed for execution is divided into four regions based on its nature and laid out systematically.

Looking at the structure from the bottom up, the code region sits at the very bottom, followed above it by the data region, then the heap region, with the stack region at the very top.

The code region holds the program instructions that the CPU reads and executes one by one. The data region holds data used broadly throughout the program, such as global and static variables. The heap region is for dynamically allocated memory during execution - for instance, when char *cp = malloc(10000) requests memory mid-run, this region is used. The stack region stores local variables and parameters declared inside functions - things like float f or int i belong here.

One thing worth noting in this structure is that the heap and stack expand toward each other. The heap grows upward as more dynamic allocations occur; the stack grows downward as function calls accumulate. The empty space between them serves as a buffer to accommodate this expansion.

The reason memory is divided into regions like this is simple: to keep data of different natures clearly separated so they don't get mixed together during execution. The operating system allocates and manages memory for each process based on exactly this structure.

프로세스의 메모리 배치 구조

프로세스가 메모리에 올라갈 때, 모든 데이터가 한데 뒤섞여 저장되는 것이 아니다. 실행에 필요한 데이터들은 그 성격에 따라 네 개의 영역으로 나뉘어 체계적으로 배치된다.

구조를 아래에서부터 살펴보면, 가장 아래에 코드 영역이 위치하고 그 위로 데이터 영역, 힙 영역, 그리고 가장 위에 스택 영역이 자리한다.

코드 영역에는 CPU가 하나씩 읽어 실행할 프로그램 명령어들이 저장된다. 데이터 영역에는 전역변수나 정적 변수처럼 프로그램 전반에 걸쳐 사용되는 데이터들이 올라간다. 힙 영역은 실행 중에 동적으로 할당되는 메모리 공간으로, char *cp = malloc(10000)처럼 실행 도중 메모리를 요청하는 경우 이 영역이 사용된다. 스택 영역에는 함수 안에서 선언된 지역변수와 매개변수가 저장되는데, float f나 int i 같은 지역변수가 여기에 해당한다.

이 구조에서 한 가지 눈여겨볼 점은 힙과 스택이 서로를 향해 확장된다는 것이다. 힙은 동적 할당이 늘어날수록 위쪽으로 커지고, 스택은 함수 호출이 쌓일수록 아래쪽으로 확장된다. 두 영역 사이의 빈 공간이 이 확장을 수용하는 여유분 역할을 한다.

이렇게 메모리를 영역별로 구분해서 사용하는 이유는 하나다. 프로세스가 실행되는 동안 성격이 서로 다른 데이터들이 뒤섞이지 않도록 명확히 분리해서 관리하기 위해서다. 운영체제는 바로 이 구조를 기준으로 각 프로세스에 메모리를 할당하고 관리한다.

How Is a Process Created?

A process doesn't spontaneously appear out of nowhere. The entity that creates a new process is always an already-running existing process.

The process that creates a new one is called the parent process; the newly created one is called the child process. A child process, once running, can itself spawn further children. As this relationship repeats, processes form a tree-shaped hierarchical structure.

The operating system performs two roles simultaneously in this process: allocating an independent memory space for the new child process, and completing the preparation work so it can begin executing immediately. From the moment a child process is created, it becomes an independent execution unit with its own memory space.

At this stage, there's no need to dig deep into system calls or specific implementation details. The two core ideas are: processes are created by existing processes, and that relationship forms a parent-child hierarchy. Keeping this concept in mind provides a solid foundation for understanding topics like scheduling and inter-process communication that come later.

프로세스는 어떻게 만들어질까?

프로세스는 갑자기 혼자 생겨나지 않는다. 새로운 프로세스를 만들어내는 주체는 항상 이미 실행 중인 기존의 프로세스다.

이때 새로운 프로세스를 만들어낸 쪽을 부모 프로세스, 그로 인해 새롭게 생성된 쪽을 자식 프로세스라고 부른다. 자식 프로세스 역시 실행되고 나면 또 다른 자식을 낳을 수 있다. 이런 관계가 반복되면서 프로세스들은 나무 모양의 계층 구조를 이루게 된다.

운영체제는 이 과정에서 두 가지 역할을 함께 수행한다. 새로운 자식 프로세스를 위한 독립된 메모리 공간을 할당하고, 곧바로 실행에 들어갈 수 있도록 준비 작업을 마친다. 자식 프로세스는 생성되는 순간부터 자신만의 메모리 공간을 가지는 독립적인 실행 단위가 된다.

지금 단계에서 시스템 호출이나 구체적인 구현 방식까지 깊이 파고들 필요는 없다. 핵심은 두 가지다. 프로세스는 기존 프로세스에 의해 생성된다는 것, 그리고 그 관계는 부모와 자식이라는 계층 구조로 이어진다는 것이다. 이 개념을 머릿속에 잡아두면, 이후에 배울 스케줄링이나 프로세스 간 통신 같은 주제들을 이해하는 데 든든한 발판이 된다.

The Process Tree and the Parent-Child Relationship

Given that parent-child relationships are formed, processes don't exist as scattered, independent entities - they form a hierarchical tree structure. Every process in this tree has some parent process, and ultimately all processes derive from a single root process at the top.

There's an important point here: changes in a parent process's state can affect its child processes. In particular, when a parent process terminates, how its child processes are handled is determined by the operating system's policy.

For reference, in a Linux environment, a process called init serves as the starting point for all processes. No matter which running process you trace upward through its parents, you'll eventually reach init.

Understanding this parent-child relationship makes it easier to naturally connect to topics you'll encounter later, such as process termination and resource cleanup.

프로세스 트리 구조와 부모-자식 관계

부모와 자식 관계가 만들어진다고 했는데, 이 관계를 기준으로 보면 프로세스들은 서로 독립적으로 흩어져 있는 것이 아니라 계층적인 트리 구조를 이룬다. 모든 프로세스는 이 트리 안에서 반드시 어떤 부모 프로세스를 가지고 있으며, 최종적으로는 하나의 최상위 프로세스로부터 파생된다.

여기서 한 가지 중요한 점이 있다. 부모 프로세스의 상태 변화가 자식 프로세스에도 영향을 줄 수 있다는 것이다. 특히 부모 프로세스가 종료되는 경우, 자식 프로세스를 어떻게 처리할지는 운영체제의 정책에 따라 결정된다.

참고로 리눅스 환경에서는 init이라는 프로세스가 모든 프로세스의 시작점 역할을 한다. 실행 중인 어떤 프로세스든 부모를 따라 트리를 거슬러 올라가다 보면 결국 이 init 프로세스와 연결된다.

이렇게 부모와 자식 간의 관계를 이해해두면, 이후에 다루게 될 프로세스 종료나 자원 정리 같은 내용들도 자연스럽게 연결지어 이해할 수 있게 된다.

The Process Tree: A Hierarchy Starting from init

At the top of the process tree in a Linux environment sits init - the process that holds PID 1. This is the very first process to run on the system, and all processes ultimately derive from it.

Below it in the hierarchy sits the Shell process. When a user logs into the system, a shell process is created for that user. The shell receives commands the user types and passes them to the operating system - from the user's perspective, it's the launching point that starts program execution.

Programs like web browsers, chat applications, media players, and text editors are all created as child processes with the shell as their parent. Rather than each user-run program appearing independently out of nowhere, they are created hierarchically through the shell, within a parent-child relationship.

In this way, processes are managed within a tree-structured hierarchy. No matter which process you trace upward, it eventually connects to init. Understanding this entire flow as a single picture makes later topics - process termination, resource cleanup - much easier to connect and comprehend.

프로세스 트리, init에서 시작되는 계층 구조

리눅스 환경에서 프로세스 트리의 최상위에는 init이라는 프로세스가 있다. PID 1번을 가지는 이 프로세스가 시스템에서 가장 먼저 실행되는 출발점으로, 모든 프로세스는 결국 이 init으로부터 파생된다.

그 아래 계층에는 쉘(Shell) 프로세스가 위치한다. 사용자가 시스템에 로그인하면 해당 사용자를 위한 쉘 프로세스가 생성된다. 쉘은 사용자가 입력한 명령어를 받아 운영체제에 전달해주는 역할을 하는데, 사용자 입장에서는 프로그램을 실행할 때 그 실행을 시작해주는 출발점이라고 보면 된다.

웹브라우저, 채팅, 미디어플레이어, 편집기 같은 프로그램들은 모두 이 쉘을 부모로 두는 자식 프로세스로 생성된다. 사용자가 실행하는 프로그램들이 각각 독립적으로 뚝 생겨나는 것이 아니라, 쉘이라는 프로세스를 거쳐 부모-자식 관계 속에서 계층적으로 만들어진다는 뜻이다.

이처럼 프로세스는 트리 구조의 계층 관계 안에서 관리된다. 어떤 프로세스든 부모를 따라 위로 거슬러 올라가다 보면 결국 init과 연결된다. 이 전체 흐름을 하나의 그림으로 이해해두면, 이후에 배울 프로세스 종료나 자원 정리 같은 내용들도 자연스럽게 연결지어 이해할 수 있게 된다.

From Execution Request to Actual Execution

When a user runs a program, several steps proceed in sequence inside the operating system. Let's trace the flow from the moment an icon is clicked or a command is entered to the point where the program actually starts running.

First, when the user's execution request comes in, the operating system creates a new process via a system call. The representative system call function used here is fork(). Through this call, a new child process is born from an existing process.

Once the process is created, the operating system allocates the memory space that process needs to run. The code, data, heap, and stack regions discussed earlier are each secured at this point. After memory allocation, the preparation work of initializing CPU state and execution position follows. Only when this preparation is complete can the process receive CPU time and move into the actual execution phase.

At this stage, there's no need to memorize functions like fork() or the specifics of implementation. What matters is holding the overall flow; execution request → process creation → memory allocation → execution preparation; as a single picture in your mind. Understanding this flow makes subsequent topics like scheduling and process management connect much more naturally.

프로그램 실행 요청부터 실제 실행까지

사용자가 프로그램을 실행하면 운영체제 내부에서는 여러 단계가 순서대로 진행된다. 아이콘을 클릭하거나 명령어를 입력하는 순간부터 실제로 프로그램이 동작하기까지의 흐름을 따라가 보자.

먼저 사용자의 실행 요청이 들어오면 운영체제는 시스템 호출을 통해 새로운 프로세스를 생성한다. 이때 사용되는 대표적인 시스템 호출 함수가 fork()다. 이 호출을 통해 기존 프로세스로부터 새로운 자식 프로세스가 만들어진다.

프로세스가 생성되고 나면 운영체제는 그 프로세스가 실행되는 데 필요한 메모리 공간을 할당한다. 앞서 살펴본 코드, 데이터, 힙, 스택 영역이 이 시점에 각각 확보된다. 메모리 할당이 끝나면 CPU 상태와 실행 위치를 초기화하는 실행 준비 작업이 이어진다. 이 준비가 완료되어야 비로소 프로세스가 CPU를 할당받아 실제 실행 단계로 넘어갈 수 있다.

이 단계에서 fork() 같은 함수나 세부 구현 방식을 외우려 할 필요는 없다. 중요한 것은 프로그램 실행 요청 → 프로세스 생성 → 메모리 할당 → 실행 준비라는 전체 흐름을 하나의 그림으로 머릿속에 담아두는 것이다. 이 흐름을 이해하고 있으면 이후에 배울 스케줄링이나 프로세스 관리 같은 내용들도 훨씬 자연스럽게 연결된다.

3.Condition of Process

The State Changes of a Process

A process doesn't stay in the same state from beginning to end once it's created. It transitions through several states as it runs. Let's trace that flow in order.

The very first is the New state. This is the phase where the process is just being created and is not yet ready to execute.

Once creation is complete, it moves to the Ready state. All preparations for execution are done, but the process hasn't been allocated CPU time yet - it's waiting its turn. It's capable of running, but is waiting because no CPU is available.

When a Ready process is allocated CPU time, it becomes Running. This is the state where instructions are actually being executed. At any given moment, generally only one process can be running on a single CPU. A process can be sent back to Ready if it loses the CPU.

If a situation arises during Running where the process must wait for an external event - like an I/O operation - it transitions to the Waiting state. In this state the process doesn't use the CPU, and once the awaited event completes, it returns to Ready to wait for CPU allocation again.

Finally, when execution is entirely complete, the process reaches the Terminated state. This is where the process's lifecycle ends.

Ultimately, a process operates by continuously changing states within the flow of creation → ready → running → waiting → termination. Understanding this flow of state changes makes it much more natural to later understand how scheduling decides which process to choose at which moment.

프로세스의 상태 변화

프로세스는 한번 생성되면 처음부터 끝까지 같은 상태로 머무는 것이 아니다. 실행되는 동안 여러 상태를 오가며 변화한다. 이 흐름을 순서대로 따라가 보자.

가장 처음은 New 상태다. 프로세스가 막 생성되고 있는 단계로, 아직 실행 준비가 완료되지 않은 상태를 의미한다.

생성이 완료되면 Ready 상태로 넘어간다. 실행에 필요한 준비는 모두 끝났지만 아직 CPU를 할당받지 못해 차례를 기다리는 상태다. 실행 가능한 상태이지만 CPU가 없어서 대기 중인 것이다.

Ready 상태의 프로세스가 CPU를 할당받으면 Running 상태가 된다. 실제로 명령어가 수행되는 상태로, 일반적으로 한 시점에 CPU 하나에서 실행될 수 있는 프로세스는 단 하나뿐이다. 실행 중에 CPU를 빼앗기면 다시 Ready 상태로 돌아가기도 한다.

Running 도중 입출력 작업처럼 외부 사건을 기다려야 하는 상황이 생기면 Waiting 상태로 전환된다. 이 상태에서는 CPU를 사용하지 않으며, 기다리던 사건이 완료되면 다시 Ready 상태로 돌아가 CPU 할당을 기다린다.

마지막으로 실행이 모두 끝나면 Terminated 상태가 된다. 프로세스의 생명 주기가 마무리되는 단계다.

결국 프로세스는 생성 → 준비 → 실행 → 대기 → 종료라는 일련의 흐름 속에서 상태를 끊임없이 바꾸며 동작한다. 이 상태 변화의 흐름을 이해해두면, 이후에 배울 스케줄링이 어떤 시점에 어떤 프로세스를 선택하는지도 훨씬 자연스럽게 이해할 수 있게 된다.

The Flow of Process State Transitions

Let's trace which states a process passes through during execution and what triggers each state change.

프로세스 상태 변화의 흐름

프로세스가 실행되는 동안 어떤 상태를 거치고, 어떤 계기로 상태가 바뀌는지 흐름을 따라가 보자.

Basic Flow

Most simply: after being created, a process waits in a non-running state, then is dispatched and moves to the running state when it receives CPU time. When execution ends, it moves to the terminated state. However, if an interrupt occurs during execution, it can return to the non-running state. This is the most basic skeleton of process state changes.

기본 흐름

가장 단순하게 보면, 프로세스는 생성된 후 비실행 상태에 머물다가 CPU를 할당받는 순간 디스패치되어 실행 상태로 넘어간다. 실행이 끝나면 종료 상태로 이동한다. 단, 실행 중에 인터럽트가 발생하면 다시 비실행 상태로 돌아갈 수 있다. 이것이 프로세스 상태 변화의 가장 기본적인 골격이다.

Detailed State Changes

In practice, when the operating system manages processes, this flow is more granular.

After a process is created, rather than immediately receiving CPU time, it first enters the ready state. The ready state means all preparations for execution are done but the process is still waiting for CPU allocation. The process by which the operating system selects one process from those in the ready state and hands it the CPU is called a dispatch. The dispatch is precisely the transition action that occurs when moving from ready to running.

A process in the running state can be sent back to ready depending on circumstances - for example, when its allocated time runs out, or when a higher-priority process needs the CPU. Meanwhile, if a situation requiring the process to wait for an external event (such as an I/O request) arises during execution, the process moves to the waiting state. It doesn't use the CPU while waiting, and once the awaited event completes, it returns to ready and waits for CPU allocation again.

After repeating this cycle, once execution is fully complete, the process moves to the terminated state.

The two key points here are: processes don't proceed in just one direction - they repeatedly cycle through ready, running, and waiting states depending on circumstances - and at the center of that flow is the dispatch, the transition from ready to running.

구체적인 상태 변화

실제로 운영체제가 프로세스를 관리할 때는 이 흐름이 좀 더 세분화된다.

프로세스가 생성되면 곧바로 CPU를 받는 것이 아니라 먼저 준비 상태로 들어간다. 준비 상태는 실행에 필요한 모든 준비는 끝났지만 CPU를 아직 할당받지 못해 차례를 기다리는 상태다. 여기서 운영체제가 실행할 프로세스를 하나 골라 CPU를 넘겨주는 과정을 디스패치라고 한다. 준비에서 실행으로 넘어가는 바로 그 전환 동작이 디스패치다.

실행 상태에 들어간 프로세스는 상황에 따라 다시 준비 상태로 돌아올 수 있다. 할당된 시간이 끝났거나, 더 우선순위가 높은 프로세스에게 CPU를 넘겨줘야 할 경우가 그 예다. 한편 실행 중에 입출력 요청처럼 외부 사건을 기다려야 하는 상황이 생기면 프로세스는 대기 상태로 이동한다. 대기 상태에서는 CPU를 사용하지 않으며, 기다리던 사건이 완료되면 다시 준비 상태로 돌아와 CPU 할당을 기다린다.

이 과정을 반복하다가 실행이 모두 끝나면 프로세스는 최종적으로 종료 상태로 이동한다.

여기서 핵심은 두 가지다. 프로세스는 한 방향으로만 쭉 진행되는 것이 아니라 상황에 따라 준비, 실행, 대기를 반복해서 오간다는 것, 그리고 그 흐름의 중심에 준비에서 실행으로 전환되는 디스패치라는 과정이 있다는 것이다.

2️⃣ Process State and Management

1. Process State Changes

Why Process State Changes Are Necessary

Although it looks like multiple programs are running simultaneously on a computer, the CPU can only execute one process at a time. Because of this constraint, the operating system must continuously make important decisions: which process to run right now, and when to stop the currently running process and switch to another.

Throughout this process, a process doesn't stay only in the running state - it is managed by moving back and forth between the ready and waiting states. The flow that emerges as the operating system switches and manages processes is called process state change.

This concept goes beyond simply describing what state a process is in. It is a fundamental prerequisite for understanding process management techniques like scheduling and synchronization that we'll explore later, which is why it's important to get a firm grasp of this flow now.

프로세스 상태 변화의 필요성

컴퓨터에서 여러 프로그램이 동시에 실행되는 것처럼 보이지만, CPU는 한 순간에 단 하나의 프로세스만 실행할 수 있다. 이 제약이 있기 때문에 운영체제는 끊임없이 중요한 판단을 내려야 한다. 지금 어떤 프로세스를 실행할 것인지, 언제 실행 중인 프로세스를 멈추고 다른 프로세스로 전환할 것인지를 계속해서 결정해야 하는 것이다.

이 과정에서 프로세스는 실행 상태에만 머무는 것이 아니라 준비와 대기 상태를 오가며 관리된다. 운영체제가 프로세스를 전환하고 관리하는 과정에서 나타나는 이러한 흐름을 프로세스의 상태 변화라고 한다.

이 개념은 단순히 프로세스가 어떤 상태에 있는지를 설명하는 데서 그치지 않는다. 앞으로 살펴볼 스케줄링이나 동기화 같은 프로세스 관리 기법들을 이해하기 위한 기본 전제가 되는 개념이기 때문에, 지금 이 흐름을 확실히 잡아두는 것이 중요하다.

What Would Happen Without Process State Changes?

Without process state changes, the operating system would be unable to properly distinguish between multiple processes. If there's no information about which process should be running right now and which is waiting, there's no way to decide who should receive the CPU.

Ultimately, process state changes don't merely display a process's current situation. They serve as the management standard that allows multiple processes to share the CPU and use system resources efficiently.

From this perspective, process state change can be understood as the most fundamental management mechanism the operating system uses to reclaim and reallocate the CPU. The reason the operating system appears to handle multiple processes simultaneously is precisely because it manages them by rapidly switching the CPU based on these state changes.

프로세스 상태 변화가 없다면 어떻게 될까?

만약 프로세스의 상태 변화가 없다면 운영체제는 여러 프로세스를 제대로 구분할 수 없게 된다. 어떤 프로세스가 지금 실행되어야 하는지, 어떤 프로세스가 대기 중인지에 대한 정보가 없으면 CPU를 누구에게 할당해야 할지 결정할 수 없기 때문이다.

결국 프로세스의 상태 변화는 단순히 프로세스의 현재 상황을 표시하는 것에 그치지 않는다. 여러 프로세스가 CPU를 나눠 쓸 수 있도록 하고, 시스템 자원을 효율적으로 사용하기 위한 관리 기준이 된다.

이 관점에서 보면 프로세스의 상태 변화는 운영체제가 CPU를 회수하고 다시 할당하기 위해 사용하는 가장 기본적인 관리 방식이라고 이해할 수 있다. 운영체제가 여러 프로세스를 동시에 다루는 것처럼 보이는 것도, 결국 이 상태 변화를 기반으로 CPU를 빠르게 전환하며 관리하기 때문에 가능한 일이다.

The Operating System Decides State Changes

Process state changes don't occur arbitrarily at any random time. A state change means the operating system is intervening in the execution flow, and this intervention only happens at specific moments.

In other words, a process does not change its own state. State changes occur only at the moment the operating system determines that intervention is necessary. There's one important point here: state changes take place exclusively in kernel mode. User programs never directly change their own state.

This means the operating system holds complete authority over process management. From the process's perspective, it cannot decide on its own when to run or when to stop — all such decisions are made by the operating system in kernel mode. This structure allows multiple processes to share the CPU in an orderly fashion and prevents any single process from monopolizing or destabilizing the entire system.

상태 변화는 운영체제가 결정한다

프로세스의 상태 변화는 아무 때나 임의로 발생하는 것이 아니다. 상태가 바뀐다는 것은 곧 운영체제가 실행 흐름에 개입하겠다는 의미이며, 이 개입은 항상 특정 시점에서만 일어난다.

다시 말해 프로세스가 스스로 자신의 상태를 바꾸는 것이 아니다. 운영체제가 개입이 필요하다고 판단한 순간에 비로소 상태 변화가 일어난다. 여기서 한 가지 중요한 점이 있다. 이 상태 변화는 반드시 커널 모드에서만 이루어진다는 것이다. 사용자 프로그램이 직접 자신의 상태를 바꾸는 일은 없다.

이 사실은 운영체제가 프로세스 관리의 주도권을 완전히 쥐고 있다는 것을 의미한다. 프로세스 입장에서는 언제 실행되고 언제 멈출지를 스스로 결정할 수 없으며, 그 모든 판단은 운영체제가 커널 모드에서 내린다. 이 구조 덕분에 여러 프로세스가 질서 있게 CPU를 나눠 쓸 수 있고, 한 프로세스가 시스템 전체를 독점하거나 불안정하게 만드는 상황을 막을 수 있다.

When Does the Operating System Intervene?

There are specific triggers that cause the operating system to change a process's state. The most representative cases are as follows.

The first is when a process is allocated CPU time. When a process that was waiting in the ready state receives the CPU, the operating system transitions it to the running state.

The second is when a process has used up its entire allocated CPU time. The operating system limits how long each process can hold the CPU; when that time expires, it stops the running process and switches to another.

The third is when a process requests an I/O operation, or when a previously requested I/O operation completes. When an I/O request comes in, the process is transitioned to the waiting state; when the operation completes, it returns to the ready state. The operating system intervenes directly to change the state at both of these moments.

Ultimately, based on these three triggers, a process cycles repeatedly through the running, ready, and waiting states. Through this flow, the operating system coordinates multiple processes so they can share the CPU in an orderly manner.

운영체제는 어떤 순간에 개입할까?

운영체제가 프로세스의 상태를 변경하는 데는 특정한 계기가 있다. 대표적인 경우를 살펴보면 다음과 같다.

첫 번째는 프로세스가 CPU를 할당받는 순간이다. 준비 상태에서 기다리던 프로세스가 CPU를 받게 되면 운영체제는 해당 프로세스를 실행 상태로 전환한다.

두 번째는 프로세스가 할당된 CPU 사용 시간을 모두 소진했을 때다. 운영체제는 각 프로세스가 CPU를 무한정 점유하지 못하도록 사용 시간을 제한하는데, 그 시간이 끝나면 실행 중인 프로세스를 멈추고 다른 프로세스로 전환한다.

세 번째는 프로세스가 입출력 작업을 요청하거나, 요청했던 입출력 작업이 완료되었을 때다. 입출력 요청이 들어오면 해당 프로세스는 대기 상태로 전환되고, 작업이 완료되면 다시 준비 상태로 돌아온다. 이 두 시점 모두 운영체제가 직접 개입해 상태를 변경한다.

결국 이 세 가지 시점을 기준으로 프로세스는 실행, 준비, 대기 상태를 반복해서 오가게 된다. 운영체제는 이 흐름을 통해 여러 프로세스가 CPU를 질서 있게 나눠 쓸 수 있도록 조율하는 것이다.

Terms for State Transitions

We've already looked at the flow of process state changes. Now let's define the terms used to describe each specific moment of transition. Dispatch and timeout are expressions that refer to the concrete moments when the operating system intervenes and a process's state changes.

Dispatch refers to the moment the operating system selects one process from those in the ready state and transitions it to the running state. Simply put, it's the exact moment the operating system decides "I'm going to run this process now." Dispatch is the transition point from ready to running.

Timeout is the transition in the opposite direction. When a running process has exhausted its allocated CPU time, the operating system sends that process back to the ready state. The transition that occurs at this moment is called a timeout — that is, time expiration.

Looking at the two terms together makes the flow clear. Dispatch takes a process from ready to running; timeout brings it back from running to ready. The operating system coordinates multiple processes taking turns with the CPU by repeating these two transitions.

상태 전환을 표현하는 용어

프로세스의 상태가 바뀌는 흐름은 앞서 살펴봤다. 여기서는 그 상태가 바뀌는 각각의 순간을 부르는 용어를 정리해보자. 디스패치와 타임아웃은 운영체제가 개입해서 프로세스의 상태가 전환되는 구체적인 순간을 가리키는 표현이다.

디스패치는 운영체제가 준비 상태에 있는 프로세스 중 하나를 골라 실행 상태로 전환하는 순간을 가리킨다. 쉽게 말해 운영체제가 "이 프로세스를 이제 실행하겠다"고 결정하는 바로 그 순간이다. 준비 상태에서 실행 상태로 넘어가는 전환점이 디스패치다.

타임아웃은 그 반대 방향의 전환이다. 실행 중인 프로세스가 할당된 CPU 사용 시간을 모두 소진하면, 운영체제는 해당 프로세스를 다시 준비 상태로 돌려보낸다. 이때 발생하는 전환을 타임아웃, 즉 시간 종료라고 한다.

두 용어를 함께 놓고 보면 흐름이 명확해진다. 디스패치는 준비에서 실행으로, 타임아웃은 실행에서 준비로 돌아오는 전환이다. 운영체제는 이 두 가지 전환을 반복하면서 여러 프로세스가 CPU를 번갈아 사용할 수 있도록 조율한다.

Block and Wakeup

While dispatch and timeout were terms for transitions between the ready and running states, let's now look at the transition terms related to the waiting state.

Block refers to the moment a running process transitions to the waiting state — when it requests an I/O operation or must wait for a specific event. The process has reached a point where it can no longer proceed on its own, and the operating system places it in the waiting state. While in this waiting state, the process does not use the CPU.

Wakeup is the opposite. When the awaited I/O operation completes or the specific event is resolved, the operating system wakes up the waiting process and moves it to the ready state. One important note: a wakeup does not immediately put the process into the running state. It returns to the ready state and goes through the normal process of waiting for CPU allocation.

With all four transition terms together, the entire flow of process state changes becomes clear at a glance. Execution begins with a dispatch, returns to ready via a timeout, enters waiting via a block, and comes back to ready via a wakeup. The operating system continuously coordinates the execution of multiple processes through these four transitions.

Block과 Wakeup

앞서 디스패치와 타임아웃이 준비와 실행 사이의 전환을 표현하는 용어였다면, 이번에는 대기 상태와 관련된 전환 용어를 살펴보자.

Block은 실행 중인 프로세스가 입출력 작업을 요청하거나 특정 사건을 기다려야 하는 경우, 실행 상태에서 대기 상태로 전환되는 순간을 가리킨다. 프로세스가 스스로 더 이상 진행할 수 없는 상황이 되어 운영체제에 의해 대기 상태로 보내지는 것이다. 이 대기 상태에 머무는 동안에는 CPU를 사용하지 않는다.

Wakeup은 그 반대다. 기다리던 입출력 작업이 완료되거나 특정 사건이 해결되면, 운영체제는 대기 중이던 프로세스를 다시 깨워 준비 상태로 이동시킨다. 이 전환을 Wakeup이라고 한다. 주의할 점은 Wakeup이 된다고 해서 곧바로 실행 상태로 가는 것이 아니라는 점이다. 다시 준비 상태로 돌아와 CPU 할당을 기다리는 순서를 밟게 된다.

이제 네 가지 전환 용어를 함께 놓고 보면 프로세스 상태 변화의 전체 흐름이 한눈에 정리된다. 디스패치로 실행이 시작되고, 타임아웃으로 준비 상태로 돌아오며, Block으로 대기에 들어가고, Wakeup으로 다시 준비 상태로 복귀한다. 운영체제는 이 네 가지 전환을 통해 여러 프로세스의 실행을 끊임없이 조율한다.

2. Process Control Block (PCB)

We've seen which states a process passes through and what triggers each state change. Thinking carefully about this process, a natural question arises.

For the operating system to manage processes, it needs various information about each one. It needs to know what state a process is currently in — whether it's Ready, Running, or Waiting. It also needs to know where to resume execution when a process that was interrupted starts running again, as well as information about how that process had been using the CPU.

So where exactly is all this information stored? Let's find the answer in what follows.

2. 프로세스 제어 블록(PCB)

지금까지 프로세스가 어떤 상태를 거치고, 어떤 계기로 상태가 바뀌는지 살펴봤다. 그런데 이 과정을 곰곰이 생각해보면 자연스럽게 한 가지 의문이 생긴다.

운영체제가 프로세스를 관리하려면 각 프로세스에 대한 여러 정보가 필요하다. 지금 이 프로세스가 어떤 상태에 있는지, 즉 Ready인지 Running인지 Waiting인지를 알아야 한다. 실행이 중단되었다가 다시 재개될 때 어디서부터 이어서 실행해야 하는지도 알아야 한다. 그 프로세스가 CPU를 어떻게 사용하고 있었는지와 같은 정보도 필요하다.

그렇다면 이런 정보들은 과연 어디에 저장되는 걸까? 다음 내용에서 이 질문의 답을 찾아보자.

What Is a PCB?

The answer to the question posed earlier; where is information about processes stored?; is the Process Control Block (PCB).

The operating system uses a data structure called a PCB to manage each process's state and execution information separately. Each time a process is created, a corresponding PCB is created for it, and this PCB is stored in the kernel area.

One important point to note here: a PCB is a management data structure that contains only the information needed to manage a process's execution. The actual execution content — the process's code, data, and stack — is not included in the PCB. That content exists separately in the process's memory space: the code region, data region, heap, and stack, as we saw earlier.

As an analogy, a PCB is like a management card recording information about a process. Rather than containing actual execution content, it's a structure that gathers only the information the operating system needs to manage and control that process.

PCB란 무엇인가?

앞서 던진 질문, 프로세스에 관한 정보는 어디에 저장되는가에 대한 답이 바로 프로세스 제어 블록(PCB, Process Control Block) 이다.

운영체제는 프로세스의 상태와 실행 정보를 프로세스마다 따로 관리하기 위해 PCB라는 자료구조를 사용한다. 프로세스가 하나 생성될 때마다 그에 대응하는 PCB가 하나씩 만들어지며, 이 PCB는 커널 영역에 저장된다.

여기서 한 가지 중요한 점을 짚고 넘어가야 한다. PCB는 프로세스의 실행을 관리하기 위한 정보만 담고 있는 관리용 자료구조라는 것이다. 프로세스의 코드, 데이터, 스택처럼 실행 내용 자체는 PCB에 포함되지 않는다. 그 내용들은 앞서 살펴본 것처럼 프로세스의 메모리 공간인 코드 영역, 데이터 영역, 힙, 스택에 따로 존재한다.

쉽게 비유하자면 PCB는 프로세스에 대한 정보를 기록해둔 관리 카드 같은 것이다. 실제 실행 내용이 담긴 것이 아니라, 운영체제가 그 프로세스를 관리하고 제어하기 위해 필요한 정보들만 모아둔 구조체다.

The Relationship Between a Process and Its PCB

Each time a process is created, a corresponding PCB is created in the kernel area. When Process A is created, PCB A is created; when Process B is created, PCB B is created. There are as many PCBs in the kernel area as there are running processes.

The key point here is that the operating system doesn't handle processes directly — it manages each process's state and execution information through its PCB. From the operating system's perspective, the PCB is what represents a process. All judgments about which process to run next, what state it's currently in, and how it had been using the CPU are made based on the information contained in the PCB.

Simply put, a process is the actual entity running in memory, while the PCB is the information portal the operating system consults to manage that process. The operating system looks at the PCB before the process itself to understand and control it.

프로세스와 PCB의 관계

프로세스가 생성될 때마다 그에 대응하는 PCB가 커널 영역에 하나씩 만들어진다. 프로세스 A가 생성되면 PCB A가, 프로세스 B가 생성되면 PCB B가 만들어지는 식이다. 실행 중인 프로세스의 수만큼 PCB가 커널 영역에 존재하게 된다.

여기서 중요한 점은 운영체제가 프로세스를 직접 다루는 것이 아니라 PCB를 통해 프로세스의 상태와 실행 정보를 관리한다는 것이다. 운영체제 입장에서는 PCB가 곧 프로세스를 대표하는 존재다. 어떤 프로세스를 다음에 실행할지, 지금 어떤 상태에 있는지, CPU를 어떻게 사용하고 있었는지와 같은 판단을 모두 PCB에 담긴 정보를 기준으로 내린다.

쉽게 말해 프로세스는 메모리에서 실제로 실행되는 실체이고, PCB는 운영체제가 그 프로세스를 관리하기 위해 참조하는 정보 창구라고 볼 수 있다. 운영체제는 프로세스 자체보다 PCB를 먼저 들여다보며 프로세스를 파악하고 제어한다.

Information Stored in a PCB

A PCB contains various pieces of information the operating system needs to manage a process. Let's look at the main items.

First is the process identifier. A unique number identifying each process — the PID (Process ID) — is stored here. The operating system uses this identifier to distinguish and manage multiple processes.

Next is the process state. The PCB records which state the process is currently in: created, ready, running, waiting, or terminated. The operating system uses this information to determine which process to run and which is waiting.

One of the most important items is the Program Counter. When a process is interrupted during execution, the system needs to know where to resume when it runs again. The program counter stores the address of the next instruction to be executed. Because this value is stored, when a process resumes after being interrupted, it doesn't restart from the beginning — it continues precisely from where it left off.

PCB에 저장되는 정보

PCB에는 운영체제가 프로세스를 관리하는 데 필요한 여러 정보들이 담겨 있다. 대표적인 항목들을 살펴보자.

먼저 프로세스 식별자다. 각 프로세스를 구분하기 위한 고유한 숫자, 즉 PID(Process ID)가 저장된다. 운영체제는 이 식별자를 기준으로 여러 프로세스를 서로 구별하고 관리한다.

다음은 프로세스 상태다. 현재 이 프로세스가 생성, 준비, 실행, 대기, 종료 중 어떤 상태에 있는지가 PCB에 기록된다. 운영체제는 이 정보를 보고 어떤 프로세스를 실행할지, 어떤 프로세스가 대기 중인지를 판단한다.

그리고 가장 중요한 항목 중 하나가 프로그램 카운터(Program Counter) 다. 프로세스가 실행되다가 중단되면, 다음에 다시 실행될 때 어디서부터 이어서 실행해야 하는지를 알아야 한다. 프로그램 카운터는 바로 그 다음에 실행해야 할 명령어의 주소를 저장해두는 값이다. 이 값이 있기 때문에 프로세스가 중단되었다가 재개될 때 처음부터 다시 시작하는 것이 아니라 멈췄던 지점에서 정확히 이어서 실행할 수 있다.

Information Stored in a PCB (Continued)

Beyond the identifier, state, and program counter, a few more important pieces of information are stored in the PCB.

Register values. While a process is running, the CPU's registers hold various values needed for the current computation. When an interrupt occurs or a timeout stops the process, the register values at that moment are saved in the PCB. When the process runs again later, those saved register values are restored exactly, allowing it to return to precisely the state it was in before it was interrupted. Because both the program counter and register values are saved together, a process can resume as if it had never been interrupted.

Scheduling-related information is also included in the PCB. This includes the process's priority, its position in the scheduling queue, and other parameters needed for scheduling decisions. The operating system uses this information to determine which process to run next.

Ultimately, the PCB serves as the repository for all information needed to fully restore a process to its previous state when it resumes after being interrupted; losing nothing in the process.

PCB에 저장되는 정보 (계속)

앞서 살펴본 식별자, 상태, 프로그램 카운터 외에도 PCB에는 몇 가지 중요한 정보가 더 저장된다.

레지스터 값이다. 프로세스가 실행되는 동안 CPU의 레지스터에는 현재 연산에 필요한 여러 값들이 담겨 있다. 그런데 인터럽트가 발생하거나 타임아웃으로 프로세스가 중단되면, 그 순간 레지스터에 있던 값들을 PCB에 저장해둔다. 이후 해당 프로세스가 다시 실행될 때 PCB에 저장해둔 레지스터 값을 그대로 복원함으로써, 중단되기 직전의 상태로 정확히 되돌아갈 수 있다. 프로그램 카운터와 레지스터 값이 함께 저장되기 때문에 프로세스는 마치 중단된 적이 없었던 것처럼 이어서 실행될 수 있는 것이다.

스케줄링 관련 정보도 PCB에 포함된다. 이 프로세스의 우선순위가 얼마인지, 스케줄링 큐에서 어느 위치에 있는지, 그 밖에 스케줄링 결정에 필요한 매개변수들이 여기에 저장된다. 운영체제는 이 정보를 바탕으로 다음에 어떤 프로세스를 실행할지 판단한다.

결국 PCB는 프로세스가 중단되었다가 재개될 때 아무것도 잃지 않고 이전 상태로 완전히 복구될 수 있도록 필요한 모든 정보를 보관하는 역할을 한다.

Information Stored in a PCB (Wrap-up)

Beyond the information already covered, a few more items are included in the PCB.

Accounting information. Records of how much CPU time the process has used, its actual usage time, the maximum allowed time, and similar resource usage details are recorded here. Identifying information such as account number and job number is also included.

I/O status information is also stored. What I/O operations the process has currently requested, and what files are currently open ; this information is recorded in the PCB.

Finally, memory management information. Which areas of memory the process is using, and information related to memory allocation, are included in the PCB.

Looking at all the information contained in a PCB together, it becomes clear that the PCB does far more than simply track a process's execution state. Through the PCB, the operating system grasps and manages not just the process's execution flow, but its resource usage as well. In the end, a single PCB concentrates all management information about a given process.

PCB에 저장되는 정보 (마무리)

PCB에는 앞서 살펴본 정보들 외에도 몇 가지가 더 포함된다.

계정 정보다. 해당 프로세스가 CPU를 얼마나 사용했는지, 실제 사용 시간은 얼마인지, 사용 가능한 상한 시간은 얼마인지와 같은 자원 사용 내역이 기록된다. 계정 번호나 작업 번호 같은 식별 정보도 함께 포함된다.

입출력 상태 정보도 저장된다. 현재 이 프로세스가 어떤 입출력 작업을 요청했는지, 현재 열려 있는 파일은 무엇인지와 같은 정보가 PCB에 기록된다.

마지막으로 메모리 관리 정보다. 이 프로세스가 메모리의 어느 영역을 사용하고 있는지, 메모리 할당과 관련된 정보들이 PCB에 포함된다.

이렇게 PCB에 담긴 정보들을 종합해서 보면, PCB가 단순히 프로세스의 실행 상태만 관리하는 것이 아님을 알 수 있다. 운영체제는 PCB를 통해 프로세스의 실행 흐름은 물론, 자원 사용 현황까지 함께 파악하고 관리한다. 결국 PCB 하나에 해당 프로세스에 관한 모든 관리 정보가 집약되어 있는 셈이다.

The Key Role of the PCB in Context Switching

Among the various pieces of information stored in the PCB, the ones most critically used when a context switch occurs are register values and the program counter.

The moment a running process is interrupted, the operating system saves the register values and program counter at that point into the process's PCB. When another process runs and then switches back to this process, the operating system restores exactly the values that were saved in the PCB. As a result, the process can resume precisely from where it was interrupted, as if nothing had happened.

Because these two pieces of information are safely stored in the PCB, the operating system can rapidly switch among dozens of processes while fully maintaining each process's execution flow.

So what order are the PCB-stored values actually saved and restored in when a process switch occurs? Let's look at that step-by-step in the next section.

문맥 교환과 PCB의 핵심 역할

PCB에 저장되는 여러 정보 중에서 문맥 교환이 발생할 때 가장 핵심적으로 활용되는 것은 레지스터 값과 프로그램 카운터다.

프로세스가 실행되다가 중단되는 순간, 운영체제는 그 시점의 레지스터 값과 프로그램 카운터를 해당 프로세스의 PCB에 저장해둔다. 이후 다른 프로세스가 실행되다가 다시 이 프로세스로 전환되면, 운영체제는 PCB에 저장해뒀던 값들을 그대로 복원한다. 덕분에 프로세스는 마치 아무 일도 없었던 것처럼 중단된 지점부터 정확히 이어서 실행될 수 있다.

이 두 가지 정보가 PCB에 안전하게 보관되어 있기 때문에, 운영체제는 수십 개의 프로세스를 빠르게 전환하면서도 각 프로세스의 실행 흐름을 온전히 유지할 수 있는 것이다.

그렇다면 실제로 프로세스가 전환될 때 PCB에 저장된 정보들이 구체적으로 어떤 순서로 저장되고 복원되는지, 다음 내용에서 그 과정을 단계별로 살펴보자.

3. Context Switching

What Is a Context Switch?

In an environment where multiple processes run on a single CPU, there inevitably comes a moment when the currently running process must be stopped and the CPU handed to another process - for example, when a time slice expires due to timeout, or when a process must transition to the waiting state due to an I/O request.

At this moment, the operating system must handle two things simultaneously: safely interrupting the current process's execution flow, and preparing the next process to resume precisely from where it previously left off.

The operation that handles both of these at once is the context switch. A context switch is not simply a matter of handing the CPU to another process — it refers to the entire sequence of saving the current process's execution state to its PCB and restoring the previous state from the next process's PCB. This process is what allows multiple processes to take turns using a single CPU while each maintains its own execution flow.

문맥 교환이란?

하나의 CPU에서 여러 프로세스가 실행되는 환경에서는 현재 실행 중인 프로세스를 멈추고 다른 프로세스에게 CPU를 넘겨야 하는 순간이 반드시 생긴다. 타임아웃으로 사용 시간이 끝났거나, 입출력 요청으로 대기 상태로 전환되어야 하는 경우가 대표적인 예다.

이 순간 운영체제는 두 가지를 동시에 처리해야 한다. 현재 실행 중인 프로세스의 실행 흐름을 안전하게 중단시키는 것, 그리고 다음에 실행할 프로세스가 이전에 멈췄던 지점부터 정확히 이어서 실행될 수 있도록 준비하는 것이다.

이 두 가지를 한꺼번에 처리하는 동작이 바로 문맥 교환(Context Switch) 이다. 문맥 교환은 단순히 CPU를 다른 프로세스에게 넘기는 것이 아니라, 현재 프로세스의 실행 상태를 PCB에 저장하고 다음 프로세스의 PCB에서 이전 상태를 복원하는 일련의 과정 전체를 가리킨다. 이 과정이 있기 때문에 여러 프로세스가 하나의 CPU를 번갈아 사용하면서도 각자의 실행 흐름을 유지할 수 있는 것이다.

When Does a Context Switch Occur?

A context switch doesn't happen arbitrarily at any time. It only occurs at the moment the operating system determines it needs to change which process the CPU is executing.

There are three representative triggers. When an interrupt occurs in the running process; when the allocated time slice expires; and when an I/O request makes it impossible for the current process to continue executing. At these moments, the operating system decides whether to switch the CPU's execution target.

There's an important point here: an interrupt doesn't necessarily cause a context switch. Even if an interrupt occurs, if the operating system determines that the current process can continue running, no context switch takes place.

In other words, a context switch is not the interrupt itself - it is the result of the decision the operating system makes after the interrupt. An interrupt is merely a signal giving the operating system an opportunity to intervene; whether an actual context switch happens is a decision the operating system makes based on the circumstances.

문맥 교환은 언제 발생할까?

문맥 교환은 임의로 아무 때나 발생하는 것이 아니다. 운영체제가 CPU의 실행 대상을 바꿔야 한다고 판단한 순간에만 일어난다.

대표적인 계기는 세 가지다. 실행 중인 프로세스에 인터럽트가 발생했을 때, 할당된 타임 슬라이스가 종료되었을 때, 그리고 입출력 요청으로 인해 현재 프로세스가 더 이상 실행을 이어갈 수 없는 상황이 됐을 때다. 이런 시점에 운영체제는 CPU 실행 대상을 전환할지 여부를 결정한다.

여기서 한 가지 중요한 점이 있다. 인터럽트가 발생했다고 해서 반드시 문맥 교환이 일어나는 것은 아니라는 것이다. 인터럽트가 발생하더라도 운영체제가 현재 프로세스를 계속 실행해도 된다고 판단하면 문맥 교환은 일어나지 않는다.

즉 문맥 교환은 인터럽트 그 자체가 아니라, 인터럽트 이후 운영체제가 내린 판단의 결과다. 인터럽트는 운영체제에게 개입할 기회를 주는 신호일 뿐이고, 실제로 문맥 교환을 할지 말지는 운영체제가 상황을 보고 결정하는 것이다.

The Actual Sequence of a Context Switch

When a context switch occurs, the operating system proceeds in the following order.

First, it saves the execution position and CPU state of the currently running process. The program counter and register values discussed earlier are recorded in that process's PCB. This save must be completed so that when the process runs again later, it can resume exactly from where it stopped.

Once saving is complete, it retrieves and restores the execution position and CPU state stored in the next process's PCB. Through this restoration, the CPU reaches a state where it can pick up where the new process previously left off.

These two steps - saving the previous process's state to its PCB and restoring the next process's state from its PCB to switch the CPU's execution target - constitute the entire operation called a context switch.

Ultimately, context switching is only possible because of the PCB. Because each process's execution state is safely preserved in the PCB, the operating system can rapidly switch among dozens of processes while fully maintaining each process's execution flow.

문맥 교환이 발생하면 운영체제는 다음과 같은 순서로 동작한다.

먼저 현재 실행 중이던 프로세스의 실행 위치와 CPU 상태를 저장한다. 앞서 살펴본 프로그램 카운터와 레지스터 값들이 해당 프로세스의 PCB에 기록되는 것이다. 이 저장이 완료되어야 나중에 이 프로세스가 다시 실행될 때 중단된 지점부터 정확히 이어갈 수 있다.

저장이 끝나면 다음에 실행할 프로세스의 PCB에 저장되어 있던 실행 위치와 CPU 상태를 꺼내어 복원한다. 이 복원 과정을 통해 CPU는 새로운 프로세스가 이전에 멈췄던 지점부터 실행을 이어받을 수 있는 상태가 된다.

이 두 단계, 즉 이전 프로세스의 상태를 PCB에 저장하고 다음 프로세스의 상태를 PCB에서 복원하여 CPU의 실행 대상을 전환하는 전체 과정을 문맥 교환이라고 한다.

결국 문맥 교환은 PCB가 있기 때문에 가능한 동작이다. 각 프로세스의 실행 상태가 PCB에 안전하게 보관되어 있기 때문에, 운영체제는 수십 개의 프로세스를 빠르게 전환하면서도 각 프로세스의 실행 흐름을 온전히 유지할 수 있다.

The Step-by-Step Sequence of a Context Switch

Let's trace the specific sequence of a context switch through the transition between P1 and P2.

While P1 is running, its time slice expires and the operating system intervenes. The operating system first saves P1's current state ; its program counter and register values ; into PCB1. This is the moment P1's execution flow is safely preserved.

Once saving is complete, the operating system retrieves P2's PCB2 and restores the previously saved execution state. Through this restoration, the CPU reaches a state where it can resume P2 from where it previously stopped. The control of the CPU passes from P1 to P2, and P2 begins executing.

The same pattern repeats every time the process switches. The current process's state is saved to its PCB, and the next process's state is restored from its PCB.

The key here is that a context switch does not swap the processes themselves. The essence of a context switch is the repeated cycle of saving and restoring via the PCB. The processes remain in memory as they are; the operating system transfers execution states through the PCBs to switch which process the CPU is running.

문맥 교환의 실제 진행 순서

문맥 교환이 구체적으로 어떤 순서로 진행되는지 P1과 P2의 전환 과정을 통해 살펴보자.

P1이 실행 중이다가 타임 슬라이스가 종료되면 운영체제가 개입한다. 운영체제는 가장 먼저 현재 실행 중이던 P1의 상태, 즉 프로그램 카운터와 레지스터 값을 PCB1에 저장한다. P1의 실행 흐름이 안전하게 보관되는 순간이다.

저장이 완료되면 운영체제는 다음에 실행할 P2의 PCB2를 가져와 이전에 저장해뒀던 실행 상태를 복원한다. 이 복원 과정을 통해 CPU는 P2가 이전에 멈췄던 지점부터 이어서 실행할 수 있는 상태가 된다. 이렇게 CPU의 제어권이 P1에서 P2로 넘어가고 P2가 실행을 시작한다.

이후에도 프로세스가 바뀔 때마다 동일한 방식이 반복된다. 현재 프로세스의 상태는 PCB에 저장되고, 다음 프로세스의 상태는 PCB에서 복원된다.

여기서 핵심은 문맥 교환이 프로세스 자체를 바꾸는 것이 아니라는 점이다. PCB를 기준으로 저장과 복원을 반복하는 과정이 문맥 교환의 본질이다. 프로세스들은 메모리에 그대로 있고, 운영체제가 PCB를 통해 실행 상태를 주고받으며 CPU의 실행 대상을 전환하는 것이다.

Summarizing the Relationship Between PCBs and Context Switching

Let's pull together everything we've covered.

All of a process's state information is stored in the PCB. When a context switch occurs, the current process's state is saved to its PCB, and the next process's previous state is restored from its PCB — that is how process switching is carried out.

From this perspective, the PCB's role becomes clear. The PCB is not simply a place to record process information - it is the management standard that maintains the information needed to resume an interrupted process's execution. Without the PCB, there would be no way to know where a process stopped or what state it was in, making context switching impossible altogether.

In conclusion, a context switch can be summarized as the process of switching the execution target based on the PCB. Each time a process switches, saving and restoring via the PCB repeats, and through this process the operating system manages multiple processes sharing a single CPU without interruption.

PCB와 문맥 교환의 관계 정리

지금까지 살펴본 내용을 한 번 정리해보자.

프로세스의 상태 정보는 모두 PCB에 저장된다. 문맥 교환이 발생하면 현재 프로세스의 상태를 PCB에 저장하고, 다음 프로세스의 PCB에서 이전 상태를 복원하는 방식으로 프로세스 전환이 이루어진다.

이 관점에서 보면 PCB의 역할이 명확해진다. PCB는 단순히 프로세스 정보를 기록해두는 공간이 아니라, 중단된 프로세스의 실행을 다시 이어서 수행할 수 있도록 필요한 정보를 유지해주는 관리 기준이다. PCB가 없다면 프로세스가 어디서 멈췄는지, 어떤 상태였는지를 알 수 없기 때문에 문맥 교환 자체가 불가능하다.

결론적으로 문맥 교환은 PCB를 기준으로 실행 대상을 바꾸는 과정이라고 정리할 수 있다. 프로세스가 전환될 때마다 PCB에 저장과 복원이 반복되고, 운영체제는 이 과정을 통해 여러 프로세스가 하나의 CPU를 끊김 없이 나눠 쓸 수 있도록 관리한다.

The Overhead of Context Switching

Context switching is a necessary process that allows multiple processes to share the CPU. But it comes with a cost: while a context switch is taking place, the CPU cannot perform any actual work and comes to a momentary halt.

During this time, all the CPU does is save the previous process's state to the PCB and load the next process's state from the PCB. No actual computation takes place. This — consuming CPU time without performing real work — is what we call overhead.

The more frequently context switches occur, the more this overhead accumulates, ultimately affecting overall system performance. The more often processes switch, the less time the CPU has available for actual work.

Therefore, the operating system should not trigger context switches unconditionally — it must manage them so they occur only when necessary and as infrequently as possible. The number of context switches is directly linked to system performance. The specifics of how the operating system selects which process to run and how it moderates the number of switches will be explored in depth in the CPU scheduling chapter.

문맥 교환의 오버헤드

문맥 교환은 여러 프로세스가 CPU를 나눠 쓰기 위해 반드시 필요한 과정이다. 하지만 여기에는 한 가지 비용이 따른다. 문맥 교환이 일어나는 동안 CPU는 실제 작업을 수행하지 못하고 멈추게 된다는 것이다.

이 시간 동안 CPU가 하는 일은 이전 프로세스의 상태를 PCB에 저장하고, 다음 프로세스의 상태를 PCB에서 불러오는 것뿐이다. 실제 연산은 전혀 이루어지지 않는다. 이처럼 실제 작업은 하지 않으면서 CPU 시간이 소모되는 부분을 오버헤드라고 한다.

문맥 교환이 자주 발생할수록 이 오버헤드는 누적되고, 결과적으로 전체 시스템 성능에 영향을 미치게 된다. 프로세스 전환이 잦을수록 CPU가 실제 작업에 쓸 수 있는 시간이 그만큼 줄어드는 셈이다.

따라서 운영체제는 문맥 교환을 무조건 발생시키는 것이 아니라, 필요한 경우에만 최소한으로 일어날 수 있도록 관리해야 한다. 문맥 교환 횟수는 시스템 성능과 직접적으로 연결되어 있기 때문이다. 이와 관련된 구체적인 내용, 즉 운영체제가 어떤 기준으로 CPU 실행 대상을 선택하고 전환 횟수를 조율하는지는 CPU 스케줄링 챕터에서 본격적으로 다루게 된다.

The Relationship Between Time Slice Size and Context Switching

Let's examine how frequently context switches occur depending on time slice size, and how that affects system performance, through three cases.

Case A is when the time slice is large. A single process occupies the CPU for a long time, and context switches occur relatively infrequently. Because more time is spent on actual work, overhead is low. However, if one process monopolizes the CPU for too long, other processes may have to wait a long time.

Case B is when the time slice is appropriately sized. Process execution time and context switching are balanced. Multiple processes share the CPU evenly while overhead doesn't become excessive — an ideal state.

Case C is when the time slice is excessively small. Context switches occur repeatedly almost the instant a process starts running. The time spent saving and restoring state exceeds the time the CPU spends on actual work. Overhead accumulates significantly and overall system performance degrades.

In the end, if the time slice is too small, context switches occur excessively and overhead grows; if it's too large, a particular process ends up monopolizing the CPU. The number of context switches is directly linked to system performance, and finding the right balance is a central challenge in CPU scheduling.

타임 슬라이스 크기와 문맥 교환의 관계

타임 슬라이스의 크기에 따라 문맥 교환이 얼마나 자주 발생하는지, 그리고 그것이 시스템 성능에 어떤 영향을 주는지 세 가지 경우를 통해 살펴보자.

A는 타임 슬라이스가 큰 경우다. 하나의 프로세스가 오랜 시간 CPU를 점유하고, 문맥 교환은 상대적으로 드물게 발생한다. 실제 작업에 쓰이는 시간이 길기 때문에 오버헤드는 낮다. 다만 한 프로세스가 CPU를 너무 오래 독점하면 다른 프로세스가 오래 기다려야 하는 문제가 생길 수 있다.

B는 타임 슬라이스가 적당한 경우다. 프로세스 실행 시간과 문맥 교환이 균형을 이루고 있다. 여러 프로세스가 고르게 CPU를 나눠 쓰면서도 오버헤드가 과도하지 않은 이상적인 상태다.

C는 타임 슬라이스가 지나치게 작은 경우다. 프로세스가 실행을 시작하자마자 문맥 교환이 반복적으로 발생한다. CPU가 실제 작업을 수행하는 시간보다 상태를 저장하고 복원하는 데 쓰는 시간이 더 많아지는 상황이다. 오버헤드가 크게 누적되어 시스템 전체 성능이 떨어진다.

결국 타임 슬라이스가 너무 작으면 문맥 교환이 과도하게 발생해 오버헤드가 커지고, 너무 크면 특정 프로세스가 CPU를 독점하는 문제가 생긴다. 문맥 교환 횟수는 시스템 성능과 직접적으로 연결되어 있으며, 이 균형을 어떻게 맞출 것인가가 CPU 스케줄링의 핵심 과제가 된다.

The Next Challenges in Process Management

We've confirmed that context switching allows the operating system to switch between processes. But in an environment where multiple processes exist simultaneously, simply being able to switch is not enough.

When there are multiple processes, new problems naturally arise. When several processes are waiting, which one should run first? When multiple processes try to use the same resource at the same time, how should that be coordinated?

Ultimately, switching between processes is just the beginning. Maintaining the entire system stably and efficiently requires additional management beyond the switching itself — scheduling to determine execution order, and synchronization to coordinate resource use. What we'll explore going forward is precisely how these problems are solved.

프로세스 관리의 다음 과제

문맥 교환을 통해 운영체제가 프로세스 간의 전환을 수행할 수 있다는 것을 확인했다. 그런데 여러 프로세스가 동시에 존재하는 환경에서는 단순히 전환이 가능하다는 것만으로는 충분하지 않다.

프로세스가 여럿 존재한다면 자연스럽게 새로운 문제들이 생겨난다. 대기 중인 프로세스가 여러 개일 때 그 중 어떤 프로세스를 먼저 실행할 것인가, 여러 프로세스가 같은 자원을 동시에 사용하려 할 때 이를 어떻게 조율할 것인가와 같은 문제들이다.

결국 프로세스를 전환하는 것 자체는 시작에 불과하다. 시스템 전체를 안정적이고 효율적으로 유지하기 위해서는 전환 이후의 관리, 즉 어떤 순서로 실행할지를 결정하는 스케줄링과 자원 사용을 조율하는 동기화 같은 추가적인 관리 기법이 필요하다. 앞으로 살펴볼 내용들이 바로 이 문제들을 어떻게 해결하는지에 대한 것이다.

The Core Challenges of Process Management Ahead

In an environment where multiple processes run together, there are additional problems the operating system must solve. Let's lay out the key challenges we'll be examining one by one.

The first is CPU scheduling. Among multiple waiting processes, which one gets the CPU? In what order and by what criteria is execution order determined?

The second is memory management. When multiple processes must share limited memory, how is memory allocated to each process and managed efficiently?

The third is inter-process communication. When processes with independent memory spaces need to exchange information, how is that handled?

The fourth is synchronization. When multiple processes try to access the same data simultaneously, conflicts can occur. Managing this situation to maintain data consistency and prevent conflicts requires dedicated techniques.

These four are the core issues in process management we'll be covering going forward. For now, get a broad sense that these problems exist — the specific details of each will be examined one by one in the chapters ahead.

앞으로 다룰 프로세스 관리의 핵심 과제들

여러 프로세스가 함께 실행되는 환경에서 운영체제가 추가로 해결해야 할 문제들이 있다. 앞으로 하나씩 살펴볼 핵심 과제들을 먼저 정리해두자.

첫 번째는 CPU 스케줄링이다. 대기 중인 여러 프로세스 중에서 어떤 프로세스에게 CPU를 줄 것인지, 어떤 순서와 기준으로 실행 순서를 결정할 것인지의 문제다.

두 번째는 메모리 관리다. 한정된 메모리를 여러 프로세스가 나눠 써야 하는 상황에서 각 프로세스에게 메모리를 어떻게 할당하고 효율적으로 관리할 것인지의 문제다.

세 번째는 프로세스 간 통신이다. 서로 독립된 메모리 공간을 가진 프로세스들이 필요에 따라 정보를 주고받아야 할 때, 이를 어떤 방식으로 처리할 것인지의 문제다.

네 번째는 동기화다. 여러 프로세스가 동일한 데이터에 동시에 접근하려 할 때 충돌이 발생할 수 있다. 이런 상황에서 데이터의 일관성을 유지하고 충돌을 막기 위한 관리 방법이 필요하다.

이 네 가지가 앞으로 다루게 될 프로세스 관리의 핵심 이슈다. 지금은 이런 문제들이 존재한다는 것을 큰 그림으로 파악해두고, 각 항목에 대한 구체적인 내용은 이후에 하나씩 살펴보게 될 것이다.

3️⃣ Process Execution and Control

Process Execution System Calls: fork()

Before directly observing process execution and control in a Ubuntu environment, let's first define the relevant terminology. There are representative system calls the operating system uses to manage processes from creation to termination. The first one we'll look at is fork().

fork() is a system call that creates one new process based on the currently running process. The newly created process is called the child process, and the original process is called the parent process.

When fork() is called, the parent process's memory space is logically duplicated to create the new process. The two processes created this way have different PIDs and run completely independently after creation.

There's an important point here. Code written after the fork() call is executed in both the parent and the child process. In other words, the execution flow splits into two branches after fork(), with each process independently continuing its own flow. Understanding this characteristic is essential for accurately grasping how code behaves after fork().

프로세스 실행 관련 시스템 호출; fork()

우분투 환경에서 프로세스의 실행과 제어를 직접 확인하기에 앞서, 먼저 관련 용어를 정리해두자. 운영체제가 프로세스의 생성부터 종료까지 관리하는 데 사용하는 대표적인 시스템 호출들이 있다. 그 중 첫 번째로 살펴볼 것이 fork()다.

fork()는 현재 실행 중인 프로세스를 기준으로 새로운 프로세스를 하나 더 생성하는 시스템 호출이다. 이때 새롭게 만들어진 프로세스를 자식 프로세스, 기존의 프로세스를 부모 프로세스라고 부른다.

fork()가 호출되면 부모 프로세스의 메모리 공간이 논리적으로 복제되어 새로운 프로세스가 만들어진다. 이렇게 생성된 두 프로세스는 서로 다른 PID를 가지게 되며, 생성 이후에는 완전히 독립적으로 실행된다.

여기서 중요한 점이 있다. fork() 호출 이후에 작성된 코드는 부모 프로세스와 자식 프로세스 모두에서 실행된다는 것이다. 즉 fork() 이후의 실행 흐름이 두 갈래로 나뉘어, 각 프로세스가 자신의 흐름을 독립적으로 이어서 실행하게 된다. 이 특성을 이해하고 있어야 fork() 이후의 코드 동작을 정확히 파악할 수 있다.

Distinguishing Parent from Child Using fork()'s Return Value

When fork() is called, both the parent and child processes execute the same code simultaneously. So how can you tell whether the currently executing code is running in the parent or the child? The answer is through fork()'s return value.

The return value falls into three cases. In the parent process, the PID of the newly created child process is returned. In the child process, 0 is returned. And if process creation fails, -1 is returned.

Using this difference in return values, you can clearly determine within the code after fork() whether the current execution context is the parent or the child. If the return value is 0, it's running in the child process; if the value is greater than 0, it's running in the parent. By using this return value as a condition to branch the code inside the program, you can control the parent and child to perform different behaviors.

fork()의 반환값으로 부모와 자식 구분하기

fork()를 호출하면 부모와 자식 프로세스가 동시에 같은 코드를 실행하게 된다. 그렇다면 현재 실행 중인 코드가 부모 프로세스에서 돌아가고 있는지, 자식 프로세스에서 돌아가고 있는지를 어떻게 구분할 수 있을까? 바로 fork()의 반환값을 통해 구분할 수 있다.

반환값은 세 가지 경우로 나뉜다. 부모 프로세스에서는 새로 생성된 자식 프로세스의 PID가 반환된다. 자식 프로세스에서는 0이 반환된다. 그리고 프로세스 생성에 실패한 경우에는 -1이 반환된다.

이 반환값의 차이를 이용하면 fork() 이후의 코드 안에서 지금 실행 중인 주체가 부모인지 자식인지를 명확하게 구분할 수 있다. 반환값이 0이면 자식 프로세스에서 실행 중인 것이고, 0보다 큰 값이면 부모 프로세스에서 실행 중인 것이다. 프로그램 안에서 이 반환값을 조건으로 분기를 나누면 부모와 자식이 서로 다른 동작을 수행하도록 제어할 수 있다.

The Execution Flow After fork()

Let's trace the process by which a single process splits into two execution flows via fork().

When the program starts, the main function executes, and the moment fork() is called, a new child process is created. From this point on, both the parent and child process exist simultaneously, and both continue executing from the code immediately after fork().

After that, the two processes execute different code blocks based on the return value. Because the child process receives 0, it executes the child's code block; the parent process receives the child's PID and executes the parent's code block. After each executes its respective code block, the two processes independently finish and terminate.

There's one thing that must be remembered here: it is not predetermined which process - parent or child - will run first. The parent might run first, or the child might. Because this depends on the operating system's scheduling decisions, the output order of the program can change every time it runs.

Ultimately, after fork(), a single piece of code runs separately in both the parent and child processes, and the order of execution is decided by the operating system.

fork() 이후의 실행 흐름

fork()를 통해 하나의 프로세스가 두 개의 실행 흐름으로 나뉘는 과정을 살펴보자.

프로그램이 시작되면 메인 함수가 실행되고, fork()가 호출되는 순간 새로운 자식 프로세스가 생성된다. 이 시점부터 부모 프로세스와 자식 프로세스가 동시에 존재하게 되며, 둘 다 fork() 다음 코드부터 실행을 이어간다.

이후 두 프로세스는 반환값을 기준으로 서로 다른 코드 블록을 실행한다. 자식 프로세스는 반환값이 0이기 때문에 자식 프로세스용 코드 블록을 실행하고, 부모 프로세스는 자식의 PID를 반환받아 부모 프로세스용 코드 블록을 실행한다. 각자의 코드 블록을 실행한 후 두 프로세스는 서로 독립적으로 실행을 마치고 종료된다.

여기서 반드시 기억해야 할 점이 있다. 부모와 자식 중 어느 프로세스가 먼저 실행될지는 정해져 있지 않다는 것이다. 부모가 먼저 실행될 수도 있고, 자식이 먼저 실행될 수도 있다. 이는 운영체제의 스케줄링 결정에 따라 달라지기 때문에 프로그램의 출력 결과는 실행할 때마다 순서가 바뀔 수 있다.

결국 fork() 이후에는 하나의 코드가 부모와 자식 프로세스에서 각각 실행되며, 그 실행 순서는 운영체제가 결정한다는 점을 이해해두어야 한다.

exec() — The System Call That Changes What a Process Runs

The system call used when an already-created process needs to switch to running a different program is exec(). It replaces the currently running process entirely with a new program.

The key here is that no new process is created. The process itself — its PID — remains the same. However, the code region, data region, and stack region that the process was running are all completely replaced with the contents of the new program. Think of it as keeping the process's shell intact while swapping out everything running inside it.

The outcome of an exec() call is also clearly defined. If the call succeeds, the original program's execution flow is completely terminated, and any code written after exec() does not execute — because there is no flow to return to once the replacement has occurred. Conversely, if the call fails, no replacement takes place, so the original program's flow continues and the code on the next line after exec() executes.

exec() is commonly used together with fork(). The typical pattern is to create a child process with fork() and then call exec() within the child to run an entirely different program.

exec() — 실행 내용을 바꾸는 시스템 호출

이미 생성된 프로세스가 실행할 프로그램을 바꿔야 할 때 사용하는 시스템 호출이 exec()다. 현재 실행 중인 프로세스를 완전히 새로운 프로그램으로 교체하는 역할을 한다.

여기서 핵심은 새로운 프로세스가 만들어지는 것이 아니라는 점이다. 프로세스 자체, 즉 PID는 그대로 유지된다. 하지만 그 프로세스가 실행하던 코드 영역, 데이터 영역, 스택 영역이 모두 새로운 프로그램의 내용으로 완전히 교체된다. 프로세스라는 껍데기는 그대로 두고 안에서 실행되는 내용만 통째로 바꾸는 것이라고 이해하면 된다.

exec()의 동작 결과도 명확하게 구분된다. 호출이 성공하면 기존 프로그램의 실행 흐름은 완전히 종료되고, exec() 이후에 작성된 코드는 실행되지 않는다. 이미 새로운 프로그램으로 교체되었기 때문에 돌아올 흐름 자체가 없어지는 것이다. 반대로 호출이 실패한 경우에는 교체가 이루어지지 않았기 때문에 기존 프로그램의 흐름이 그대로 유지되어 exec() 다음 줄의 코드가 실행된다.

exec()는 보통 fork()와 함께 사용되는 경우가 많다. fork()로 자식 프로세스를 생성한 뒤, 자식 프로세스에서 exec()를 호출해 전혀 다른 프로그램을 실행시키는 패턴이 대표적인 활용 방식이다.

Analyzing the exec() Execution Flow Example

This example is code that lets you directly observe how a process's execution flow changes before and after exec() is called. Let's follow the flow step by step.

First, a message is printed via printf before exec() is called. Up to this point, the original program's flow is executing normally.

Next, execl("/bin/ls", "ls", NULL) is called. The moment this single line executes, the current process is completely replaced by /bin/ls — the ls program that outputs a directory's file listing. The meaning of each argument is as follows.

The first argument, /bin/ls, is the full path of the file the operating system will actually execute. The operating system looks at this path to decide which file to run. The second argument, ls, specifies the name the executed program will recognize itself by. Even if you change the second argument to hello, the operating system will still execute /bin/ls, but internally the program will recognize its own name as hello. To avoid confusion, it is conventional to pass the same name as the executable file. The final NULL is a marker telling the operating system that the argument list ends here. Because exec-family functions can accept multiple arguments, the end must be explicitly indicated.

Once execl() succeeds, the original program's flow is completely gone. That's why the printf("exec 호출 후\n") written below it does not execute — there is no flow to return to.

The core takeaway from this example is one thing: exec() does not create a new process — it completely replaces the current process's execution content.

exec() 실행 흐름 예제 분석

이 예제는 exec() 호출 전후로 프로세스의 실행 흐름이 어떻게 바뀌는지를 직접 확인할 수 있는 코드다. 흐름을 단계별로 따라가보자.

먼저 printf를 통해 exec() 호출 전 메시지가 출력된다. 여기까지는 기존 프로그램의 흐름이 그대로 실행되는 구간이다.

그다음 execl("/bin/ls", "ls", NULL)이 호출된다. 이 한 줄이 실행되는 순간 현재 프로세스는 /bin/ls, 즉 디렉토리 파일 목록을 출력하는 ls 프로그램으로 완전히 교체된다. 각 인자의 의미를 살펴보면 다음과 같다.

첫 번째 인자 /bin/ls는 운영체제가 실제로 실행할 파일의 전체 경로다. 운영체제는 이 경로를 보고 어떤 파일을 실행할지 결정한다. 두 번째 인자 ls는 실행된 프로그램이 자기 자신을 어떤 이름으로 인식할지를 지정하는 값이다. 만약 두 번째 인자를 hello로 바꿔도 운영체제는 여전히 /bin/ls를 실행하지만, 실행된 프로그램 내부에서는 자신을 hello라는 이름으로 인식하게 된다. 혼란을 피하기 위해 실행 파일의 이름과 프로그램이 인식하는 이름을 동일하게 전달하는 것이 관례다. 마지막 NULL은 인자 목록이 여기서 끝났음을 운영체제에게 알려주는 표식이다. exec 계열 함수는 여러 인자를 받을 수 있기 때문에 끝을 명시적으로 표시해줘야 한다.

execl() 호출이 성공하면 기존 프로그램의 흐름은 완전히 사라진다. 그래서 그 아래에 작성된 printf("exec 호출 후\n")는 실행되지 않는다. 돌아올 흐름 자체가 없어지기 때문이다.

이 예제에서 확인할 수 있는 핵심은 하나다. exec()는 새로운 프로세스를 만드는 것이 아니라, 현재 프로세스의 실행 내용을 완전히 교체한다는 것이다.

The Combination of fork() and exec()

In practice, exec() is rarely used alone — it is almost always used together with fork(). Let's look at how this combination works.

First, fork() is called, creating parent and child processes. The child process initially executes the same code as the parent, but the moment execl("/bin/ls", "ls", NULL) is called on the child's side, the child's execution content is completely replaced by the ls program. Afterward, the parent continues executing its own code while the child, now replaced by ls, outputs the directory file listing. As a result, the parent's output and the ls output from the child both appear together.

This pattern is important because it is the fundamental way commands are executed in a shell. When a user types a command in the terminal, the shell creates a child process with fork(), then calls exec() in that child to replace it with the program corresponding to the entered command. The parent shell process remains intact and prepares to receive the next command, while the child process handles command execution. This structure is exactly why the shell becomes the parent process of user programs in the process tree we examined earlier.

fork()와 exec()의 조합

실제 운영체제에서 exec()는 단독으로 사용되기보다 대부분 fork()와 함께 사용된다. 이 조합이 어떻게 동작하는지 살펴보자.

먼저 fork()가 호출되면 부모 프로세스와 자식 프로세스가 생성된다. 자식 프로세스는 처음에는 부모와 동일한 코드를 실행하다가, 자식 프로세스 쪽에서 execl("/bin/ls", "ls", NULL)이 호출되는 순간 자식 프로세스의 실행 내용이 ls 프로그램으로 완전히 교체된다. 이후 부모 프로세스는 자신의 코드를 그대로 이어서 실행하고, 자식 프로세스는 ls 프로그램으로 교체되어 디렉토리 파일 목록을 출력한다. 결과적으로 부모 프로세스의 출력과 자식 프로세스에서 실행된 ls의 결과가 함께 나타나게 된다.

이 패턴이 중요한 이유는 바로 쉘에서 명령어를 실행할 때 사용되는 기본 방식이기 때문이다. 사용자가 터미널에 명령어를 입력하면 쉘은 fork()로 자식 프로세스를 생성하고, 그 자식 프로세스에서 exec()를 호출해 입력한 명령어에 해당하는 프로그램으로 교체한다. 부모인 쉘 프로세스는 그대로 유지되면서 다음 명령어를 받을 준비를 하고, 자식 프로세스가 명령어를 실행하는 것이다. 앞서 프로세스 트리에서 쉘이 사용자 프로그램들의 부모 프로세스가 된다고 했던 것이 바로 이 구조 때문이다.

exit() — The System Call for Terminating a Process

exit() is the system call used to terminate a running process. The moment it is called, the current process terminates immediately, and no code executes afterward. Like exec() examined earlier, it is a function from which execution does not continue after it's called.

When a process calls exit(), it notifies the operating system that this process is terminating. In this process, the operating system cleans up the resources the process had been using — memory, open files, and so on — and returns them to the system. The resources the process had been occupying are reclaimed at this point.

There's one more thing to note here. The termination information passed through exit() is delivered to the operating system as well. This information is stored so that the parent process can retrieve it later. A structure exists that allows the parent process to find out how the child process terminated. How this is actually handled in practice will be seen concretely through the wait() system call we'll look at next.

exit() — 프로세스 종료 시스템 호출

exit()는 실행 중인 프로세스를 종료할 때 사용하는 시스템 호출이다. 호출되는 순간 현재 프로세스는 즉시 종료되며, 이후 어떤 코드도 실행되지 않는다. 앞서 살펴본 exec()와 마찬가지로 호출 이후 실행 흐름이 이어지지 않는 함수다.

프로세스가 exit()를 호출하면 운영체제에게 이 프로세스가 종료됨을 알린다. 이 과정에서 운영체제는 해당 프로세스가 사용하던 메모리, 열려 있던 파일 같은 자원들을 정리해서 다시 시스템으로 돌려보낸다. 프로세스가 점유하고 있던 자원이 이 시점에 회수되는 것이다.

여기서 한 가지 더 짚고 넘어갈 점이 있다. exit()를 통해 전달되는 종료 정보가 운영체제에 함께 전달된다는 것이다. 이 정보는 부모 프로세스가 나중에 확인할 수 있도록 보관된다. 자식 프로세스가 어떻게 종료되었는지를 부모 프로세스가 알 수 있는 구조가 마련되어 있는 셈이다. 이 부분이 실제로 어떻게 처리되는지는 다음에 살펴볼 wait() 시스템 호출을 통해 구체적으로 확인하게 될 것이다.

Analyzing the exit() Example

The moment exit(0) is called, the process terminates immediately. As a result, the printf("이 문장은 실행되지 않는다.") written below it does not execute. Even though the code exists, the flow after exit() is completely blocked.

Let's also note the meaning of the number passed to exit() — the exit status value. 0 means the process terminated normally without any problems. Values other than 0, such as 1 or 2, are used to distinguish what kind of error occurred. As mentioned earlier, this exit status value is passed to the operating system so the parent process can retrieve it later.

The core takeaway from this example is one thing: once exit() is called, no subsequent code ever executes, and the program's execution ends immediately on the spot.

exit() 예제 분석

exit(0)이 호출되는 순간 프로세스는 즉시 종료된다. 그렇기 때문에 그 아래에 작성된 printf("이 문장은 실행되지 않는다.")는 실행되지 않는다. 코드가 존재하더라도 exit() 이후의 흐름은 완전히 차단되는 것이다.

여기서 exit()에 전달되는 숫자, 즉 종료 상태값의 의미도 짚어두자. 0은 프로세스가 문제 없이 정상적으로 종료되었음을 뜻한다. 반면 1이나 2 같은 0 이외의 값들은 어떤 종류의 오류가 발생했는지를 구분하는 데 사용된다. 이 종료 상태값은 앞서 언급했듯이 부모 프로세스가 나중에 확인할 수 있도록 운영체제에 전달된다.

이 예제를 통해 확인할 수 있는 핵심은 하나다. exit()가 호출되면 그 이후의 코드는 절대 실행되지 않으며, 프로그램의 실행이 그 자리에서 즉시 끝난다는 것이다.

wait() — The System Call for Waiting on a Child Process to Terminate

wait() is a system call that causes a parent process to wait until its child process terminates.

When a parent process calls wait(), if the child process is still running, the parent enters the waiting state. Only when the child process terminates does the parent process resume execution. In this process, the parent process can also retrieve the child's exit status value. The exit status value we saw earlier with exit() is delivered to the parent process at this moment.

One of wait()'s important roles is controlling execution order. Using wait() guarantees that the parent process won't terminate before the child. We said that after fork(), which runs first — parent or child — depends on the operating system's scheduling; by using wait(), you can clearly establish that the parent only moves on to the next step after the child has fully terminated. For this reason, wait() is also a system call used for synchronization between parent and child processes.

wait() — 자식 프로세스의 종료를 기다리는 시스템 호출

wait()는 부모 프로세스가 자식 프로세스가 종료될 때까지 기다리도록 만드는 시스템 호출이다.

부모 프로세스가 wait()를 호출하면 자식 프로세스가 아직 실행 중인 경우 부모는 대기 상태로 들어간다. 자식 프로세스가 종료되면 그제서야 부모 프로세스가 다시 실행을 이어간다. 이 과정에서 부모 프로세스는 자식 프로세스의 종료 상태값도 함께 회수할 수 있다. 앞서 exit()에서 살펴봤던 종료 상태값이 바로 이 시점에 부모 프로세스에게 전달되는 것이다.

wait()의 중요한 역할 중 하나는 실행 순서의 제어다. wait()를 사용하면 부모 프로세스가 자식보다 먼저 종료되지 않는다는 것이 보장된다. fork() 이후에는 부모와 자식 중 누가 먼저 실행될지 운영체제의 스케줄링에 따라 달라진다고 했는데, wait()를 사용하면 자식이 완전히 종료된 이후에 부모가 다음 단계로 넘어간다는 순서를 명확히 정할 수 있다. 이런 이유로 wait()는 부모와 자식 프로세스 간의 동기화에 사용되는 시스템 호출이기도 하다.

Analyzing the wait() Example

This example shows how fork(), exit(), and wait() are used together to control the execution order of parent and child processes.

First, a child process is created via pid_t pid = fork(). The child process then prints a "child process running" message, waits briefly with sleep(2), and terminates with exit(0).

Meanwhile, after fork(), the parent process doesn't immediately execute the next code — it calls wait(NULL) and stays in the waiting state until the child process fully terminates. Only after the child terminates with exit(0) does the parent wake up from the waiting state, print the "parent running after child terminated" message, and continue executing the subsequent code.

This example makes wait()'s role clear. Without wait(), there's no guarantee which of the parent and child runs first; but by using wait(), it is solidly guaranteed that the parent runs after the child terminates. Ultimately, wait() should be understood as the means by which a parent process waits for its child's termination and handles the result.

wait() 예제 분석

이 예제는 fork(), exit(), wait()가 함께 사용되어 부모와 자식 프로세스의 실행 순서가 어떻게 제어되는지를 보여준다.

먼저 pid_t pid = fork()를 통해 자식 프로세스가 생성된다. 이후 자식 프로세스는 "자식 프로세스 실행" 메시지를 출력하고, sleep(2)로 잠시 대기한 뒤 exit(0)으로 종료된다.

한편 부모 프로세스는 fork() 이후 곧바로 다음 코드를 실행하지 않고 wait(NULL)을 호출하여 자식 프로세스가 완전히 종료될 때까지 대기 상태로 머문다. 자식 프로세스가 exit(0)으로 종료되고 나서야 부모 프로세스가 대기 상태에서 깨어나 "자식 종료 후 부모 실행" 메시지를 출력하며 이후 코드를 이어서 실행한다.

이 예제를 통해 wait()의 역할이 명확하게 드러난다. wait()가 없다면 부모와 자식 중 누가 먼저 실행될지 보장할 수 없지만, wait()를 사용함으로써 자식이 종료된 이후에 부모가 실행된다는 순서가 확실하게 보장된다. 결국 wait()는 부모 프로세스가 자식 프로세스의 종료를 기다리고 그 결과를 처리하는 방법이라고 이해하면 된다.

Zombie Processes

exit() and wait() must be used as a matched pair. When the two don't properly interlock, a state called a zombie process occurs.

A zombie process is a process whose execution has already ended but which hasn't been fully cleaned up and remains in the system as a trace. When a child process calls exit() to terminate, the operating system temporarily holds onto the exit status value so the parent process can retrieve it. But if the parent process never calls wait(), no one retrieves that information. Even though the child's execution is over, its termination information hasn't been collected, so it can't be fully cleaned up and remains in the system.

A zombie process itself doesn't use the CPU, so it might not seem like a major problem at first. But as zombie processes accumulate, they continue to occupy system resources like PIDs — and over time, this leads to waste of system resources. Therefore, when writing a program that creates child processes, the parent process must always properly handle the child's termination through wait().

좀비 프로세스

exit()와 wait()는 짝을 이루어 사용되어야 한다. 이 둘이 제대로 맞물리지 않으면 좀비 프로세스라는 상태가 발생한다.

좀비 프로세스란 자식 프로세스의 실행은 이미 끝났지만 완전히 정리되지 않고 시스템에 흔적으로 남아있는 상태의 프로세스를 말한다. 자식 프로세스가 exit()를 호출해 종료되면 운영체제는 그 종료 상태값을 부모 프로세스가 확인할 수 있도록 잠시 보관해둔다. 그런데 부모 프로세스가 wait()를 호출하지 않으면 이 정보를 아무도 회수하지 않는 상황이 된다. 자식 프로세스는 실행이 끝났음에도 불구하고 종료 정보가 회수되지 않아 완전히 정리되지 못한 채 시스템에 남게 되는 것이다.

좀비 프로세스 자체는 CPU를 사용하지 않기 때문에 당장 큰 문제처럼 보이지 않을 수 있다. 하지만 좀비 프로세스가 누적되면 PID와 같은 시스템 자원을 계속 점유하게 되어 장기적으로 시스템 자원 낭비로 이어질 수 있다. 따라서 자식 프로세스를 생성하는 프로그램을 작성할 때는 반드시 부모 프로세스에서 wait()를 통해 자식의 종료를 올바르게 처리해주어야 한다.

How Zombie Processes Are Created

Let's look specifically at the path through which a zombie process comes into being.

When a child process calls exit(), its execution ends. At this point, the operating system must hold onto the child's termination information so the parent can retrieve it. But if the parent process never calls wait(), there's no way to deliver that information. And the operating system can't simply discard it either. The result is that the operating system leaves the child process's PCB in the kernel area as-is.

This is exactly the zombie process state. The execution is over, but the PCB hasn't been reclaimed, so the process remains in the system as a trace.

To summarize: a zombie process occurs when exit() and wait() fail to pair up. The child has terminated via exit(), but the parent hasn't retrieved the termination information via wait(). Only when these two system calls properly interlock is the child process's PCB fully cleaned out of the kernel and the process's lifecycle completely concluded.

좀비 프로세스가 만들어지는 과정

좀비 프로세스가 어떤 경로로 생겨나는지 그 과정을 구체적으로 살펴보자.

자식 프로세스가 exit()를 호출하면 실행은 끝난다. 이때 운영체제는 자식 프로세스의 종료 정보를 부모 프로세스가 확인할 수 있도록 보관해두어야 한다. 그런데 부모 프로세스가 wait()를 호출하지 않으면 이 종료 정보를 전달할 방법이 없다. 그렇다고 운영체제가 이 정보를 그냥 버릴 수도 없다. 결국 운영체제는 자식 프로세스의 PCB를 커널 영역에 그대로 남겨두게 된다.

바로 이 상태가 좀비 프로세스다. 실행은 이미 끝났지만 PCB가 회수되지 않아 시스템에 흔적으로 남아있는 프로세스인 것이다.

정리하면 좀비 프로세스는 exit()와 wait()가 짝을 이루지 못할 때 발생한다. 자식은 exit()로 종료했지만 부모가 wait()로 종료 정보를 회수하지 않은 상황이다. 이 두 시스템 호출이 제대로 맞물려야만 자식 프로세스의 PCB가 커널에서 완전히 정리되고, 프로세스의 생명 주기가 온전히 마무리된다.

The Impact of Zombie Processes on the System

Because a zombie process has finished executing, it doesn't use the CPU or perform any computation. So it can appear as though there's no immediate problem.

However, a zombie process's PCB remains in the kernel area, meaning it continues to occupy system resources like PIDs. PIDs are a limited resource in the system. As zombie processes begin to accumulate one by one, the available PID pool shrinks, and in extreme cases, the system can reach a point where new processes can no longer be created.

In the end, as zombie processes accumulate, PID resources are wasted and the operating system's management efficiency degrades ; a state where resources are being held without any execution taking place. This is why the parent process must always properly handle child process termination through wait().

좀비 프로세스가 시스템에 미치는 영향

좀비 프로세스는 실행이 끝난 상태이기 때문에 CPU를 사용하거나 어떤 연산을 수행하지는 않는다. 그래서 당장 눈에 띄는 문제가 없는 것처럼 보일 수 있다.

하지만 좀비 프로세스는 PCB가 커널 영역에 그대로 남아있기 때문에 PID와 같은 시스템 자원을 계속 점유하고 있다. PID는 시스템에서 사용할 수 있는 개수가 한정되어 있는 자원이다. 좀비 프로세스가 하나둘 쌓이기 시작하면 사용 가능한 PID가 그만큼 줄어들고, 극단적인 경우에는 새로운 프로세스를 생성할 수 없는 상황까지 이어질 수 있다.

결국 좀비 프로세스가 누적될수록 PID 자원이 낭비되고 운영체제의 관리 효율도 떨어지게 된다. 실행도 하지 않으면서 자원만 붙들고 있는 상태가 계속되는 것이다. 이것이 부모 프로세스에서 반드시 wait()를 통해 자식 프로세스의 종료를 제대로 처리해주어야 하는 이유다.

Analyzing the Zombie Process Example

This example is code that lets you directly observe how a zombie process is created when wait() is absent.

The child process prints a "Child process terminated" message and immediately terminates. Meanwhile, the parent process, without calling wait(), prints a "Parent process running" message and enters the waiting state via sleep(2).

The output order can change every time the program runs, because after fork(), which of the parent or child runs first is determined by the operating system's scheduling. But what's important in this example is not the output order. The key is that a situation has been created where the child process can terminate first while the parent hasn't called wait().

Even after the child terminates, if the parent never calls wait(), the operating system cannot immediately clean up the child's termination information. So it leaves part of the child's PCB in the kernel temporarily. This state — execution finished but termination information not yet collected — is exactly the condition that produces a zombie process.

Ultimately, this example illustrates what goes wrong when exit() and wait() fail to pair up. Any program that creates child processes must use wait() to properly handle the child's termination.

좀비 프로세스 발생 예제 분석

이 예제는 wait()가 없을 때 좀비 프로세스가 어떻게 만들어지는지를 직접 확인할 수 있는 코드다.

자식 프로세스는 "Child process 종료" 메시지를 출력한 뒤 바로 종료된다. 반면 부모 프로세스는 wait()를 호출하지 않은 채 "Parent process 실행중" 메시지를 출력하고 sleep(2)로 대기 상태에 들어간다.

출력 순서는 실행할 때마다 달라질 수 있다. fork() 이후에는 부모와 자식 중 누가 먼저 실행될지 운영체제의 스케줄링에 따라 결정되기 때문이다. 하지만 이 예제에서 중요한 것은 출력 순서가 아니다. 핵심은 부모 프로세스가 wait()를 호출하지 않은 상태에서 자식 프로세스가 먼저 종료될 수 있는 상황이 만들어졌다는 점이다.

자식이 종료된 이후에도 부모가 wait()를 호출하지 않으면 운영체제는 자식 프로세스의 종료 정보를 바로 정리하지 못한다. 그래서 커널에 자식 프로세스의 PCB 일부를 잠시 남겨두게 된다. 실행은 끝났지만 종료 정보가 회수되지 않은 이 상태가 바로 좀비 프로세스가 발생하는 조건이다.

결국 이 예제는 exit()와 wait()가 짝을 이루지 못했을 때 어떤 문제가 생기는지를 보여주는 사례다. 자식 프로세스를 생성하는 프로그램에서는 반드시 wait()를 통해 자식의 종료를 올바르게 처리해주어야 한다.

Ubuntu Practice

fork() Lab

The core takeaway from this lab is that after fork(), the parent and child processes have completely independent execution flows.

The moment fork() is called, the two processes go their separate ways. Which one runs first afterward is not determined by the program code ;it is decided by the operating system's scheduling. That's why even running the same code can produce output in a different order each time.

This is something that must always be kept in mind when writing programs that deal with processes. You should never try to control the execution order after fork() through the order in which code is written. If a guaranteed order is required, synchronization mechanisms like wait() must be used explicitly. In the end, this lab is an example that simultaneously demonstrates process independence, the non-determinism of scheduling, and the necessity of system calls to control it.

이번 실습을 통해 확인할 수 있는 핵심은 fork() 이후 부모와 자식 프로세스가 완전히 독립적인 실행 흐름을 가진다는 점이다.

fork()가 호출되는 순간 두 프로세스는 각자의 길을 간다. 이후 어느 쪽이 먼저 실행될지는 프로그램 코드가 결정하는 것이 아니라 운영체제의 스케줄링에 의해 결정된다. 그렇기 때문에 같은 코드를 실행하더라도 실행할 때마다 출력 순서가 달라질 수 있다.

이 점은 앞으로 프로세스를 다루는 프로그램을 작성할 때 반드시 염두에 두어야 할 부분이다. fork() 이후의 실행 순서를 코드 작성 순서로 제어하려 해서는 안 된다. 순서를 보장해야 하는 상황이라면 wait()와 같은 동기화 수단을 명시적으로 사용해야 한다. 결국 이번 실습은 프로세스의 독립성과 스케줄링의 비결정성, 그리고 그것을 제어하기 위한 시스템 호출의 필요성을 한꺼번에 보여주는 예제라고 할 수 있다.

exec() Lab

This example is code that lets you directly observe how a process's execution flow changes when exec() is called.

Looking at the output, the content written before the exec() call appears normally, but the code written after the exec() call does not produce output. The moment exec() succeeds, the current process's code and execution content are completely replaced by a new program. Once the replacement occurs, there is no flow to return to the original program, so the code written below it absolutely does not execute.

The core takeaway from this example is one thing: exec() does not create a new process — it replaces the entire execution content of the current process. The process itself is preserved; only the program running inside it is replaced. This example clearly demonstrates the distinction between fork() ; a call that creates a new process ; and exec(); a call that changes the content of an existing one.

이 예제는 exec()가 호출되었을 때 기존 프로그램의 실행 흐름이 어떻게 바뀌는지를 직접 확인할 수 있는 코드다.

실행 결과를 보면 exec() 호출 전에 작성된 출력은 정상적으로 나타나지만, exec() 호출 후에 작성된 코드는 출력되지 않는다. exec()가 성공하는 순간 현재 프로세스의 코드와 실행 내용이 완전히 새로운 프로그램으로 교체되기 때문이다. 교체가 이루어진 이후에는 기존 프로그램으로 돌아올 흐름 자체가 사라지기 때문에 그 아래에 작성된 코드는 절대 실행되지 않는다.

이 예제를 통해 확인할 수 있는 핵심은 하나다. exec()는 새로운 프로세스를 만드는 것이 아니라 현재 프로세스의 실행 내용을 통째로 바꾸는 것이라는 점이다. 프로세스 자체는 그대로 유지되고 그 안에서 실행되는 프로그램만 교체된다. fork()가 프로세스를 새로 만드는 호출이라면, exec()는 기존 프로세스의 내용을 바꾸는 호출이라는 차이를 이 예제가 명확하게 보여준다.

fork() + exec() Combined Lab

This example demonstrates the typical usage flow of fork() and exec() — the most commonly used combination in actual operating systems.

One thing that stands out is that the output order differs every time the program runs. The parent process prints its own message, while the child process is replaced via exec() with the ls program and outputs the directory listing. Which of the two runs first is determined by the operating system's scheduling, which is why the order of results changes with each run.

The core takeaway from this example is the basic process creation pattern in which fork() and exec() are used together. Creating a new process with fork() and then calling exec() in the child to replace it with an entirely different program — this structure is the most fundamental way programs are executed in an operating system. The pattern the shell uses to execute user commands follows exactly this approach. In a single example, you can confirm both concepts: the division of roles between fork() and exec(), and the non-determinism of execution order.

이 예제는 실제 운영체제에서 가장 많이 사용되는 fork()와 exec()의 전형적인 사용 흐름을 보여준다.

실행할 때마다 출력 순서가 매번 달라진다는 점이 눈에 띈다. 부모 프로세스는 자신의 메시지를 출력하고, 자식 프로세스는 exec()를 통해 ls 프로그램으로 교체되어 디렉토리 목록을 출력한다. 둘 중 누가 먼저 실행될지는 운영체제의 스케줄링이 결정하기 때문에 실행할 때마다 결과의 순서가 바뀌는 것이다.

이 예제를 통해 확인할 수 있는 핵심은 fork()와 exec()가 함께 사용되는 기본적인 프로세스 생성 패턴이다. fork()로 새로운 프로세스를 생성하고, 자식 프로세스에서 exec()를 호출해 전혀 다른 프로그램으로 교체하는 이 구조가 운영체제에서 프로그램을 실행하는 가장 기본적인 방식이다. 앞서 살펴봤던 쉘이 사용자의 명령어를 실행하는 방식도 바로 이 패턴을 따른다. fork()와 exec()의 역할 분담, 그리고 실행 순서의 비결정성이라는 두 가지 개념을 이 예제 하나로 함께 확인할 수 있다.

exit() Lab

This example is code that lets you directly observe that a process's execution terminates immediately when exit() is called.

Looking at the output, only the content written before exit(0) appears; the printf("이 문장은 실행되지 않습니다\n") written below it does not. The moment exit() is called, the process terminates on the spot instantly, and the subsequent code never even gets the chance to execute.

The core takeaway from this example is that exit() is the definitive termination point of a process. Even without reaching the end of a function or the last line of the program, the exact moment exit() is called, is when the process ends. Just like exec() examined earlier, this example clearly demonstrates that exit() is also a function from which execution flow is completely blocked after it is called.

이 예제는 exit()가 호출되었을 때 프로세스의 실행이 즉시 종료됨을 직접 확인할 수 있는 코드다.

실행 결과를 보면 exit(0) 이전에 작성된 내용만 출력되고, 그 아래에 작성된 printf("이 문장은 실행되지 않습니다\n")는 출력되지 않는다. exit()가 호출되는 순간 프로세스의 실행이 그 자리에서 즉시 종료되고, 이후의 코드는 실행될 기회조차 얻지 못하기 때문이다.

이 예제를 통해 확인할 수 있는 핵심은 exit()가 프로세스의 명확한 종료 지점이라는 것이다. 함수의 끝이나 프로그램의 마지막 줄까지 도달하지 않더라도, exit()가 호출된 바로 그 순간이 프로세스가 끝나는 시점이 된다. 앞서 살펴본 exec()와 마찬가지로 exit() 역시 호출 이후의 흐름이 완전히 차단되는 함수라는 점을 이 예제가 명확하게 보여준다.

wait() Lab

This example is code that lets you directly observe the behavior of a parent process waiting for a child process to terminate when wait(NULL) is called.

Looking at the output, the "child process running" message appears first, followed by a brief pause, then the "parent running after child terminated" message. The parent process's output only appears after the child process has run first and fully terminated.

This is possible because the parent process called wait(NULL) and entered the waiting state until the child terminated. We said that after fork(), there's no guarantee which of the parent or child runs first — but by using wait(), the execution order is clearly guaranteed: the parent only moves to the next step after the child has fully terminated.

The core takeaway from this example is that wait() does more than simply wait for the child to terminate — it plays the role of synchronizing the execution order between the parent and child processes.

이 예제는 부모 프로세스가 wait(NULL)을 호출했을 때 자식 프로세스의 종료를 기다리는 동작을 직접 확인할 수 있는 코드다.

실행 결과를 보면 "자식 프로세스 실행" 메시지가 먼저 출력되고, 잠시 텀이 생긴 뒤에 "자식 종료 후 부모 실행" 메시지가 출력된다. 자식 프로세스가 먼저 실행되고 완전히 종료된 이후에야 부모 프로세스의 출력이 나타나는 것이다.

이것이 가능한 이유는 부모 프로세스가 wait(NULL)을 호출하며 자식 프로세스가 종료될 때까지 대기 상태에 들어갔기 때문이다. fork() 이후에는 부모와 자식 중 누가 먼저 실행될지 보장할 수 없다고 했는데, wait()를 사용함으로써 자식이 완전히 종료된 이후에 부모가 다음 단계로 넘어간다는 실행 순서가 명확하게 보장된다.

이 예제를 통해 확인할 수 있는 핵심은 wait()가 단순히 자식의 종료를 기다리는 것을 넘어, 부모와 자식 프로세스 사이의 실행 순서를 동기화하는 역할을 한다는 점이다.

이 예제는 부모 프로세스가 wait()를 호출하지 않았을 때 좀비 프로세스가 발생하는 상황을 직접 확인할 수 있는 코드다.

파일을 실행하면 자식 프로세스는 메시지를 출력하고 종료된다. 부모 프로세스는 sleep()으로 대기 상태에 들어간다. 겉으로 보기에는 자식 프로세스가 정상적으로 종료된 것처럼 보인다. 출력도 나왔고 실행도 끝났으니 아무 문제가 없어 보이는 것이다.

하지만 부모 프로세스 코드에 wait()가 없다. 자식 프로세스가 exit()로 종료되었지만 부모가 wait()를 호출하지 않았기 때문에 운영체제는 자식의 종료 정보를 부모에게 전달하지 못한 채 커널에 그대로 남겨두게 된다. 실행은 끝났지만 PCB가 완전히 정리되지 않은 상태, 즉 좀비 프로세스가 만들어진 것이다.

이 예제가 보여주는 핵심은 좀비 프로세스는 겉으로 드러나지 않는다는 점이다. 출력 결과만 봐서는 정상적으로 종료된 것과 구별이 되지 않는다. 하지만 내부적으로는 커널에 자식 프로세스의 정보가 남아 자원을 점유하고 있다. 이것이 wait()를 반드시 짝을 맞춰 사용해야 하는 이유다.

Software Testing - Part 1: Core Concepts / Part 2: Test Process (1) / Part 3: Test Process (2)

Heesu Noh — Tue, 24 Mar 2026 12:56:38 GMT

1️⃣ Core Concept of the Test
2️⃣ Test Process Details - 1
3️⃣ Test Process Details - 2

Core Concept of the Test

Impact of Software Defects

What is the purpose of testing? The software we develop is ultimately embedded in final products such as automobiles, aircraft, and mobile devices. A small mistake made by a developer becomes a fault, which then leads to a failure. These failures propagate to the higher-level subsystems that contain the software, then from the subsystem to the system, and finally to the end product, ultimately becoming a major hazard that leads to an accident. This results in harm; including casualties, economic losses, and environmental disasters. A small error originating in software can propagate through fault propagation and escalate into a critical problem. When a fault occurs due to a small human mistake, the primary goal of testing is to detect that fault. By discovering faults through testing, they can be contained rather than propagated to higher-level systems, ultimately reducing the likelihood of accidents. For this reason, testing must be conducted systematically by a third party based on predefined test cases. This is the purpose and role of testing.

[Reference] Difference Between Testing and Debugging

Testing and debugging are easily confused, but they are clearly distinct concepts with different purposes and roles.

The primary purpose of testing is to discover unknown faults. Debugging, on the other hand, aims to accurately correct already known faults identified through testing. There is also a difference in terms of who is responsible. Testing can be performed by internal team members, but a third party such as an external test team can discover more unknown faults by approaching the system from a different perspective. Debugging, however, is best handled by internal developers who are familiar with the system, as it requires locating and correcting known faults.
The key activities also differ. The core activity of testing is fault detection, and test cases must be prepared in advance to carry this out systematically. In debugging, the first step is fault localization; identifying the exact location of the fault, such as which bit in memory is affected or between which modules in a connected system the fault occurred. This is followed by fault identification, which involves determining the type of fault, such as whether it is a compilation error or a logical error. Finally, fault correction, the act of properly fixing the identified fault, is the concluding activity of debugging.

In conclusion, testing is the activity of finding faults without prior knowledge of what went wrong, while debugging is the activity of precisely fixing faults with full knowledge of what the problem is.

Error, Defect, Failure 용어

In the context of software quality, it is important to clearly distinguish between three terms: Error, Defect, and Failure.

An error is the concept that causes a defect and refers to a mistake made by a person, typically a developer or analyst. In other words, it refers to the incorrect human action itself. An error leads to a defect, also referred to as a fault or bug.
A defect is a flaw embedded in a product as a result of an error, and it becomes the root cause of failures or problems. That is, if an error is the human act, then a defect is the flaw left behind in the code or artifact as a consequence of that act.
A failure is the state of malfunction that manifests when the system is actually executed due to an underlying defect. In other words, a failure is the phenomenon in which a latent defect surfaces in the runtime environment.

In summary, a human mistake known as an error produces a defect embedded in the product, and that defect manifests as a failure during system execution. The three terms are linked in a cause-and-effect chain, and understanding each one clearly is fundamental to software quality management.

Example of Error, Defect, and Failure

The concepts of Error, Defect, and Failure in software quality engineering can be illustrated through a concrete example. Consider the following simple pseudocode: Speed = Distance / Time. In this code, Time is positioned in the denominator of the division operation. If the value of Time becomes 0, a divide-by-zero exception will occur.

Mapping the three terms to this scenario makes each concept clear. First, the error is the developer's failure to consider the case where Time could be 0 — a mistake that occurred in the developer's thinking. Next, the defect is the resulting absence of exception-handling code in the program to address the case where Time equals 0 — a flaw reflected in the code itself. Finally, the failure is the occurrence of a Divide By Zero Exception when the program is actually executed with a Time value of 0 — the actual malfunction of the system.

The reason software quality engineering distinguishes between these three concepts is to propose appropriate countermeasures at each stage. At the error stage, training and process improvements are needed to reduce human mistakes. At the defect stage, code reviews and static analysis can eliminate defects before they lead to failures. By clearly distinguishing each stage, it becomes possible to accurately identify the root cause of a problem and systematically establish preventive and corrective measures.

Common Misconceptions About Testing

There are three common misconceptions about software testing.

The first misconception is that testing proves the absence of defects. However, the fundamental purpose of testing is not to prove that there are no defects, but to discover as many unknown defects as possible. Testing is, in essence, an activity that demonstrates the existence of defects. To achieve this, testing should be performed by a third party rather than the developer who built the product, and it should be conducted across diverse environments. Testing across various operating systems such as Windows, iOS, and Android can uncover defects that the developer had not anticipated.

The second misconception is that testing is easy and that all defects can be found. If testing is viewed simply as checking outputs against inputs, it may appear straightforward. However, proper testing requires thorough planning, design, and analysis, as well as a deep understanding of the product under development. Therefore, testing is by no means an easy task, and testers must also possess sufficient knowledge and competence in development. Furthermore, finding all defects is practically impossible. In the era of artificial intelligence, programs can have hundreds of millions of parameters, making it infeasible to test every possible combination within a reasonable timeframe. For this reason, it is important to incorporate the concept of defect prevention from the outset. Through activities such as reviews conducted during the requirements analysis and design phases, defects should be prevented from propagating to subsequent stages before testing even begins.

The third misconception is that testing only needs to take place after the implementation, or coding, phase. From the perspective of the V-Model, however, testing should not begin after coding is complete. Rather, it must be initiated from the earliest stages of development, including requirements analysis and design. Test planning and preparation should begin before implementation starts, enabling quality to be managed systematically throughout the entire development lifecycle.

Test Process Details - 1

What is Systematic Testing?

To understand systematic testing, one must first think of the concept of PDCA. PDCA stands for Plan, Do, Check, and Act, and serves as the foundational framework for any organization to carry out its projects in a structured manner. Systematic testing refers to a state in which a process built upon this PDCA framework is established and followed throughout all testing activities.

1. Test Process from a PDCA Perspective

In the Plan phase, the overall test purpose, scope, schedule, and methods are established, and the features to be tested are selected. For example, in a shopping mall system, features such as member management and order history would be defined as test targets during this phase. Once the plan is established, test design follows, during which test cases are developed and test procedures are defined. From the perspective of the V-Model, test cases are derived based on requirements, architecture, and detailed design, meaning that test case development should begin immediately upon completion of the Plan phase. The test environment must also be prepared at this stage. This includes not only the environment in which the software operates independently, but also the environment in which it interfaces with related hardware devices, and even the real-world operating environment — for instance, if the software is embedded in a vehicle, testing must be conducted during actual driving conditions. Testing across diverse environments is essential to uncovering unknown defects. In particular, once coding is complete in the V-Model, there is likely insufficient time to plan and design tests. This is because the focus must shift to executing the actual tests on the running program. Therefore, it is essential to remember that test planning and design must be carried out in parallel with development in order to improve overall quality.

In the Do phase, the test cases developed during the Plan phase are actually executed and the test results are evaluated. This work can be automated using testing tools, which automatically assess results once test cases are provided as input.

In the Check and Act phases, the test results are analyzed and their adequacy is assessed. Based on the analysis, countermeasures are established and corrective actions are taken. If the identified issues are determined to stem not from defects themselves but from problems in the process or development environment, corrective action recommendations are issued accordingly.

In conclusion, it is essential to remember that systematic testing is not merely about executing tests, but rather an activity in which test planning, design, execution, evaluation, and improvement are carried out organically across the entire PDCA cycle.

[Reference] ISO/IEC/IEEE 29119 Testing Standard

ISO/IEC/IEEE 29119 is the most representative international standard in the field of software testing, jointly established by three international organizations: ISO, IEC, and IEEE. This standard defines a multi-layer test process to ensure that testing is conducted systematically and correctly.

The key emphasis of this standard is that the test process should not be addressed solely at the project level, but that a test process and its foundations must first be established at the organizational level. In other words, the standard requires a top-down structure in which the process and infrastructure for effective testing are first defined at the organizational level, cascaded down to the project and task management level, and ultimately executed by test engineers in accordance with those established criteria.

Specifically, this standard defines a multi-layer test process consisting of four layers, each following a top-down structure in which direction and criteria are passed from higher to lower levels.

The first layer is the organizational test process. At this level, test policies and test strategies that apply across the entire organization are established. This serves as the highest-level foundation that sets the direction and criteria for all subsequent testing activities.

The second layer is the project-level test management process. Based on the policies and strategies established at the organizational level, a test management process is constructed at the project level. This process includes three key activities: test planning, monitoring and control, and test completion.

The third layer is the test management process. Building upon the project-level test management process, this layer involves establishing a more granular test management process broken down by task, phase, and test type. The same three activities; test planning, monitoring and control, and test completion; are applied at this level as well. Although the composition of activities is identical between the second and third layers, they are distinct stages that differ in their scope and level of application.

The fourth layer is the dynamic test execution process. This is the stage in which the software is actually executed and tested based on the criteria and processes passed down from the upper layers. To carry out testing effectively at this stage, test design and implementation must first be completed, followed by the setup and maintenance of the test environment, after which actual test execution takes place.

The key point is that testing should not be viewed merely as an execution activity. Rather, direction and criteria must first be defined and communicated at the organizational and project levels before testing begins. Subsequent chapters will examine in detail what each of the organizational, test management, and dynamic test layers specifically covers.

2. Organizational Test Process

The organizational test process sits at a higher level than the testing carried out at the project level, and represents the stage at which the direction and criteria for testing across the entire organization are defined. This process consists of three key activities.

The first activity is organizational test specification development. This involves developing an organizational test policy specification and an organizational test strategy specification based on the organization's test objectives. For example, if quality achievement targets are set in stages, this activity would involve concretely developing the policies and strategies needed to achieve 50% coverage at stage one, 30% at stage two, and 100% at stage three, broken down to the level of unit testing, integration testing, and so forth.

The second activity is monitoring and control of organizational test specification utilization. This involves monitoring whether the organizational test specifications developed in the first activity are being effectively applied across projects and tasks within the organization, and exercising control when they are not being properly followed. Policies and strategies established at the organizational level must be applied to all subordinate projects, and appropriate control measures must be taken when this is not the case.

The third activity is organizational test specification update. No matter how well-crafted the initial policies and strategies may be, issues are likely to surface when they are applied to real projects. For instance, if shortcomings in an existing policy are revealed during unit test execution, that policy must be revised and improved. The core of this activity lies in continuously incorporating feedback and results from the application of the specifications at the project and task level, and iteratively improving the organizational test policy and strategy specifications.

In conclusion, the organizational test process is not a one-time activity of establishing policies and strategies, but rather a cyclical process of monitoring whether the developed specifications are being correctly applied in practice, and continuously improving them based on the outcomes.

Example of Organizational Test Policy and Strategy Specifications

The organizational test policy and strategy specifications are structured hierarchically, beginning with the highest-level policy specification and cascading down to increasingly detailed strategy specifications.

At the highest level sits the organizational test policy specification. This document contains the broadest set of criteria applicable to all testing activities across the organization, and includes the test purpose, test process, test organization and roles, referenced test standards, test asset management and reuse methods, and policies for test process evaluation and improvement.

Below this sits the organizational test strategy specification. This layer defines more granular test strategies based on the policy specification, and includes risk management related to testing, test selection and prioritization, test documentation, configuration management, defect management, the use of automation tools, and individual test strategies related to performance and security testing.

The next layer defines strategies by specific test type. This is where decisions are made regarding which testing methods will be applied in practice, including unit testing strategy, integration testing strategy, and system testing strategy.

At the lowest layer sit the most granular strategies, including project-level test strategies and individual test-level test strategies. At this layer, specific strategies are defined that apply directly to particular projects or individual test units.

In conclusion, the organizational test policy and strategy specifications represent a system designed to ensure that consistent test criteria are communicated from the organizational policy specification, which captures the overall direction of the organization - all the way down to the project and individual test level through a progressively detailed hierarchical structure.

Test Process Details - 2

1. Test Management Process

Having previously examined the organizational test process, we established that test policies and strategies are formulated in the form of specifications. The test management process is the process by which these established strategies are reflected in the actual projects and individual tasks within the organization, and by which the testing activities carried out within them are systematically managed.

The concept of "management" here is directly linked to the PDCA framework discussed earlier. When PDCA is applied to the test management process, it is structured as follows. First, a test plan is established, corresponding to the Plan phase. Actual test execution is then carried out in the dynamic test process as the Do phase. Whether the results are proceeding appropriately is verified and acted upon in the test monitoring and control phase, corresponding to Check and Act. Finally, upon completion of testing, a test completion report is produced.

Furthermore, if changes arise during the monitoring and control phase, the test plan must also be continuously updated. A plan is not a fixed document once established; rather, it is a living document that must be consistently revised to reflect test execution results and changes in circumstances.

In conclusion, the test management process consists of test planning, test monitoring and control, and test completion, and is a PDCA-based management activity designed to ensure that the organization's test policies and strategies are systematically executed and managed at the project and task level.

Test Management Process; Detailed Composition

The test management process consists of three key activities: test planning, test monitoring and control, and test completion.

In the first activity, test planning, the scope and targets of testing are identified at the project and task level, and the test strategy defined within the organizational test process is established by referencing it as an input.

In the second activity, test monitoring and control, the execution of the dynamic test process is monitored based on the test plan, and the current state of testing is continuously tracked. If issues arise during execution, the testing activities must be appropriately controlled and necessary corrective actions must be taken.

In the third activity, test completion, the artifacts generated after testing is concluded are systematically managed. Since these artifacts may be reused in the future, they must be properly stored, and the test environment must also be organized with reusability in mind. Once these activities are completed, a final test closure report is produced.

Detailed Activities of Test Planning

Test planning is not merely a matter of setting a schedule; rather, it consists of the following highly systematic detailed activities.

The first activity is understanding the context. Before planning the tests, it is essential to first understand the overall situation, including the project's objectives, requirements, relevant stakeholders, and overall schedule. Through this understanding, the scope of testing is clarified and the direction for structuring the test plan is formulated. This process yields a preliminary test plan and development schedule.

The second activity is risk identification and analysis. Risk management is a critical element of project management. During the course of testing, a wide range of risk factors may arise, such as changes in requirements, shifts in priorities, or the replacement of personnel. These risks must be identified and analyzed in advance, and methods for mitigating them must be derived. The analyzed risks and their mitigation strategies are then incorporated into the test strategy design.

The third activity is test strategy design and resource determination. Based on the risk analysis, the test strategy is designed, and the human resources and detailed schedule to be allocated are determined accordingly.

The fourth activity is drafting the test plan. Based on the preceding activities, an initial draft of the test plan is produced. However, the plan at this stage is not yet a finalized document and must go through a review and consensus process involving the relevant stakeholders.

Finally, through the review and consensus process, an agreed-upon test plan is produced and shared with all relevant stakeholders.

In conclusion, it is essential to remember that test planning is not a simple preparatory step, but rather a highly systematic process that spans from understanding the context through risk analysis, strategy design, resource determination, plan drafting, and review and consensus.

Detailed Activities of Test Monitoring and Control

Test monitoring and control is the process of verifying that testing is proceeding correctly in accordance with the established test plan, and taking appropriate corrective action when necessary.

The input to this process is the test plan. Once the test plan is received, the setup required for monitoring and control is configured accordingly. The content of tests executed in the dynamic test execution phase is then compared against the plan, and test measurement is performed. Based on these measurement results, monitoring is carried out. During monitoring, the progress of testing is continuously tracked, and if any deviation from the plan is identified, control activities are initiated. For example, if unit testing has not been performed or integration testing has been omitted, appropriate corrective measures are taken for those activities that have deviated from the plan.

Reporting must also be carried out on these activities. A representative indicator for test status reporting is test case effectiveness, which is expressed as a percentage representing the ratio of the number of defects found to the number of test cases executed. Furthermore, if there are 100 requirements, the degree to which the test cases satisfy those requirements can be measured as a percentage, providing insight into how well the product meets its requirements.

In conclusion, test monitoring and control is not merely about confirming whether testing has been completed, but rather a systematic process of quantitatively measuring and reporting test progress and quality satisfaction levels based on data.

Detailed Activities of Test Completion

Even after testing has been successfully conducted, the test completion phase involves a set of systematic preparatory activities for future testing.

The first activity is verification of the test asset repository. The assets generated during the testing process are reviewed and organized, with decisions made regarding where and how they will be stored. This is done to ensure that these assets can be reused in future testing efforts.

The second activity is test environment restoration. The environment that was configured for testing is restored to its original state so that it can be utilized again in future tests.

The third activity is retrospective review and lessons learned. The strengths and shortcomings of the current round of testing are reflected upon and documented, so that they can be used to improve the quality of future testing activities.

Finally, the test completion report is produced. The report includes a test summary, a comparison of planned versus actual results, test effectiveness metrics, and requirements satisfaction levels. Furthermore, since defects that were not discovered during testing may still remain in the software, the report must also address residual risk identification, post-test countermeasures, test artifact management, a list of reusable assets, and lessons learned, all compiled and managed in report form.

In conclusion, the test completion phase is not simply about wrapping up testing, but consists of a highly systematic set of activities encompassing asset storage, environment restoration, retrospective review, and completion reporting. It is important to remember that the entire process covered so far; test planning, test monitoring and control, and test completion, constitutes a PDCA-based process for systematically managing testing.

2. Dynamic Test Process

Having examined test planning at the task level, the dynamic test process consists of a series of activities carried out to actually execute testing based on that plan.

The first activity is test design and implementation. Test cases are developed based on requirements documents, and the test cases and procedures required for actual test execution are concretely developed at this stage. The second activity is test environment setup and maintenance. The environment in which testing will actually take place is configured and maintained in accordance with the test environment requirements. The appropriate environment must be set up in advance — whether testing will be conducted in a PC environment, an actual vehicle environment, or otherwise — to suit the characteristics of the test target. The third activity is test execution. Tests are executed based on the test specification. The manner in which test results are handled varies depending on whether the issue identified was previously known. If it is a known issue, it must be resolved, and the process of determining how to address the defect continues through test result reporting.

One important point concerns when the test design and environment configuration activities that precede the actual Do phase of test execution take place. It is essential to remember that when the requirements analysis and architecture design processes on the left side of the V-Model are being carried out, test design and environment configuration are also conducted in parallel. The core principle of the dynamic test process is that test preparation must begin from the earliest stages of development, not just immediately before test execution.

Dynamic Test Process; Detailed Composition

The dynamic test process consists of four activities: test design and implementation, test environment setup and maintenance, test execution, and test result reporting.

In the first activity, test design and implementation, test cases and test procedures are developed in accordance with the test scope and test strategy identified in the test plan. Specifically, this involves analyzing the test basis. The development artifacts used to conduct testing, and deriving test requirements, test conditions, test coverage criteria, and test cases.

In the second activity, test environment setup and maintenance, the environment and data required for actual test execution must be prepared. The test environment encompasses not only a standalone software environment such as a PC, but also embedded environments that include hardware, and even vehicle environments, depending on the characteristics of the test target.

In the third activity, test execution, actual tests are run using the previously developed test procedures, and the results of each test execution are recorded in the form of Pass or Fail.

In the fourth activity, test result reporting, defects are identified and recorded based on an analysis of the test execution results. This ensures that discovered defects are systematically managed and that the necessary follow-up actions can be initiated.

[Reference] Test Basis: Concept and Examples

The test basis refers to the development artifacts required for conducting testing; specifically, the documents and materials that serve as the foundation for deriving test cases and test procedures.

Viewed through the lens of the V-Model, the development artifacts on the left side correspond to the test activities on the right side. The test basis utilized at each test level is as follows.

For unit testing, which corresponds to the detailed design phase, the detailed design document serves as the test basis. Since detailed design is concrete to a degree comparable to actual source code, the source code itself is also a representative test basis artifact for unit testing. Integration testing verifies whether the interfaces between individual modules are appropriate; accordingly, the architecture design document that forms the basis for this verification serves as the test basis. System testing uses the requirements specification, the output of requirements analysis; as its basis. Finally, since acceptance testing is the stage at which the highest-level requirements of the customer and end users are verified, the requirements definition document and use case definition document serve as the test basis.

In conclusion, the test basis comprises the artifacts that must be referenced at each test level in order to derive test cases and test procedures, and it is important to understand that the development artifacts on the left side of the V-Model directly provide the foundation for the test activities on the right.

Linux 101: My Notes on Users, Permissions, and Getting Things Done

Heesu Noh — Sat, 21 Mar 2026 16:21:24 GMT

1️⃣ Using Linux, Installing Programs
2️⃣ Using Commands

Using Linux, Installing Programs

1. Getting Started

When Linux starts, a login screen appears. Entering the login details used during initial setup and pressing Enter will prompt for a password. The password is not displayed on screen. In the case of Ubuntu, information such as the kernel version and current system status is shown. This information cannot be changed by the user.

In Linux/Unix, uppercase and lowercase letters are treated differently. Additionally, the number 0, the letter O, the number 1, the pipe symbol |, lowercase l, and uppercase I all look similar but are different characters. The same applies to the backtick ` and the single quote '. Furthermore, computers convert human-readable characters into numbers for processing. The language humans recognize is called a high-level language, while the language machines recognize is called a low-level language. Humans interpret symbols based on their everyday meaning, whereas computers convert them into numbers. For example, the letter A (a) that humans recognize is converted by computers to 65 and 97 respectively in ASCII code.

Linux/Unix Accounts: Groups and Users Linux is an environment where multiple people can use a single computer simultaneously. Those users may be team members or administrators, making it crucial to manage who is allowed to do what. The units used to manage this are users and groups. A user is simply a single login account. An account is required to access the system, and permissions are granted accordingly. Individual permissions can be assigned directly to a user, and these take priority over group permissions. For example, even if a group is blocked from accessing file A, if user kim is granted access directly, kim can still access it. A group is a unit for managing permissions by bundling users together. A single user can belong to multiple groups simultaneously. A group itself cannot execute anything or log in; there must be a user within it for it to have any meaning. As an analogy, a group is like a "type of access pass" and a user is "the person holding that pass." The pass itself does not open the door. When a new account is created, a group with the same name is automatically created alongside it. Create account 'kim' → User 'kim' is created → Group 'kim' is also automatically created The permission priority is as follows: individual (user) permissions take precedence over group permissions. Permissions can be assigned to groups in bulk, and individual permissions can override them when exceptions are needed.

root root is the superuser account with full administrative privileges over the entire system. Every file can be read, modified, and deleted, every setting can be changed, and other user accounts can be created or removed. Because the name "root" is publicly known and identical across every Linux system in the world, it is the primary target for hacking. Distributions like Ubuntu block direct root login by default and use the sudo command instead. Root login can be enabled for convenience, but doing so exposes the system to hacking. The root user's home directory is located at /root. It is important to be aware of the dual meaning of the term "root directory". It can refer to the root account's home directory, but it can also refer to the top-level directory of the entire filesystem, /.

sudo sudo is not available to everyone; only users who have been granted the permission can use it. The account created during the initial installation of Linux is automatically granted sudo privileges, but accounts created afterward do not have sudo access by default and must be explicitly granted it by an administrator. The users who can use sudo and the scope of their access are managed in a file called /etc/sudoers. This file allows fine-grained control over which users can run sudo and which specific commands they are permitted to execute. An important point here is that having sudo privileges does not mean inheriting all of root's permissions. Logging in directly as root grants full access to all system privileges, whereas sudo allows only the permissions explicitly defined in /etc/sudoers. In other words, sudo is not a system that copies the entire set of root's keys; it lends only the specific keys needed for a given situation.

Prompt: The prompt is the text displayed at the beginning of the line in the terminal where commands are entered. Its basic format is username@computername:directory$, and this single line contains everything about who the current user is, where they are, and what permissions they hold. The username is the name of the account currently logged in. Since Linux allows switching to a different account during use; including switching to root if the user has the necessary permissions and knows the password, the current username is always displayed in the prompt so it is immediately clear who is performing actions. The computer name is the name of the machine currently connected to, displayed to distinguish which computer is being used when remotely connected to another machine. The directory indicates the current working folder, where ~ is a special symbol representing the logged-in user's home directory. For example, the home directory of the account test is /home/test, which is abbreviated as . The symbol at the very end of the prompt indicates the current permission level. # is displayed when the user has root privileges, and $ is displayed for regular users. The specific symbol may vary depending on the shell being used, but Ubuntu uses $ by default. For example, if the terminal displays test@test:$, the currently logged-in user is test, the connected computer name is also test, the current location is test's home directory, and the $ symbol confirms regular user privileges. If the user switches to root, the prompt changes to root@test:~#, with both the username and the trailing symbol changing simultaneously, making the privilege change immediately visible.

Logging Out (logout, exit) Linux/Unix systems are rarely powered off and are often running 24 hours a day. Because multiple users can use the system simultaneously, shutting down after one user finishes would inconvenience others still using it. However, when an administrator needs to stop the operating system, commands such as shutdown and poweroff are used. These commands can only be executed with root privileges. logout and exit terminate only the current user's session visible on screen, whereas shutdown and poweroff bring the entire system down, making it completely inaccessible.

Adding and Deleting Accounts: Accounts are added using useradd and deleted using userdel. However, these commands only perform basic operations; meaning they do not automatically create a home directory or password for the account. A home directory does not refer to the /home folder itself, but rather to the personal space represented by ~ for each account. While a personal computer is typically owned and used entirely by one person, Linux/Unix is designed for multiple users to use a single system simultaneously, so storage space is divided and shared among users. The personal space allocated to each account is what is referred to as a home directory. Therefore, an account's home directory means the personal space where that account can store and use files, which is a different concept from the /home directory where all users' home directories are collectively stored. This distinction is important to keep in mind. There are also adduser and deluser commands, which must be separately installed as they are not built into the operating system. Unlike useradd and userdel, these include additional functionality such as prompting for a password during account creation.

Built-in Manual Linux/Unix provides a built-in manual that can be accessed by typing commands directly in the terminal. It is not included by default due to storage constraints and must be installed separately. The manual is divided into sections numbered 1 through 9, each covering a different type of content such as commands, system calls, and configuration files. The section numbers do not need to be memorized as they can be looked up easily when needed.

2. Copying and Installation

Copying vs. Installing Copying is simply placing identical content in a different location, much like photocopying a document. The content is the same, but the copy may not be usable depending on the computer environment. Installing goes beyond copying by performing additional procedures; it makes the software usable in the new environment by supplementing what is missing and incorporating information specific to the installation location.

The limitations of copying can be understood through an analogy. If a wireless internet device used at home is physically moved to an office, it may or may not work, because the device was configured for the home internet connection and only the location has changed. The same applies to software: copying source code to a new location does not mean it can be run immediately.

The process from copying to execution is as follows: the source code is copied, a compiler and libraries are installed separately, paths and linked files are configured for the environment, and only after all of these steps can the executable (binary file) be run.

Installation automatically handles all the steps required to make software usable anywhere. Returning to the earlier analogy, it is like pulling a dedicated line within the office building and performing a proper installation.

The installation process works as follows: the executable (binary file) is automatically copied to the appropriate location, and a script for environment configuration; including path assignment and linked file specification - is executed automatically, leaving the software ready to use immediately.

In summary, copying is the act of moving files, while installing is the process of automatically performing all the necessary configurations and procedures to make the software actually function in that environment. This is why simply copying a program is not sufficient, and depending on the purpose, the process is divided into full installation and simply copying the executable file.

3. Program

How to Use apt When installing a program in Ubuntu, the following command is used. linux@linux:~$ sudo apt install openssh-server Running this command first prompts for the sudo password, which is the password set when the account was first created. Once the password is entered, the installation proceeds. At this point, apt does not simply install openssh-server alone - it automatically installs all the libraries and additional programs required for that program to function. At the end of the installation process, a [Y/N] prompt appears asking for confirmation of these additional installations.

In other words, apt is a tool that installs all the files and libraries needed to use a program in a single operation. Taking openssh-server as an example, there are multiple dependency files required to run this program, and rather than the user having to find and install each one individually, apt identifies and installs them all at once.

Package List Management: apt retrieves packages by referencing package repository information. On Ubuntu, the files /etc/apt/sources.list and /etc/apt/sources.list.d/ubuntu.sources store repository information, and apt uses this to fetch the required packages from those repositories. The version information in the repository list can be updated with the sudo apt update command. It is important to distinguish between types of updates: the update prompted by the system at first login is related to critical security patches from the operating system's perspective, whereas the update performed manually with sudo apt update refreshes the version information of the package list. Since not everything needs to be kept up to date at all times, updates can be performed selectively as needed. apt is a program that manages the installation and removal of software on Debian-based Linux distributions.

Installing a Package; sudo apt install package-name

Removing a Package: sudo apt remove package-name.. If it is necessary to also delete files created during the installation of a package,

purge is used instead of remove. sudo apt purge package-name The difference between remove and purge is that remove deletes only the package itself, while purge deletes both the package and all associated files created during installation. Upgrading a Package To upgrade installed packages to the latest version, the following two commands are executed in order.

sudo apt update / sudo apt upgrade: apt update first refreshes the version information in the package list, after which apt upgrade upgrades the actual packages to their latest versions.

5. How to use dpkg

While apt automatically manages everything required for installation, dpkg is a tool that installs only a single package bundle. Unlike apt, it does not automatically install dependent libraries or additional programs - it processes only the one specified package. It is important not to confuse dpkg with compression. Compression bundles multiple files together for the purpose of transfer or storage. dpkg, on the other hand, manages files necessary for running a program as a single package unit. The fundamental difference is that dpkg manages units of executable programs, not simple file bundles.

Using Commands

passwd

The passwd command is used to change the password of an account. When the prompt displays linux@linux:~$, the account name to be changed corresponds to the first linux at the very beginning of the prompt.

whoami

There are situations where it is necessary to confirm which account is currently being used when executing commands. One such example is when an administrator connects as a regular user in order to provide technical support. Running the whoami command immediately displays the name of the account currently in use.

id

Running the id command outputs more detailed information compared to whoami. It displays comprehensive information about the currently active account, including user and group data. The numbers shown in the output are important, as files are represented by numbers during the process of copying and moving them.

who

The who command displays a list of accounts currently logged into the system. On a personal Ubuntu Linux system used by a single person, only one account will be shown. However, if multiple accounts are logged in simultaneously or multiple terminals are open, all logged-in accounts will be listed.

The output of the who command also includes an item indicating the method of login, which is divided into two types. tty (teletypewriter) refers to a direct physical connection, while pts (pseudo-terminal slave) refers to a virtual connection. In other words, if the user is sitting directly in front of the computer, the connection is shown as tty, and if connected remotely, it is shown as pts.

w

The w command shows what is currently happening on the system. The output is displayed in the following format.

13:12:33 up 21min,  2 users,  load average: 0.00, 0.00, 0.00
USER     TTY FROM             LOGIN@  IDLE  JCPU  PCPU  WHAT

The current time, system uptime, number of logged-in users, and system load average are all displayed on a single line. Below that, detailed information for each logged-in account is listed. jcpu refers to the total CPU time used by all processes running under that account, while pcpu refers to the CPU time used by the process currently shown in the what column.

These commands do not need to be memorized in detail. Simply knowing that they exist is enough, as they can easily be looked up through a search when needed.

date

Running the date command outputs the time configured on the system. An important point to note is that this time does not necessarily match the actual real-world time. Since the output reflects the time set on the system, it may differ from the actual time if a user has manually changed it. This works the same way as the time settings on a mobile phone; it can be set to automatically sync with the current time, or manually configured to a desired time by the user.

touch

The touch command updates the access time and modification time of a file to the current time. Options that begin with - are called flags. The main flags available for touch are as follows.

-t is used to change the time to a user-specified time rather than the current time. -m changes only the modification time of the file. -a changes only the access time of the file.

Detailed descriptions of each flag can be found in the built-in manual using man touch, or through external websites. There is no need to memorize the specific options - they can be looked up and applied as needed.

printenv, env

printenv and env are commands used to check what environment the computer is currently operating in. Running either command outputs the current environment information, and it is also possible to hide an ID or change a group for security purposes. The env command is used in the following format.

linux@linux:~$ env [NAME=VALUE] ... [COMMAND [ARG] ...]

This sets the environment variable specified by NAME to the value of VALUE and then executes COMMAND. In other words, rather than permanently changing the entire system environment, it applies the specified environment value only for the duration of that particular command execution.

ls

ls is the most commonly used command for files and directories, displaying the contents of the current directory. A variety of flags can be used alongside it.

Running ls -F appends an indicator to each file name to show the type of file. Running ls without -F displays only the file names without any indicators.

The reason indicators were introduced is that when ls first appeared, color display was not supported and output was shown in a single color, making it difficult to distinguish file types. Color support is now standard, so indicators are no longer strictly necessary, but the feature is maintained because some users still rely on it and there is no reason to remove it. Preserving existing functionality for the sake of compatibility is a general principle in computing.

. and ..

. refers to the current directory, while .. is a relative notation referring to the directory one level above the current directory.

Permissions and File Types

Running ls -lh /etc outputs detailed information about files and directories. The character displayed at the very beginning of each line indicates the type of file. d indicates a directory, l (lowercase L) indicates a symbolic link, and - indicates a regular file. For a simple view, ls -F can be used, and for a detailed view, ls -lh is the appropriate option.

Permissions are divided among four targets: u for the owner (user), g for the group, o for others, and a for all.

When a file's permissions are displayed as drwxr-xr-x, the leading d indicates the file type, and the remaining characters are read in groups of three.

rwx  /  r-x  /  r-x
 u       g       o

The meaning of each character is as follows: r means read permission (4), w means write permission (2), and x means execute permission (1).

In binary notation, the values corresponding to r, w, and x are 4, 2, and 1 respectively. Permissions can be set by specifying the sum of these values.

Full permission  : 4+2+1 = 7  →  rwx
Read/Write only  : 4+2   = 6  →  rw-
Read only        : 4     = 4  →  r--

For example, setting permissions to 644 results in rw-r--r--, and setting them to 777 results in rwxrwxrwx. More complex administrator-level permission features also exist using s and t, but at this stage it is sufficient to simply be aware that such features exist.

pwd

The pwd command displays the current working directory. When permissions are frequently changed or tasks accumulate, it can become easy to lose track of the current location. In practice, pwd is used frequently.

cd

The cd command is used to navigate between directories. cd . moves to the current directory, meaning the location does not change. cd .. moves one level up to the parent directory. cd ~ moves to the home directory of the currently logged-in user.

The reason ~ is used is that it allows relative path expression. For example, to navigate to the bin directory under a specific user's home directory, cd ~/bin can be used. Without ~, the full path such as cd /home/linux/bin would need to be typed out each time. When there are many users across various environments, each user's home directory path is different, making absolute paths impractical. Using ~/bin universally implies each user's own bin directory under their respective home directory, which is why ~ is an essential notation.

du

While Windows and Mac allow users to easily check storage usage through a file explorer, Linux Ubuntu uses the du command for this purpose. du stands for disk usage and shows how much disk space the current directory or file occupies.

df

The df command is used to check how much of the filesystem is being used. For example, if a 1TB disk is installed and 200GB is in use, the remaining 800GB will not be shown unless it has been mounted. A volume must be mounted before it can be recognized and used by the system.

The main flags are as follows: -a displays all filesystems including all types, and -h displays file sizes in a human-readable format. For example, using the -h option outputs sizes in units such as KB, MB, and GB instead of bytes.

mkdir, rmdir

mkdir is used to create a directory.

rmdir is used to delete a directory. Adding the -p flag allows deletion of both parent and child directories simultaneously.

cp

The cp command is used to copy files, creating an exact duplicate of the original. When copying, there are important points to note regarding file names. If the files are in different directories, the original and the copy may share the same name. However, within the same directory, two files cannot have the same name, so the copy must be given a different name. If a file with the same name already exists in the destination directory - for example, when copying a src file to a dst file and a file named dst already exists, the system will ask whether to overwrite it. This is an area that requires particular care when using Linux. Windows and Mac display overwrite warnings by default, whereas Linux may overwrite without warning depending on the settings. There is a difference in terms of efficiency, but it is important to judge which approach is more appropriate depending on the situation.

mv

The mv command is used to move a file or directory from its current location to another. If the destination is a different directory, the file is fully moved to that location. If used within the same directory, it can also serve as a way to rename a file without actually moving it.

rm

Directories can be deleted using rmdir. The rm command is also used for deletion.

chown, chmod

chown is used to change the owner of a file or directory. For example, it can be used to change the owner from root to test, or from test to root. Because this involves changing ownership, the user must have the appropriate permissions to execute it.

chmod is used to change the execution permissions of a file or directory. Permissions can be set using either the numeric method or the rwx character method. The symbols used in the character method have the following meanings: + adds a permission, - removes a permission, and = assigns only the specified permissions while removing any that are not explicitly stated. For example, u+rw means adding read (r) and write (w) permissions to the owner (u).

Summary

Linux is an environment where multiple users can work simultaneously, making permission management critically important. Because root holds full control over the entire system, a successful attack could expose everything, resulting in a serious security breach. Anyone in a position to manage root privileges must always be aware of the responsibility that comes with it.

When copying or moving files, it is essential to develop the habit of verifying whether overwriting an existing file is intentional. Accidental overwrites are often difficult or impossible to undo.

There are multiple ways to install programs. While it is possible to compile source code directly and copy it, this approach is difficult to manage and time-consuming. For this reason, Ubuntu primarily uses package management tools such as apt and dpkg. Among these, apt is the most widely used. At this stage, it is sufficient to know the names of packages and how to perform updates.

Inside the Computer: Hardware Composition and Program Data Processing

Heesu Noh — Thu, 19 Mar 2026 12:50:32 GMT

1️⃣ Composition of computer hardware
2️⃣ Processing of program data

1️⃣ Composition of computer hardware

Four Major Components of a computer
: Processor (CPU), Memory, System Bus, and I/O Devices. All components are connected through the system bus and operate together as a single system.

CPU: The central unit that performs calculations and processes instructions Memory; A space where programs and data needed during CPU operations are temporarily stored. The CPU can only directly process data that is loaded into memory System Bus; A common pathway through which the CPU, memory, and I/O devices exchange data I/O Devices; All devices used to interact with the outside world, including keyboards, mice, disks, monitors, and network devices.

Hardware Connection Structure Here is a structural overview of how each component is actually connected.

Inside the CPU: Composed of the ALU, registers, and control unit, it not only performs calculations but also controls the order and timing of instructions
Memory (RAM): Holds currently running programs and data. The CPU exchanges data with memory through the system bus
I/O Devices: Not directly connected to the CPU, but indirectly connected through an I/O controller

The most critical point is that all components are connected through the system bus, and all instruction delivery and addressing takes place through this pathway. The operating system uses this structure to utilize the CPU, allocate memory, and control I/O devices.

Von Neumann Architecture This is the most representative computer architecture followed by the majority of computers today. Two Most Important Characteristics First, the CPU, memory, I/O devices, and storage devices are all connected through a single bus.
Second, programs are stored in memory just like data. Because the CPU interprets the contents of memory as instructions and executes them, a program must be loaded into memory before it can run.
Why Is This Important? This is because the role of the operating system, which will be covered later, is centered around this memory. The operating system regulates execution order, allocates resources, and manages the system based on the programs loaded into memory.
Key Perspective Rather than focusing on the historical background of Von Neumann, the goal is simply to understand the concept that "a computer executes programs centered around memory."

2. Processor(CPU) and Registers

Processor (CPU): The CPU is the entity that actually executes programs stored in memory. It is the central unit in a computer that executes program instructions, interpreting commands stored in memory, performing the necessary operations, and controlling the overall flow of execution. The CPU also directly retrieves programs and data stored in memory, processes them, and stores the results back in memory. In other words, if memory is the space that stores programs, the CPU is the entity that actually executes them.

Components of the Processor: The CPU is composed of three elements. Registers, ALU (Arithmetic Logic Unit), and the control unit; each playing a different role in processing a single instruction.
Registers are extremely fast temporary storage spaces inside the CPU that temporarily hold data and instructions retrieved from memory, as well as intermediate results of operations. Registers are the reason the CPU can work so quickly.
The ALU (Arithmetic Logic Unit) is the part responsible for actual calculations, performing arithmetic operations such as addition and subtraction, as well as logical operations such as comparisons.
The Control Unit decides which instruction to execute and directs the registers and ALU on when and how to operate through control signals. It manages the overall flow through the internal bus.
In summary, registers handle storage, the ALU handles computation, and the control unit handles control; these three elements work together to execute a single instruction.

Registers are the storage space the CPU uses at the moment it executes an instruction. Values needed for execution, data required for operations, and intermediate results produced during calculations are all temporarily stored here. Because it would take too long for the CPU to travel back and forth to memory every time it works, information that is immediately needed is loaded into registers for processing.
Registers are therefore most directly connected to the flow of program execution and are the first storage space the CPU accesses. However, since they exist inside the CPU, their number and capacity are very limited; but in return, they are extremely fast.

Components of Registers There are multiple registers inside the CPU, and each register plays a different role during the instruction execution process.
PC (Program Counter) Stores the address of the next instruction to be executed. The CPU uses this value to determine where to fetch the next instruction
IR (Instruction Register) Stores the currently executing instruction fetched from memory
MAR (Memory Address Register) Stores the memory address to be accessed
MBR (Memory Buffer Register) Temporarily stores the instruction or data value read from memory
ACC / DR Stores data used in operations and the results of those operations.
Actual arithmetic and logical operations are performed by the ALU These registers and memory are connected through the system bus, and the CPU executes programs by exchanging instructions and data between registers and memory. Ultimately, this structure represents the core of the entire execution flow in which the CPU fetches → interprets → computes → and stores the results of instructions.

3.Cache and Memory

Cache is a high-speed memory located between the CPU and main memory. While the CPU has a very fast processing speed, main memory is relatively slow to access. This speed difference often causes the CPU to stall while waiting for memory to respond, and cache memory is used to alleviate this problem.
Cache is a space that pre-stores instructions and data frequently used by the CPU, allowing it to retrieve needed data directly from the nearby cache rather than going all the way to main memory. In summary, cache is a type of memory, but it should be understood as a performance-oriented structural element designed to help the CPU operate faster.

Memory is a device that stores currently running programs and data. When a program is executed, the program and the data it needs must be loaded into memory, and the CPU reads from this memory to carry out instructions. In other words, memory is the workspace where the CPU performs its operations while the computer is running. When designing a computer system, memory and storage devices are typically selected based on three criteria.
The first is speed; how quickly data can be read and written.
The second is cost; how much it costs per unit of capacity.
The third is volatility; whether data is retained when the power is turned off. Since no single storage device can satisfy all three criteria simultaneously, computer systems combine these characteristics in a hierarchical structure.

Main memory is the representative storage device that the CPU can directly access, and along with registers, it is the memory most closely connected to the CPU. When executing a program, the program's instructions and data must be located in main memory. Anything not in memory cannot be accessed by the CPU and therefore cannot be executed.
Main memory also serves to store intermediate results and temporary data generated during program execution. Main memory is generally implemented using DRAM technology, making it a volatile memory that can only retain data while power is supplied. Therefore, when the power is turned off, all contents of main memory are lost. As a result, main memory is a core storage space for program execution, but it is not a storage device suited for long-term data retention.

Memory Hierarchy Computer memory is not a single layer but a structure divided into multiple layers. The higher up in the hierarchy, the closer to the CPU; meaning faster speed but smaller capacity. The lower down, the slower the speed, but the larger the capacity and the lower the cost.
The CPU can directly access data in registers, cache, and main memory, but cannot directly access secondary storage. Therefore, programs stored in secondary storage must first be transferred to main memory before they can be executed. The reason for using this hierarchical structure is simple — because it is not possible to make all data storage both fast and cheap, the hierarchy strikes a balance between speed and cost in order to maximize CPU performance.
System Bus The hardware components inside a computer do not operate independently; They work together by exchanging data with one another. The system bus is the pathway that connects the CPU, main memory, and I/O devices into a single system. Rather than connecting each device directly to one another, the system bus is structured so that all devices exchange data and control signals through a common pathway. Whether the CPU is fetching instructions from memory, receiving input data from an I/O device, or storing processed results, all data movement takes place through the system bus. In short, the system bus is the connection structure that ties all hardware inside the computer together into one unified system.

System Bus Signal Types Although the system bus appears to be a single pathway, it is actually divided based on the type of information being transmitted. Depending on the nature of the signals being carried, it is divided into three types: address bus, data bus, and control bus. The address bus is the pathway that tells the CPU where to access. It specifies which location in memory or among devices is to be used. The data bus is the pathway through which actual instructions and data travel. The content being processed moves between the CPU, memory, and I/O devices through this bus. The control bus carries signals that control the manner and sequence of operations, such as read operations, write operations, I/O requests, and whether a task has started or completed.
In summary, the address bus handles where, the data bus handles what, and the control bus handles how — these three buses work together to ensure proper communication between the CPU, memory, and I/O devices.
I/O Devices The CPU cannot communicate with users through main memory alone; this is where I/O devices come in. I/O devices are hardware that connects the computer with the user or the external environment. They serve roles such as receiving input via a keyboard, displaying results on a monitor, and storing or transmitting data through disks or networks. These I/O devices do not connect directly to the CPU or main memory, but instead exchange data through the system bus.
They are broadly divided into three categories based on their role.
Input Devices: Keyboard, Mouse, Camera
Output Devices: Monitor, Printer, Speaker
Storage / Communication: Disk (SSD/HDD), Network Card

In summary, I/O devices are the components that allow information processed inside the computer to connect with the outside world. I/O Devices and the Operating System I/O devices are extremely diverse and complex in terms of hardware. Their operating methods, speeds, and control mechanisms are all different. If application programs had to directly control every device individually, they would become enormously complex. This is why the operating system takes on this role. Because the operating system manages I/O devices with different characteristics in a consistent manner, application programs do not need to worry about the specific operating methods of each device; they simply work using common operations such as read, write, and output. The actual device control is carried out by the operating system on their behalf. Ultimately, although I/O devices are diverse and complex at the hardware level, the operating system hides that complexity, allowing application programs to operate in the same way regardless of the type of device.

2️⃣ Processing of program and data

1) Command and Data

Information handled by a computer is broadly divided into data and command. Data refers to values that enter through input devices or are processed during program execution, while instructions are directives telling the CPU what tasks to actually perform; such as "add these values," "store this," or "move to the next step." An important point here is that a program is not one large task, but rather a collection of many individual instructions. These commands are stored at specific memory addresses, and the CPU fetches and executes them one by one in order. In other words, the CPU reads instructions stored in memory one at a time, and uses the necessary data alongside them to carry out the entire program.

CPU command - Level Operation; The CPU processes one instruction at a time, and each instruction goes through a defined set of steps. First, if the required data is in main memory, it is brought into a register through the bus; this is the preparation phase for performing an operation. Next, the value stored in the register is passed to the ALU, where the actual operation such as ADD is performed. Once the operation is complete, the result is either stored back in a register or, if necessary, moved to memory. The control unit manages the sequence of this entire flow, and the CPU repeats this process for every single command.

In summary, the CPU executes a program by continuously repeating the basic cycle of fetching data, computing, and moving the result. Instruction Execution Cycle The following is the process the CPU goes through when executing a single instruction. Regardless of which command is being executed, the CPU always repeats the same defined sequence.

1. Command Fetch; Based on the address pointed to by the Program Counter (PC), the next command to be executed is retrieved from memory and stored in the Instruction Register (IR)
2. Instruction Decode / PC Update; The CPU interprets what the fetched instruction means, and updates the PC to point to the next instruction. This is the step that determines what to do and where to go next -
3. Operand Fetch. If the instruction requires data, the relevant values are retrieved from memory or registers Instruction Execute.
4. The actual operation; such as arithmetic, logical, or comparison operations - is carried out by hardware including the ALU Result Store.
5. Once the operation is complete, the result is stored in a register or written to memory if necessary Move to Next Instruction - The CPU returns to step 1 for the next instruction, and this cycle repeats continuously.

CPU-Memory Speed Gap; CPU and Memory Speed Difference The most significant problem in computer performance is the speed gap between the CPU and memory. The CPU is extremely fast at interpreting and executing instructions, while main memory is relatively slow to access.
This means the CPU is often ready to execute the next instruction, but the required data has not yet been loaded from memory; leaving the CPU with nothing to do but wait for a memory response.
If this waiting time repeats, overall system performance will suffer no matter how powerful the CPU is. Over time, CPU performance has risen steeply while memory performance has grown only gradually, causing the gap to widen.

This gap is known as the processor-memory performance gap, and it means that even if CPU speed continues to increase, the overall system performance will be bottlenecked at the point where memory access cannot keep up.

Cache Memory; Cache memory is used to alleviate the speed gap between the CPU and main memory. It is a fast storage space located between the CPU and main memory that temporarily stores instructions and data frequently used by the CPU.
As a result, the CPU can retrieve needed data directly from the cache without accessing main memory every time. Cache is not a replacement for main memory; it is a supplementary storage space designed to assist main memory access and reduce CPU waiting time.

How Cache Memory Works When the CPU needs an instruction or data, it checks the cache first rather than going directly to main memory. If the required data is already in the cache, this is called a cache hit; the CPU can retrieve the data immediately without accessing main memory, enabling very fast processing.
On the other hand, if the data is not in the cache, this is called a cache miss; in this case, the CPU must go all the way to main memory, which takes more time. The retrieved data is then stored in the cache to prepare for future accesses.
In summary, a cache hit is fast and reduces memory access, while a cache miss is slow and requires memory access. The effectiveness of cache depends on how frequently cache hits occur.

Why Cache Works Effectively; Locality The reason cache is effective is due to a property called locality. The tendency for memory access during program execution to be concentrated and repeated within a specific range, rather than occurring randomly. Locality is divided into two types.
Temporal locality refers to the characteristic that data or instructions used recently are likely to be used again in the near future. Variables inside loops or repeatedly executed instructions are typical examples.
Spatial locality refers to the characteristic that if a particular memory location is accessed, nearby instructions or data are also likely to be used soon. Processing an array in order or executing a sequence of consecutive instructions are representative examples; in such cases, loading not just the needed data but also nearby data into the cache at once increases the probability of cache hits on future accesses.
Cache is therefore not simply a fast memory; it is a structure designed to exploit these locality characteristics to reduce memory access and improve performance.

2. Bottleneck

When a program runs, the CPU does not just perform calculations; it uses various resources such as memory access, cache usage, and I/O processing in sequence. If even one of these resources is slow, the entire flow will be delayed at that point. The slowest processing stage that limits overall performance in this way is called a bottleneck. For example, even if the CPU itself is very fast, if memory access is slow, the CPU must wait for the response; and as a result, the overall program speed is determined by memory performance.

To address this, systems use structures such as alleviating memory access delays with cache and improving the processing methods themselves to reduce I/O-related bottlenecks.

Common Flow of I/O Device Processing I/O devices are physically much slower than the CPU. Therefore, if the CPU simply waits whenever an I/O operation occurs, efficiency drops significantly. For this reason, how much the CPU should be involved in I/O processing becomes an important design consideration for the operating system.
In some cases the CPU directly checks the device status, while in others the device sends a completion signal after finishing its task. Depending on the I/O processing method, the CPU can either be tied up waiting or freed to perform other work. The operating system uses various I/O processing methods to maximize system performance.

Polling; is a method in which the CPU periodically checks the status of an I/O device directly after making a request. The CPU continuously asks the device whether it is ready. The advantage of this method is that it is very simple to implement; it only requires checking device status without complex control logic. However, because the CPU repeatedly checks status without doing any other work even when the device is not yet ready, CPU resources are wasted and overall system efficiency drops.

Interrupt; With the interrupt method, the CPU does not check device status directly. Instead, after making an I/O request, it hands the task off to the device and continues performing other work. When the device finishes its task, it sends a signal to the CPU ; this signal is called an interrupt. The CPU temporarily pauses its current work, handles the device's completion request, and then returns to its original task. In other words, rather than the CPU continuously checking device status, the device calls the CPU only when needed. Because the CPU can perform other work during I/O wait time, this method uses CPU resources far more efficiently than polling. The flow of the interrupt method is as follows. First, the CPU sends an I/O request to the device and continues other work without waiting. The device processes the requested data, and upon completion sends an interrupt to the CPU. Multiple devices can generate interrupts simultaneously, but the CPU can only handle one interrupt at a time. When an interrupt occurs, the CPU stops its current work and passes control to the kernel. The operating system processes multiple simultaneous interrupts one by one in order of priority. Once all handling is complete, the CPU returns to its original task. The key point is that the CPU does not continuously check the device. There is no resource waste, and the device calls the CPU only when necessary.

DMA (Direct Memory Access); With DMA, the CPU only sets the conditions for the transfer; such as where data should come from, where it should go, and how much should be transferred. The actual data transfer is then handed off to the DMA controller. Once this setup is complete, the CPU no longer participates directly in the data transfer process and moves on to other work. The actual data movement occurs directly between the I/O device and memory without passing through the CPU. When the transfer is complete, the DMA controller generates an interrupt to notify the CPU that the transfer has finished.
In this way, the CPU only configures the transfer once at the beginning, and all subsequent repetitive data movement is handled entirely by the DMA controller. DMA is therefore the most efficient I/O method, consuming almost no CPU resources even for large-volume data transfers.

Summary of I/O Methods I/O processing methods have evolved based on how much CPU involvement is required.

The Foundations of Software Testing

Heesu Noh — Mon, 16 Mar 2026 14:43:12 GMT

1️⃣ Quality and the V-Model Life Cycle Process
2️⃣ Process Overview
3️⃣ Test Overview

1️⃣ Quality and the V-Model Life Cycle Process

1. Definition of Quality

"Quality" is a term we use all the time; we say a product or service is "high quality" or "low quality." But how exactly is quality defined?

Definitions by Leading Scholars

W. E. Deming (Father of Quality): "Quality is meeting the needs of customers."
J. M. Juran: "Quality is fitness for use."
P. B. Crosby: "Quality is conformance to requirements."

Synthesizing these definitions, quality is determined by how well a product or service satisfies customer or user needs - in other words, how closely it conforms and how fit it is for its intended purpose.

Definition of Quality: The totality of characteristics of a product or service that bear on its ability to satisfy stated and implied requirements.

Stated Requirements vs. Implied Requirements

Stated requirements: Requirements that are documented or formally and explicitly defined.
Implied requirements: Requirements that are not documented but are implicitly expected by the user. Customers often cannot fully articulate everything they need, and these taken-for-granted, hidden expectations are what we call implied requirements.

Therefore, a development organization must uncover and analyze not only stated requirements but especially implied requirements, and reflect them in the product or service - only then can the team deliver what customers truly want.

2. Two Perspectives on Quality

What is ultimately delivered to the customer is the final product, but evaluating quality does not mean looking at the end deliverable alone. Quality must be viewed from two perspectives.

1) Process Quality

This refers to the quality of the entire workflow, including development, maintenance, and the work products produced at each stage. The higher the process quality, the more positive its impact on product quality. Process quality is based on the PDCA cycle.

Plan: Establish a plan before starting work.
Do: Execute the work according to the plan.
Check: Regularly monitor whether the work is on track.
Act: Continuously improve the process.

Organizations that follow the PDCA cycle demonstrate strong process quality, ensuring that all work products are managed systematically.

2) Product Quality

Work products generated through the process must ensure traceability and consistency. Only when the final software or system satisfies the customer's requirements can we say the product quality is good.

Key Takeaway: It is not just the final deliverable that matters. Process quality must be raised first — this improves the quality of intermediate work products at each stage, which in turn elevates the quality of the final deliverable.

[Reference] Quality Management System Based on PDCA

A quality management system takes inputs such as customer requirements and the needs and expectations of relevant stakeholders, and operates across departments — including planning, marketing, sales, R&D, development, testing, and production — all working in accordance with the PDCA cycle. Even a single coding task follows the Plan → Do → Check → Act cycle. Through this, organizations continuously improve customer satisfaction, QMS performance, and the quality of products and services.

[Reference] Relationship Between Process Quality and Product Quality: The BMW Case Study

BMW divided its component suppliers into two groups — those with high process quality and those with low process quality — and analyzed the relationship between process quality and product quality. Product quality was measured by the number of defects discovered prior to SOP (Start of Production).

High process quality organization:

16 months before SOP: 50% of total defects identified
11 months before SOP: 90% of total defects identified → 11 months of buffer time remaining

Low process quality organization:

8 months before SOP: 50% of total defects identified
2 months before SOP: 90% of total defects identified → 10% of defects still unresolved just before production begins

The gap between the two groups in reaching the 90% defect detection milestone is a striking 9 months. This case study clearly demonstrates that process quality has an enormous impact on product quality — and this applies not only to the automotive industry but across all industries.

3. V-Model Life Cycle Process

The V-Model is a software development life cycle (SDLC) model in which the development activities on the left side correspond directly to the testing activities on the right side. It is a model that places strong emphasis on Verification and Validation (V&V), and is also named after the initials of these two concepts.

[Reference] Concepts of Verification and Validation

Verification: Confirming that the product conforms to its specifications, requirements, and design specs. "Are we building the product right?"
Validation: Confirming that the product meets the user's intended use and purpose. "Are we building the right product?"

Example: Testing an electric vehicle against its technical specifications is Verification; evaluating it from the actual end user's perspective is Validation.

[Reference] V&V Techniques

V&V techniques are broadly divided into Static methods and Dynamic methods.

Static methods: Techniques for finding defects without executing the program.
- Review: Document and code reviews (e.g., peer review, walkthrough, inspection)
- Analysis: Static code analysis, formal methods, etc.
Dynamic methods: Techniques for finding defects by actually executing the program.
- Testing: Black-box testing, white-box testing, etc.

Development–Test Correspondence in the V-Model

Customer/User Requirements ↔ Acceptance Testing (AT)
Requirements Analysis ↔ System Testing (ST)
Architectural Design ↔ Integration Testing (IT)
Detailed Design ↔ Unit Testing (UT)
Implementation (coding) → Execute tests sequentially, starting from Unit Testing

It is important to note that this correspondence does not mean "run the tests as soon as each development phase is complete." Rather, as each development phase progresses, the corresponding test planning activities — including test procedures, test approaches, environment setup, and test case development — are carried out in parallel. The actual test execution begins after the code is implemented and proceeds in the order: Unit Testing → Integration Testing → System Testing → Acceptance Testing.

Advantages of the V-Model

Improved quality of work products: By applying static methods such as reviews and analysis, defects in requirements documents, architecture documents, and code can be identified before execution, improving the overall quality of all work products.
Shorter schedule through parallel development and test planning: Running development and test planning in parallel fosters close communication between teams. As the quality of upstream work products improves, the workload in downstream phases is reduced — leading to an overall shorter project schedule.

2️⃣ Process Overview

1. Definition and Role of a Process

In general terms, a process defines the steps, sequences, and procedures for carrying out a piece of work. That is the dictionary definition. But when we look at the role a process actually plays, we can see something more specific.

A process exists so that we can systematically create products; such as vehicles, systems, components, or software - that satisfy customer requirements, including functional requirements, non-functional requirements, and constraints. What does it take to do work well? Work proceeds systematically when procedures and methods, tools and equipment, and personnel are properly integrated. A process can therefore be defined as "the means of integrating procedures/methods, tools/equipment, and personnel in order to build a product that satisfies customer requirements."

Consider a waste sorting facility as an example. When a garbage truck arrives and workers need to sort plastics, aluminum, and paper efficiently, they need defined procedures, equipment set up for efficient sorting, and clearly assigned personnel. When all of these elements work together naturally, the sorting operation runs smoothly. In the same way, a process acts as the glue that integrates procedures, methods, tools, equipment, and people so that work can be done well. The role of a process is significant.

2. Defining a Plan Means Defining a Process

Defining a process and creating a plan are essentially the same thing. Planning is not simply about writing a to-do list — it includes defining the process itself. In other words, a plan must define more than just a schedule. A process binds together procedures, methods, tools, equipment, systems, and personnel. The same applies when planning. To perform testing effectively, you need applicable methods and procedures, systems and equipment for efficiency, and personnel assigned to the right roles. Only when all of these elements are properly in place does the work proceed smoothly.

In short, planning is not merely a to-do list — it is the act of defining a process, and a process-driven plan is what truly matters.

[Reference] How to Define a Process: ETVX

ETVX is a framework for systematically defining the activities to be performed at each stage of a process. The acronym stands for:

Entry Criteria: The conditions that must be met before a task can begin — the entry gate for starting the work.
Task: The detailed activities to be performed and the steps in which they are carried out.
Verification: The criteria for verifying that the work in progress is being done correctly — a mid-process quality check.
eXit Criteria: The conditions that must be satisfied for the work to be considered complete — the final completion gate.

ETVX can be applied to each phase of software development, including requirements development, design/implementation, and testing. Here is an example of how ETVX is applied to the software requirements development process in practice.

Process: Software Requirements Development
Purpose: Analyze customer and system requirements to develop software requirements.

Applying ETVX:

Entry Criteria: Customer requirements and system requirements analysis results are delivered as inputs.
Task: Analyze, specify, and review the software requirements.
Verification: Check that customer and system requirements are properly reflected in the software requirements. Confirm that an objective review of the software requirements has been conducted.
eXit Criteria: System requirements must be fully converted into software requirements, and the software requirements review must be completed with no outstanding issues.

In addition to the ETVX steps, a process definition also includes Tools, Methods, and Roles — because a process encompasses not just procedures but also tools, equipment, and personnel.

Tools: Requirements modeling tools, requirements management systems.
Methods: Stakeholder interviews (using checklists), inspection-based reviews.
Roles: Requirements analysis → performed by the requirements engineer; requirements specification → performed by the requirements engineer; requirements review → performed by reviewers such as the architect and tester.

By defining procedures, methods, tools, equipment, and personnel within the ETVX framework, teams can clearly understand each process and apply the right approach at every stage. This framework is widely used in industry and is well worth knowing.

3️⃣ Test Overview

1. Definition of Testing

Let's look at some of the most widely referenced definitions of software testing.

Myers defines testing as "the process of executing a program with the intent of finding defects." This definition emphasizes that the fundamental purpose of testing is defect detection.

Craig and Jaskiel define testing as "a lifecycle process that engineers, uses, and maintains testware in order to measure and improve the quality of the software being tested." Here, testware refers to the various work products and tools produced during the planning and design of testing activities. In other words, this definition frames testing as the systematic and engineering-based application of testware to measure and improve how well a software product satisfies both stated and implied customer requirements.

IEEE Std 829 defines testing as "the process of analyzing a software item to detect the differences between existing and required conditions and to evaluate the features of the software item." This definition focuses on identifying the gap between the expected behavior of a system and its actual behavior under defect, error, or bug conditions.

While these definitions vary in focus, they share a common goal: to find as many defects and bugs as possible — before the software is released. Once code is complete, it undergoes a series of test levels including Unit Testing, Integration Testing, System Testing, and Acceptance Testing, with various techniques applied at each level. Since defects discovered after deployment or production cause far greater quality issues, the priority is early defect detection across multiple perspectives before release.

2. The Test Process from a PDCA Perspective

As discussed in the context of the V-Model, actual test execution only becomes possible once the code has been implemented. However, the preparation that takes place before execution is equally critical. The test process as a whole follows the PDCA cycle.

Plan — Test Planning

The overall test plan is established and the features to be tested are identified. This phase covers planning activities such as defining the test schedule, test scope, test items, test strategy, and test environment. In the V-Model, test planning runs in parallel with the development phases — customer/user requirements, requirements analysis, architectural design, and detailed design — with corresponding test plans being prepared alongside each. Close communication and collaboration between the development team (left side of the V) and the test team (right side of the V) is the core concept of the V-Model.

Plan — Test Design

Based on the test plan, test cases are developed and test procedures are defined. The test environment is also prepared to replicate real-world usage conditions. For example, in automotive software development, this would mean setting up an environment that reflects actual road conditions.

Do — Test Execution

The prepared test cases are executed and the actual results are compared against expected results to produce a pass/fail verdict.

Check / Act — Test Evaluation and Improvement

Test results are analyzed and evaluated. Progress is monitored and controlled against the test plan, corrective actions are taken when issues arise, and the test process itself is continuously improved.

In summary, testing is not simply the act of execution (Do). The full test process encompasses the entire PDCA cycle — planning, design, execution, and evaluation/improvement — and this is an important point to keep in mind.

[Reference] ISO/IEC/IEEE 29119 Software Testing Standard

An internationally recognized standard for software testing exists in the form of ISO/IEC/IEEE 29119. The standard is structured as follows:

Part 1 — Concepts and Vocabulary: Defines common concepts and terminology used in software testing.
Part 2 — Test Processes: Defines the framework for software test processes.
Part 3 — Test Documentation: Defines requirements for test work products and documentation.
Part 4 — Test Techniques: Defines test design techniques applicable to software testing.
Part 5 — Keyword-Driven Testing: Covers test automation through the use of keyword-driven test scripts.

The standard defines the test process using a multi-layer architecture consisting of three levels:

1) Organizational Test Process

At the organizational level, the Test Policy and Test Strategy are established. These are then passed down to the project level, where they serve as the governing framework for all test activities.

2) Test Management Process

This process operates at the project level and manages testing across different test levels and types — including Unit, Integration, Performance, and Security Testing. It defines a series of activities: test planning, test monitoring and control, and test completion.

3) Dynamic Test Process

This is the process by which testing is actually carried out. It consists of a series of activities including test design and implementation, test environment setup and maintenance, test execution, and test results reporting.

This standard provides an internationally agreed-upon framework for establishing and operating a systematic test process, and is widely referenced in industry practice.

Introduction to Operating Systems: Concepts, Development, and Practice

Heesu Noh — Fri, 06 Mar 2026 16:15:42 GMT

1️⃣ Concept and role of operating system

2️⃣ Development and characteristics of operating systems
3️⃣ Establishing a practical environment

1️⃣ Concept and role of operating system

Definition of an Operating System

The core software that coordinates all operations between the user and hardware

5 Key Roles

Intermediary between the user and hardware
Controls and manages the execution of applications
Allocates and manages computer resources
Provides input/output control and data management services
Provides an abstracted execution environment that hides the details of hardware

2. Computer System Structure

User (Human, device, other computer)
  ↕
Software (Operating System) ← Core
  ↕
Hardware (CPU, memory, storage, I/O device)

These layers can never be skipped.

3. Main Roles of the OS (3 Roles)

Role	Description
Coordinator	Manages execution order of multiple programs and provides a concurrent execution environment
Resource Allocator	Distributes CPU, memory, storage, etc. efficiently, fairly, and safely
Control Program	Blocks improper resource usage and prevents errors from spreading throughout the system

4. Goals of OS Development (3 Goals)

Convenience; Hides complex hardware and provides intuitive interfaces (mouse, touch) → Accessible to everyone
Efficiency; Manages resources in a balanced, waste-free manner → Higher throughput, faster response, stable operation
Improved Control Services; Stably controls I/O devices and system state in multi-user and multi-program environments

Main Functions of the OS (7 Functions)

① Memory Management

Manages RAM allocation and reclamation when programs are executed
Memory is volatile → All data is lost when power is turned off

② Secondary Storage Management

Stores data on SSDs and HDDs (non-volatile, large capacity but slow)
Loads only what is needed into main memory, and moves the rest back to storage
Always works in conjunction with memory management

③ Process Management

Manages running programs as individual process units
Divides CPU time into small slices and distributes them to each process
Prevents resource monopolization and conflicts

④ Input/Output Device Management

Abstracts the differences between diverse devices such as keyboards, mice, and printers
Standardizes I/O so that programs can interact with all devices in a uniform way
Prevents conflicts during simultaneous requests and handles them efficiently

⑤ File (Data) Management

Systematically stores and manages documents, photos, executables, and other files
Prevents file corruption even when multiple users or programs access the same file

⑥ System Protection (User Access Control)

Defines and manages permissions for who can do what
Protects the system from malicious access and faulty programs

⑦ Networking and Command Interpreter

Manages external communication such as internet access and file transfers
Translates command-line inputs and button clicks into actual system operations

One-line Summary: An operating system is the core software that makes computers convenient for users, efficient for the system, and stable for the overall environment.

2️⃣ Development and characteristics of operating systems

💡 Development Process of Operation System

Why Did Operating Systems Become Necessary?

In the early days of computing, humans managed everything directly. There were two core problems: setting up tasks and handling input/output took far too long, and since the CPU was fast while I/O was slow, the CPU was constantly sitting idle. In trying to solve this inefficiency, an automated management system — the operating system — was born.

The Evolution of Operating Systems

Era	Key Development
1940s	No OS; humans managed everything directly
1950s	Batch processing introduced; the beginning of automation
1960s	Multiprogramming & time-sharing; improved CPU efficiency, multi-user support
1970s~	Distributed processing; multiple computers connected via network → leading to today's cloud computing

💡Three Major Types of Operating Systems

Batch Processing OS: Collects jobs and processes them in order. Automation achieved, but CPU efficiency remained low

Time-Sharing OS: Divides CPU time into tiny slices and alternates between tasks → fast response times, multi-user support

Distributed OS: Connects multiple computers via a network to operate as a single system → improved performance and reliability, the foundation of cloud computing

💡What Is OS Structure?

OS structure refers to how the functions of an operating system are internally organized and designed. It is not about describing what the OS does on the surface, but rather how it is architected on the inside. The central concept is the kernel, which is responsible for managing core resources such as the CPU and memory, with various services arranged around it. In short, the difference in OS structure comes down to how functions are arranged and separated within the system.

🤔Why Is OS Structure Necessary?

There are two main reasons why a defined operating system structure is needed.

Functional Complexity: Early operating systems had relatively little to do, but today's OS is responsible for processor management, file protection, network handling, and much more. Implementing all of these functions without any structured design would make the system nearly impossible to maintain.
Structure Determines Outcomes: Even with the same set of functions, the choice of structure directly affects the system's performance, stability, extensibility, and maintainability. A poorly designed structure means that modifying a single function could impact the entire system, whereas a well-designed structure allows only the necessary parts to be updated without broader side effects.

Four Types of OS Structure

There are four main types of OS structure, each designed differently depending on its purpose and use environment.

Structure	Core Idea	Advantages	Disadvantages
Monolithic	All functions packed into a single kernel	Fast processing speed	Errors affect the entire system; hard to maintain
Modular	Functions separated as plug-and-play components	Flexible and extensible	Requires careful management of module dependencies
Layered	Access only permitted from upper to lower layers in order	Easy to debug; high stability	Performance overhead; lacks flexibility
Microkernel	Only core functions in kernel; rest moved to user space	High safety and security	Frequent message passing causes performance

1) Monolithic Structure

The earliest and simplest form of OS structure. All operating system functions run within a single kernel space, meaning nothing is separated — everything is packed together in one place. Functions within the kernel call each other directly with no intermediate steps, which makes processing very fast with minimal overhead. However, the dependency between functions is extremely high, meaning a problem in one area can spread to the entire system, making it difficult to maintain and debug. In short: fast, but risky. Early UNIX and MS-DOS were built on this structure.

Looking at the structure from the top down, users sit at the very top, followed by user programs such as shells, compilers, and system libraries. When these user programs make a request, they enter the kernel through a system call. Inside the kernel, all functions; including the file system, processor scheduling, memory management, and I/O management; are packed together within a single kernel space. In this structure, the kernel knows everything and manages everything directly.

However, the dependency between functions is extremely high, meaning a problem in one area can spread to the entire system, making maintenance difficult. The larger the structure grows, the harder it becomes to manage. Ultimately, the monolithic structure is one that gains fast performance by placing all functions into a single kernel, but at the cost of safety and maintainability

2) Modular Structure

Born from the question: "What if we kept the monolithic base but managed functions as separate, interchangeable parts?" The modular structure maintains the monolithic foundation but separates functions into independent modules that can be dynamically loaded or removed as needed; much like plugging in or unplugging components. Its greatest strength is balance: it preserves the performance of a monolithic structure while significantly improving flexibility and maintainability. The downside is that dependencies between modules must be carefully managed, and loading a faulty module can still affect kernel stability. Modern UNIX-based systems such as Linux and Solaris use this structure.

3) Layered Structure

A structure with a strict top-to-bottom hierarchy. The OS is divided into multiple layers, and each layer can only interact with the layer directly below it — skipping layers is not allowed. This makes it easy to pinpoint exactly which layer a problem occurred in, improving overall stability and debuggability. However, because every request must pass through each layer one by one, the number of calls increases and performance suffers. The rigid rules between layers also make it inflexible when adding new features or modifying existing ones. As a result, while layered structure is easy to understand and well-organized in theory, it is rarely used in commercial operating systems due to its performance limitations. It has been used in educational OS environments such as THE operating system.

Looking at the structure from top to bottom, the layers are stacked as follows. The most important point here is that each layer can only interact with the layer directly below it. Skipping any intermediate layer is strictly impossible. While this structure is very easy to understand, applying it directly to a real operating system places a significant burden in terms of both performance and flexibility. In short, the layered structure is well-suited for understanding and design, but has clear limitations when it comes to real-world performance.

4) Microkernel Structure

Starts from the idea of "leave only the essentials in the kernel." Only the most fundamental functions - process management, memory management, and inter-process communication ; remain in the kernel. Everything else, such as file systems and device management, is moved out to user space. This results in a smaller kernel with significantly improved safety and security. The tradeoff is that the kernel and user space must frequently exchange messages, which introduces some performance overhead. Real-world examples include Mach, MINIX, and QNX — with QNX being widely used in safety-critical fields such as automotive and medical devices. macOS and Windows, on the other hand, use a hybrid structure that incorporates only some microkernel concepts rather than adopting it fully.

The structure can be mapped out as follows. Only the core functions remain in the kernel, while everything else; such as the file system and device management is moved out to user space. Services in user space can only communicate with the kernel through system calls. As a result, the structure is cleanly separated, and even when a problem occurs, its impact does not spread widely across the system. In short, this is a structure that minimizes the kernel in order to maximize stability and extensibility.

🤔Why Do Modern Operating Systems Use a Hybrid Structure?

네, 해당 부분도 한국어 요약과 영어 번역 함께 정리해드립니다!

한국어

왜 현대 운영체제는 혼합 구조인가?

이유는 크게 두 가지입니다.

첫 번째, 구조마다 장단점이 너무 뚜렷하기 때문입니다. 단일 구조는 성능은 좋지만 관리가 어렵고, 마이크로 커널은 안정성은 좋지만 성능이 아쉬우며, 계층 구조는 이해하기는 쉽지만 유연성이 부족합니다. 하나의 구조만으로는 모든 요구를 동시에 만족시키기 어렵습니다.

두 번째, 운영체제에 대한 요구가 너무 많아졌기 때문입니다. 현대 운영체제는 단순히 작동만 하면 되는 수준이 아닙니다. 높은 성능, 안정성과 보안, 다양한 하드웨어 지원, 지속적인 기능 확장까지 모두 요구받습니다. 현실적으로 단 하나의 구조가 이 모든 것을 충족시키기는 불가능에 가깝습니다. 그래서 운영체제는 상황에 맞게 구조를 섞는 방법을 선택하게 되었습니다.

결론적으로 현대 운영체제는 하나의 구조를 고집하지 않고, 여러 구조의 장점을 취하고 단점은 최대한 줄이기 위해 혼합 구조를 사용합니다.

리눅스: 모놀리식 구조 기반 + 모듈 구조 결합
Windows·macOS: 모놀리식 + 마이크로 커널 개념 혼합

핵심 결론: 현대 운영체제는 완벽한 구조를 찾는 것이 아닌, 현실적인 타협을 선택한 결과물이다.

🤔Why Do Modern Operating Systems Use a Hybrid Structure?

There are two main reasons.

First, every structure has clear and distinct trade-offs. The monolithic structure offers great performance but is difficult to manage. The microkernel structure is highly stable but falls short in performance. The layered structure is easy to understand but lacks flexibility. No single structure can realistically satisfy all requirements at the same time.

Second, the demands placed on operating systems have grown enormously. Modern operating systems are no longer expected to simply run; they must deliver high performance, strong stability and security, support for a wide variety of hardware, and continuous feature expansion all at once. It is practically impossible for any single structure to meet all of these demands. As a result, operating systems have chosen to mix structures depending on the situation.

In conclusion, modern operating systems do not commit to a single structure. Instead, they adopt the strengths of multiple structures while minimizing their respective weaknesses through a hybrid approach.

Linux: Monolithic base + Modular structure combined
Windows & macOS: Monolithic + Microkernel concepts blended together

Key Takeaway: Modern operating systems are not the result of finding a perfect structure — they are the product of practical compromise.

3️⃣ Establishing a practical environment

💡Virtual Machines

1. What Is a Virtual Machine?

A virtual machine (VM) is a technology that divides the resources of a single physical computer into multiple independent execution environments. Using virtualization software, you can create several virtual computers on one laptop, each running a different operating system just like a real machine. Since each virtual machine operates independently, any problem that occurs inside a VM does not affect the actual host laptop.

2. Why Use Virtual Machines?

Directly installing an operating system on a personal PC can lead to the following problems.

Incorrect settings can prevent the system from booting or cause system damage
Once a problem occurs, restoring the original state is difficult
Switching between multiple operating systems for practice is a significant burden on a personal PC

Virtual machines solve all of these issues. Multiple operating systems can be practiced safely on a single PC, and recovery is easy even if something goes wrong.

3. Host-Based Virtualization

This course uses a host-based virtualization approach, where virtualization software is installed on top of the existing operating system and virtual machines are run within it. This method allows direct installation on a personal PC and easy recovery, making it the most suitable approach for hands-on practice.

The practice workflow is as follows.

Install Virtualization Software → Create Virtual Machine → Install Ubuntu OS

4. Virtualization Software: VirtualBox

This course uses VirtualBox for hands-on practice.

Advantages: Free to use, relatively simple installation and setup, beginner-friendly
Note: VirtualBox does not work properly — or at all — on MacBooks with Apple Silicon chips (M1·M2·M3). Students with these devices must use alternative virtualization software.

5. Linux Distribution for Practice: Ubuntu

The operating system to be installed inside the virtual machine is Ubuntu Linux. It was chosen for three reasons.

Provides a user-friendly interface accessible even to beginners
Well-supported by learning resources and an active community
Relatively straightforward to install and use in a virtual machine environment

At the end of this chaper - I was able to install Ubuntu in the VM using VirtualBox! yay.

Understanding Software: Definitions, Quality Challenges, and Development Processes

Heesu Noh — Tue, 03 Mar 2026 12:58:44 GMT

1️⃣Definition and characteristics of software
2️⃣ Software quality issues
3️⃣ Software Development Process

1️⃣Definition and characteristics of software

1) Changes Brought About by Software

What Kind of World Do We Live In?

Looking back at the era we live in, we have passed through the 1st, 2nd, and 3rd Industrial Revolutions, and we are now entering the age of the 4th Industrial Revolution - an era of intelligent information technology built on AI, IoT, Big Data, and autonomous driving.

So what makes all of this possible? It is the software that controls and drives these systems. We are living in a time where software plays a central role across society and the economy.

The Role of Software - The Case of Autonomous Driving

Tesla, a leader in the autonomous driving industry, uses numerous cameras and sensors to recognize objects, people, and lane markings, and to control the vehicle accordingly. What actually performs this judgment and control is software - invisible, yet operating deep within the system. The role of software will only continue to grow.

The Scale of Software - Understanding Size Through LOC

The size of software is measured in LOC (Lines of Code).

Software	LOC
Practice code	Tens to hundreds of lines
Average mobile app	~30,000 lines
Android OS	~12 million lines
Windows OS	~40 million lines
Modern high-end car	100 million+ lines

If 30,000 lines were printed on A4 paper (at roughly 40–50 lines per page), it would produce approximately 600 to 700 pages. Imagine someone handing you a 600-page book - just reading and understanding it would be an enormous challenge. The sheer scale of software is already staggering.

Software Defects and Quality Concerns

As the size of software grows, so does the number of potential defects.

Even the best software development companies in the United States are known to produce "8 to 12 defects per 1,000 lines of code."

Applying this to real-world scale:

A 30,000-line app → approximately 300 potential defects
A 100-million-line car → approximately 1,000,000 potential defects

Why do so many defects occur? The most fundamental reason is that software is still developed by humans. As long as humans are doing the development, human error is inevitable.

2) What Is Software?

We have already discussed software-centric society. So how do we define software itself? The most common definition is: a collection of instructions - source code programs - that control hardware. But if we take a broader view, the definition expands significantly.

The software development process is not focused solely on coding. It begins with gathering customer requirements, then moves through analysis, and then a design phase where those requirements are examined in concrete detail from a development perspective. From there, implementation (coding) takes place, followed by testing to verify that the software satisfies the original requirements, and finally the software is deployed.

Because software goes through this entire journey — from requirements to deployment — it produces files such as requirements specifications, architecture design documents, source code, and test result reports. Software, therefore, can be defined as all artifacts and data produced through the development process, including but not limited to the source code itself. This definition already implies that software development and maintenance are part of what software fundamentally is.

Why Must the Process Be Reflected in the Definition? This is directly tied to the four key characteristics of software.

1. Invisibility (비가시성)

The internal structure of software is not visible to the eye. When using an app or a website, you see the UI on screen, but the actual underlying structure is not clearly visible. This is the most defining characteristic of software.

Should we simply accept this invisibility? If we do, projects will become increasingly difficult to manage. Instead, we must make every effort to visualize the structure as much as possible. The most representative approach is architecture design - before any coding begins, identifying what components make up the software, how they interact, and what their interfaces look like. Visualization is a deliberate and necessary effort.

2. Non-Linearity (비선형성)

Software has a complex, non-linear structure. In a linear system, the flow of components is predictable and easy to follow. In a non-linear system, components are intricately entangled with one another.

The goal is to reduce this complexity, and this is addressed during the analysis and design phases. For example, if a system has a complexity level of 6 and we want to bring it down to 4 or 5, one approach is to introduce a mediating module between the existing modules. Instead of having them interface directly with each other, they communicate indirectly through the intermediary — and this reduces overall complexity. These structural decisions must be considered before programming begins.

3. Does Not Wear Out, But Continuously Changes (마모되지 않고 변경됨)

Hardware wears down with use — it degrades and eventually breaks. Software, on the other hand, does not wear out. The same software used today, tomorrow, a month from now, or a year from now on the same platform will always perform the same way. It does not deteriorate.

However, software is constantly changing. For how long? Until the product it is embedded in is no longer in use. Changes occur for many reasons — a user discovers a problem, an internal request is made, or new requirements emerge.

Consider a car, which is typically used for about 10 years, or a subway train, which can remain in service for around 30 years. The software embedded in these vehicles does not stop evolving once it is first released — it continues to be updated and modified throughout the entire lifespan of the product.

4. Human Intensive (사람 중심의 작업)

Hardware components are manufactured in factories and production lines, largely automated by robots. Software, however, is still built by people — and because of that, potential human error is inherent in the process.

The goal is to minimize human error and to detect faults before they lead to failures. The most representative tool for achieving this is testing. Through rigorous testing, we must actively work to find and eliminate the many defects that exist within software.

The Consequences: A 30% Success Rate

Due to these four characteristics, the success rate of IT projects involving software development is approximately 30%. One might assume that most projects succeed, but in reality, success requires hitting all three marks of QCD — Quality, Cost, and Delivery. Achieving all three simultaneously is extremely difficult, which is why the success rate remains so low.

Nevertheless, we must continue striving to raise that success rate and improve software quality, however incrementally.

Conclusion

Software is growing in scale at a rapid pace, and the number of potential defects grows proportionally. For this reason, systematic and thorough efforts to ensure software quality are absolutely essential.

2️⃣ Software quality issues

What Happens When Software Defects Occur?

When the final product is an aircraft or an automobile, software may appear to be just a small component. However, a small error made by a developer becomes a fault, which then leads to a failure, propagating through subsystems and eventually up to the entire system -ultimately influencing the final product and causing accidents. This is known as software fault propagation. In severe cases, it can result in the loss of human life or catastrophic damage to property.

Real-World Accident Cases Caused by Software

1. Medical Field; Therac-25 Radiation Therapy Machine

The Therac-25, a radiation therapy machine developed in 1985 by ACEL in Canada, is one of the most well-known examples. Because direct radiation exposure is harmful to the human body, the machine was designed with two modes: a low-power Electron mode and a high-power X-ray mode. In X-ray mode, a turntable was designed to intervene and prevent radiation from being directly applied to the patient.

However, due to a software malfunction, the turntable failed to activate even when the machine was in X-ray mode. As a result, between 1985 and 1987, 6 radiation overdose incidents occurred, leaving 3 people dead and 3 others with permanent radiation-related disabilities. This is one of the most cited examples of a small developer error leading to fatal consequences.

2. Space Industry; Ariane 5

In 1996, the Ariane 5 rocket, launched by the European Space Agency, exploded just 37 seconds after liftoff; at a height still visible to the naked eye from the ground.

The cause was traced to the reuse of software from the previous model, Ariane 4 (a 16-bit system), in the Ariane 5 (a 64-bit system) without accounting for the difference in bit architecture. During data conversion, an overflow error occurred in the variable representing altitude. Although the rocket was at a normal altitude, the software incorrectly determined that it had gone off course. Because the system was designed to self-destruct upon detecting a trajectory deviation, the rocket was automatically destroyed. Once again, a small developer oversight led to a catastrophic outcome.

3. Aviation; Boeing 737 MAX 8

In October 2018, a Boeing 737 MAX 8 crashed, killing all passengers on board. The aircraft was equipped with the MCAS (Maneuvering Characteristics Augmentation System), which controls the pitch of the aircraft and relies on data from the AOA (Angle of Attack) sensor.

The critical flaw was a software error that allowed the MCAS to activate even when the AOA sensor was malfunctioning. The system continuously commanded the horizontal tail to push the nose downward, and when the pilots were unable to override it, the aircraft crashed.

4. Automotive; Toyota Lexus ES350

In August 2009, a sudden unintended acceleration incident involving a Lexus ES350 in California, USA, resulted in the deaths of an entire family of four. The vehicle was equipped with an ETCS (Electronic Throttle Control System), and following the accident, a simulation experiment successfully reproduced the sudden acceleration by manipulating specific bit values in memory; providing experimental proof that the software was at fault.

Conclusion

These cases make it abundantly clear why ensuring software quality is of the utmost importance. Achieving that quality requires systematic efforts across process, engineering, and technical dimensions, and at the heart of those efforts lies testing.

3️⃣ Software Development Process

The Importance of Process for Quality Assurance

How can we ensure software quality? Watts Humphrey, a renowned software engineer, once said:

"The quality of a product is determined by the quality of the process used to develop and maintain it."

In other words, a better process leads to a better product. While delivering software to the customer is important, what truly matters is establishing and adhering to a well-defined process throughout the entire development lifecycle - from requirements analysis to architecture design.

Stages of Software Development

Software development progresses through the following stages:

Customer Requirements — Abstract and loosely defined requirements are received from the customer.
Requirements Analysis — The customer's requirements are broken down and analyzed from a development perspective.
Design — The HOW is defined: what structure the software will have and how it will behave. This is an indispensable stage in the development process.
Implementation (Coding) — Developers write the actual code based on the design.
Testing — The implemented software is verified to ensure it meets the design specifications and satisfies the original requirements.

Only after testing is complete is the software considered finished and ready for deployment to the customer. Thoroughly following each of these stages is the most fundamental way to improve the quality of delivered software.

Software Development Life Cycle (SDLC) Models

1. Waterfall Model

The most classic model, in which the development process flows sequentially from top to bottom. Like a waterfall.

Requirements Analysis → Design → Implementation → Testing → Maintenance

Each phase must be fully completed before moving on to the next. This model is best suited for projects with clear and well-defined requirements.

2. Prototyping Model

Rather than developing everything at once, this model involves first building a prototype of the most critical components from the customer's perspective, then gathering feedback before continuing development. The product is gradually refined based on customer evaluation.

This approach is far more flexible than the Waterfall model and can respond effectively even when customer requirements change during development.

3. Spiral Model

Like the Prototyping model, this approach develops software incrementally, prioritizing the most important requirements first. The key distinction is that risk analysis is explicitly integrated into every cycle of the process. At each spiral, risks are identified and analyzed, and the findings are incorporated into the next cycle; making it a more systematic and risk-aware development methodology.

4. V-Model

One of the most widely used models in practice today, the V-Model is particularly applied in projects where quality is paramount. It is derived from the Waterfall model and takes the shape of the letter "V".

The model places strong emphasis on Verification and Validation, mapping each development phase on the left side of the "V" to a corresponding testing phase on the right.

Development Phase	Testing Phase
Customer Requirements	Acceptance Testing
Requirements Analysis	System Testing
Architecture Design	Integration Testing
Detailed Design	Unit Testing

Summary of Today's Key Takeaways

Today's content can be summarized into three main points:

① Definition and Characteristics of Software - Software is not merely source code; it encompasses all artifacts and data produced throughout the entire development process. Its key characteristics include invisibility, non-linearity, human-intensive work, and continuous change.

② The Importance of Software Quality Assurance - These characteristics make quality assurance inherently difficult, but as the accident case studies demonstrate, ensuring quality is absolutely non-negotiable.

③ Software Development Processes - To achieve quality, structured processes must be applied. The four representative models are the Waterfall Model, Prototyping Model, Spiral Model, and V-Model.

Understanding Linux/Unix

Heesu Noh — Fri, 27 Feb 2026 16:30:00 GMT

1️⃣ Hardware Basics
2️⃣ Software Basics
3️⃣ Unix - A Closer Look
4️⃣ Linux - A Quick Look
5️⃣ Study Summary

1️⃣ Hardware Basics

Hardware is anything physical you can touch and feel on a computer. It is divided into three main components: CPU, Memory, and Storage.

The CPU (Central Processing Unit) is like the brain of a computer. It processes all the instructions and is produced by companies such as AMD and Intel.

Memory temporarily stores data that the CPU is currently working with, allowing for fast processing. A common example is RAM. Traditionally, RAM was known as volatile storage — meaning everything is lost when the power is turned off — while storage was non-volatile and could be recovered. However, in recent years, memory that retains data even without power has emerged. For now, the key distinction is whether data remains accessible after the power is cycled on and off.

Storage preserves data even when the power is off, saving files in a permanent format. It is divided into HDD (Hard Disk Drive) and SSD (Solid State Drive).

Other hardware includes input/output devices such as keyboards, monitors, printers, and speakers. There are also network devices like the NIC (Network Interface Card). The NIC gets its name from the first letters of "Network Interface Card." While modern motherboards often have it built in, it was originally a separate card-shaped device, similar in size to a credit card.

2️⃣ Software Basics

Unlike hardware, software has no physical form you can touch. It is divided into four categories: System Software, Operating Systems, Device Drivers, and Application Software — though the boundaries between them are not always clear-cut.

System Software manages the hardware and serves as the foundation for all other software running on the computer.

Operating Systems manage the computer's resources. Common examples include Linux, Windows, Unix, and Android.

Device Drivers are software that operates and manages specific hardware devices. A new driver is added each time a new device is introduced.

Application Software is software that performs specific tasks for the user, such as installing and updating apps.

A Closer Look at Operating Systems

Can a program run without an operating system? The answer is yes! Early computers actually worked this way. Even today, it is still possible through microcontrollers.

So why do we have operating systems? They were developed to help us use and manage computer resources more efficiently. Operating systems didn't always exist — they came about as computers grew more complex and the need for better resource management increased.

Here's the English version continuing the same blog style:

3️⃣Unix - A Closer Look

If you worked with computers in the 20th century, Unix was something you almost certainly had to know. It was the dominant operating system of the time, widely used from the latter half of the 20th century onwards. Back then, Unix made computers available for free or at a very low cost, which helped drive the adoption of operating systems overall.

Unix was developed in 1969 at Bell Labs, a research division of AT&T, the telecommunications company.

How did we get here?

Before Unix, there was a software system called Multics. In the early days of computing, programs were written on a 1:1 basis for each specific machine — completely different from today where the same software can run across many different devices. Every time a new machine came out, developers had to repeat the same work all over again. This made the need for a unified, portable software system very clear.

Key Features of Unix

Time Sharing System Unix introduced the concept of sharing resources and time among multiple users. Think of it like a food delivery app: an Uber driver doesn't just deliver for one customer. They handle multiple orders. Similarly, Unix supports multiple users (multi-user) and multiple processes (multi-processing) running at the same time.

Command Line Interface (CLI) Unlike today's large graphical displays, early technology only allowed users to view and input one line at a time on screen.

Everything is a File In Unix, everything is treated and processed as a file.

Batch Processing Before time sharing, computers used batch processing, where tasks were collected and processed one at a time. When one person's job finished, the next person could use the computer. This was how computers were shared back then.

The introduction of the Time Sharing System changed everything. It allowed user 1, 2, and 3 to all work at the same time, rather than waiting in line. This became the foundation of what we know as the Unix system.

4️⃣Linux - A Quick Look

Linux was first released in 1991 by Linus Benedict Torvalds, starting out based on an operating system called MINIX.

MINIX was originally developed as an educational alternative to the expensive UNIX, making it more accessible for learning purposes. Linux began from MINIX and later expanded by incorporating many of the functional features of UNIX.

During the era when Unix dominated the market, Unix did not provide its source code. Instead, it would install Unix for you, and once you became comfortable using it, the environment practically encouraged you to go work for the company. For students who wanted to learn, it was a very restrictive environment — and for teachers, it was equally difficult to teach something so closed off.

This is the background that gave birth to Linux.

What I find really inspiring is that Linus started Linux as a hobby while he was still a student, through a MINIX class he was taking at the time. As a student myself, I find that really motivating. It's a reminder that great things can start from simply learning and being curious! 🌱

Study Summary

Unix was an operating system developed by inheriting the time sharing concept from Multics. It was a highly scalable system, but eventually lost its market dominance to Linux. Why?

The risks associated with using Unix for free were significant. There was a time when Unix was a mandatory subject at universities, and because of that it was already widely spread with strong technical support. However, when Linux launched in the 1990s, it was completely free to use and had incorporated many of Unix's best features.

Unix kept its source code private, meaning users could only use it as provided without being able to modify it. Linux was the complete opposite — open and accessible. While official technical support for Linux does require payment, most distributions are free to install and use.

String Matching Algorithms: Everything About Efficient Pattern Search

Heesu Noh — Sun, 08 Dec 2024 06:12:42 GMT

Contents

1️⃣문자열 매칭 (String Matching)
2️⃣원시적 매칭 (Naive Matching Algorithm)
3️⃣오토마타 이용 매칭 (Automata Theory)
4️⃣라빈-카프 알고리즘(Rabin-Karp Algorithms)
5️⃣KMP 알고리즘과 실패 함수 (KMP Algorithm and Failure Function)
6️⃣보이어-무어 알고리즘(Boyer-Moore Algorithm)

1️⃣문자열 매칭 (String Matching)

문자열 매칭 (String Matching)이란 텍스트 문자열에 주어진 **패턴 문자열 (Pattern)**이 나타나는 위치를 찾아내는 과정을 말한다. 일반적으로 패턴 문자열은 텍스트보다 매우 짧은 경우가 많다.

문자열 매칭 알고리즘은 다양한 분야에서 활용된다.

문서 작업: 특정 문자열을 검색하는 알고리즘
인터넷 검색: 키워드 검색
데이터베이스: 항목 값 비교 또는 검색
백신 프로그램: 악성코드 패턴 탐지

2️⃣원시적 매칭 (Naive Matching Algorithm)

원시적 매칭은 텍스트에서 패턴 문자열을 찾기 위해 **순차적으로 하나씩 비교 (Sequential Comparison)**하는 방식이다. 이는 실무에서 거의 사용되지 않지만, 다른 문자열 매칭 알고리즘을 이해하기 위해 기본적으로 알아야 하는 개념이다. 시간 복잡도 (Time Complexity)는 O(mn) 패턴 길이 m, 텍스트 길이 n로 모든 위치를 하나씩 검사하기 때문에 효율이 떨어질 수 있다.

✅ 작동 과정 (How it Works)

첫 번째 위치부터 비교 (Compare from the first position)
텍스트의 첫 번째 문자부터 패턴의 첫 번째 문자까지 하나씩 비교한다.
- 매칭에 실패하면 패턴을 한 칸 뒤로 이동한다.
- 이 과정을 텍스트 끝까지 반복한다.
패턴이 매칭되었을 경우 (When the pattern matches)
패턴의 모든 문자가 텍스트의 해당 위치에서 일치하면 해당 위치를 출력하게 된다.

✅원시적 매칭의 작동 원리(Principal Work of Native Matching Algorithms)

텍스트: `bobycatsoaropt`, 패턴: `soar`

텍스트의 처음부터 soar를 비교:
- b와 s 비교 → 실패
- o와 s 비교 → 실패
- ...
패턴을 한 칸씩 이동하며 비교:
- s와 s → 성공
- o와 o → 성공
- a와 a → 성공
- r와 r → 성공 (완전 매칭)

💡중요한 포인트 (Key Points)

순차적 비교 (Sequential Comparison): 문자 하나씩 비교
패턴 이동 (Pattern Shift): 매칭 실패 시 한 칸 이동
응용 분야 (Applications): 인터넷 검색, 데이터베이스 검색, 백신 프로그램 등
단점 (Disadvantages): 시간 복잡도가 높아 대규모 데이터에는 비효율적

원시적 매칭 의사코드 (Pseudocode for Naive String Matching Algorithm)

텍스트 탐색 범위 설정 (Set search range in text):
- 반복문은 i = 1부터 i = n−m + 1까지 진행된다.
- 이는 패턴의 길이 m만큼 텍스트에서 잘라 비교할 수 있도록 범위를 제한하는 것이다.
비교 조건 (Comparison condition):
- 현재 텍스트의 i번째부터 i + m − 1 번째까지의 문자열을 패턴과 비교한다.
- 모든 문자가 동일할 경우 매칭이 발견된다.
시간 복잡도 (Time Complexity):
- n은 텍스트의 길이, m은 패턴의 길이로, 최악의 경우 O(mn)만큼의 연산이 필요하다.

💡중요한 포인트 (Key Points)

탐색 범위 조정 (Adjust search range): 패턴이 텍스트 끝을 초과하지 않도록 n−m+1 까지만 탐색
조건 검사 (Condition check): 텍스트의 특정 부분이 패턴과 동일한지 비교
시간 복잡도 (Complexity): 비효율적이지만 개념 이해에 중요

원시적 매칭의 비효율적인 사례 (Inefficiency of Naive String Matching)

💡요약: 원시적 문자열 매칭은 순차적으로 모든 위치에서 비교하기 때문에, 특히 패턴의 대부분이 매칭되다가 마지막에 불일치가 일어나는 경우, 이미 비교한 부분까지 모두 반복해야 하므로 비효율적이다. 이러한 문제를 개선하기 위해 패턴 문자열의 앞부분과 뒷부분의 일치 여부를 활용하는 고급 알고리즘이 필요하다.

1번째 시도: abcdabcd는 매칭되지만 d와 w에서 불일치 발생.
2번째 시도: 패턴을 한 칸 이동, 처음부터 다시 비교 → 실패.
3~4번째 시도: 반복적으로 비교 → 실패.
5번째 시도: 최종적으로 완전한 매칭 성공
이 과정에서 이미 일치했던 abc를 계속 비교하며 연산이 낭비된다 (파란색 박스)

✅ 비효율적인 사례 설명 (Example of Inefficiency):

패턴 문자열과 텍스트의 일부가 매칭되더라도, 마지막 문자에서 불일치가 발생하면 앞서 비교했던 모든 연산이 무의미해진다.
텍스트에서 패턴을 한 칸씩 이동하며 처음부터 다시 비교를 시작해야 한다.

✅ 첨부 예제와 연결된 설명 (Example Analysis):

텍스트 (Text): ...abcdabcdabcwz, 패턴 (Pattern): abcdabcwz

✅문제점 (Problem): 매칭 실패 시 매번 처음부터 다시 비교하고 불필요한 연산이 많아 시간 효율성이 떨어진다.

💡 중요한 포인트 (Key Points)

마지막 문자 불일치 (Mismatch at the last character): 불필요한 반복 연산 발생.
패턴의 앞/뒤 일치 정보 활용 (Use pattern prefix/suffix): 효율적인 매칭 가능.
비효율성의 해결 (Resolving inefficiency): 오토마타(automata)이용 매칭

3️⃣오토마타 이론 (Automata Theory)

오토마타는 수학적이고 추상적인 기계로, 현재 상태를 기반으로 입력에 따라 상태를 변화시키며 출력을 생성한다. 이는 컴퓨터 과학에서 알고리즘과 문제를 해결하는 데 매우 유용하며, 문자열 매칭에서도 활용될 수 있다.

✅ 오토마타 이론이란? (What is Automata Theory)

어원 (Etymology): 오토 (Auto): 자동, 마토 (Mata): 기계
정의 (Definition): 오토마타는 문제를 해결하기 위해 유한한 상태를 정의하고, 특정 규칙에 따라 상태를 전환하며 출력을 생성하는 **추상적 기계 (Abstract Machine)**이다. 이는 실제 컴퓨터가 아닌 수학적으로 모델링된 개념이다.

✅오토마타의 예(Example of Automata Theory)

현재 컴퓨터의 상태와 입력에 따라 생성되는 결과가 정의되어 있고 특정규칙에 따라 출력 결과가 달라지는 오토마타 이론의 좋은 예제이다.

상태 (States):
1. off 상태
2. 절전 상태
3. on 상태
전이 (Transitions):
- 전원 스위치를 켜면 off → 절전 상태
- 입력(키보드/마우스)이 있으면 절전 상태 → on 상태
- 일정 시간 입력이 없으면 on 상태 → 절전 상태
- 전원 스위치를 끄면 모든 상태에서 off 상태로 복귀

✅ 원시적 매칭과 오토마타 이용 매칭 비교(Comparison of Naive Matching and Automata-Based Matching)

원시적 매칭 vs. 오토마타 이용 매칭 (Naive Matching vs. Automata-Based Matching):
- 원시적 매칭은 패턴의 문맥을 고려하지 않아 불필요한 연산이 많다.
- 오토마타는 상태와 문맥을 활용하여 효율적으로 비교 연산을 줄이게 된다.

오토마타 이론 #1: 구성요소 (Components of Automata)

오토마타는 아래 5가지 구성 요소로 정의된다:

Q (상태 집합, Set of states): 시스템이 가질 수 있는 모든 상태의 집합.
q₀ (시작 상태, Initial state): 입력 처리를 시작하는 초기 상태.
A (목표 상태의 집합, Set of accept states): 입력을 받아들이는 종료 상태(들).
Σ (입력 알파벳, Input alphabet): 시스템이 처리할 수 있는 입력의 집합.
δ (상태 전이 함수, Transition function): 특정 상태에서 특정 입력을 받을 때 이동할 다음 상태를 정의하는 함수.

오토마타 이론 #2: 상태 전이 함수 다이어그램 표현(Example of Transition Function Diagram)

시작 상태 (State 0):
- 아무 문자도 처리하지 않은 초기 상태이다.
- 첫 번째 입력이 a라면 상태 1로 이동한다.
상태 1:
- 입력이 b라면 상태 2로 이동한다.
- 만약 다른 문자가 오면 상태 0으로 돌아간다.
- 이유: 패턴이 ababaca로 시작하므로 b가 와야 다음으로 진행된다.
상태 2 → 상태 6:
- 올바른 입력이 계속되면 상태가 차례로 증가하게 된다.
  예: 상태 2에서 a → 상태 3, 상태 3에서 b → 상태 4...
상태 7:
- 입력된 문자가 ababaca와 완전히 매칭되면 최종 상태 7에 도달한다.
- 이는 패턴을 성공적으로 찾았다는 뜻이다.
잘못된 입력 처리:
- 예: 상태 5에서 b가 오면 전 단계 상태 4로 되돌아간다.
- 이는 이미 매칭된 일부 정보를 재사용하여 불필요한 비교를 줄이기 위함이다.
- 원시적 매칭이었다면 처음부터 돌아가지만 오토마타는 중간 단계로 이동하기 때문에 비교연산을 줄일 수 있는 것이다.

상태 전이 함수는 위에서 언급된 예제처럼 다이어그램도 표시할 수 있고 이번 예제와 같이 테이블로도 구성할 수 있다.

오토마타 #2: 상태 전이 함수 테이블 구성 (Automata Transition Function Table Representation)

✅ 상태 전이 함수란? (What is Transition Function): 상태 전이 함수는 현재 상태에서 입력 문자를 기반으로 다음 상태를 결정한다.

예: 상태 0에서 입력이 a이면 상태 1로 이동.
이는 문자열 매칭 과정을 구조적으로 표현하는 방법이다.

✅ 테이블 구성 (Table Representation): 상태 전이 다이어그램을 테이블 형태로 변환한다.

행 (Row): 현재 상태.
열 (Column): 입력 문자.
값 (Value): 다음 상태.

왼쪽 기본 테이블 (Full Table): 모든 알파벳에 대해 상태를 정의한다.
압축된 테이블 (Compressed Table): 패턴에 등장하는 문자(a, b, c)만 고려하고, 나머지 문자(기타)는 한 열로 압축하여 테이블을 단순화하고 시간을 절약하였다.

💡중요포인트:

기존 방식: 모든 알파벳에 대해 상태를 정의 → 테이블이 복잡
개선 방식: 패턴에 등장하는 문자만 고려 → 테이블이 단순화되고 연산 효율 증가

오토마타 #3: 의사코드 표현 (Pseudocode in Automata)

오토마타를 활용해 문자열 매칭을 수행하는 과정을 의사코드로 확인해보자.

δ(delta) 오토마타에서 상태 전이 함수 (Transition Function)를 의미한다. 오토마타가 현재 상태에서 특정 입력을 받았을 때 어디로 이동해야 하는지를 결정하는 역할을 한다.

δ(delta) 그리스 문자의 이름으로, 수학과 컴퓨터 과학에서 함수나 변화(transition)를 나타낼 때 자주 사용한다.

FA-Matcher (A[], δ[][], f):
▷ f : 목표 상태 (Final state), 
▷ n : 텍스트 A의 길이, 
▷ m : 패턴 길이

q ← 0  ▷ q는 현재 상태를 나타냄

for i ← 1 to n:  ▷ 텍스트의 각 문자를 처리
    q ← δ(q, A[i])  ▷ 현재 상태와 입력 문자에 따라 상태 전이
    if (q = f):  ▷ 현재 상태가 최종 상태라면
        A[i - m + 1]에서 매칭이 발생했음을 알린다.

변수 정의 (Variables):
- q: **현재 상태 (Current state)**를 나타낸다. 시작 상태는 0이다.
- δ: **상태 전이 함수 (Transition function)**이다. 현재 상태와 입력 문자에 따라 다음 상태를 반환한다.
- f: **최종 상태 (Final state)**로, 패턴 매칭이 완료되었음을 나타낸다.
작동 과정 (How It Works):
- q ← 0: 시작 상태에서 시작한다.
- for 루프는 텍스트의 각 문자를 차례로 처리한다.
- q ← δ (q, A[i]): 현재 상태와 입력 문자 A[i]를 상태 전이 함수에 넣어 다음 상태로 이동한다.
- if (q = f): 현재 상태가 최종 상태에 도달하면 매칭이 발생했음을 출력한다.
  - 매칭이 발생한 위치는 i − m+1 이다.

오토마타 #4: 수행 시간 분석 (Time Complexity of String Matching Using Automata)

오토마타를 이용한 문자열 매칭의 수행 시간은 크게 두 부분으로 나눌 수 있다:

매칭 수행 시간: 텍스트의 길이에 비례.
오토마타 상태 전이 함수 준비 시간: 패턴과 알파벳 크기에 비례.

✅ 매칭 수행 시간 (Matching Time)

for 루프: 텍스트의 각 문자에 대해 한 번씩 상태 전이를 수행 → O(n)
상태 전이 함수 실행: 한 번의 전이가 O(1)이므로 효율적.
매칭 수행 시간은 텍스트의 길이 n에 선형적으로 비례합니다.

✅상태 전이 함수 준비 시간 (Setup Time for Transition Function)

입력 알파벳의 개수: ∣Σ∣ 사용 가능한 문자(예: 영어 소문자 26개).
패턴 길이: m
준비 시간: O(∣Σ∣m) 모든 문자와 패턴의 길이에 대해 전이 상태를 정의.

✅총 수행 시간: Θ(n+∣Σ∣m)

n: 텍스트의 길이

✅예제 수행 시간 계산 (Example Calculation)

입력 데이터:
- 텍스트 길이 n=1000
- 패턴 길이 m=10
- 알파벳 크기 ∣Σ∣=26
계산:
- 매칭 수행 시간: O(n) = O(1000)
- 상태 전이 함수 준비 시간: O(∣Σ∣m) = O(26⋅10) = O(260).
- 총 수행 시간: O(1000+260) = O(1260)

💡중요포인트:

오토마타 기반 매칭은 텍스트 길이에 선형적으로 비례하여 효율적이다.
상태 전이 함수 준비 시간이 추가로 필요하지만, 이는 매칭 단계의 효율성을 보장하는 투자이다.
입력 데이터의 특성(텍스트 길이, 패턴 길이, 알파벳 크기)에 따라 성능이 결정된다.

4️⃣라빈 - 카프 알고리즘(Rabin-Karp Algorithms)

문자열 매칭 알고리즘으로 라빈 카프 알고리즘에 대해서 배워본다. 문자열 패턴을 숫자로 변환하여 비교하는 효율적인 알고리즘이다.

라빈 카프 알고리즘(Rabin-Karp Algorithms): 가능한 문자들에 대해 숫자를 대응하고 패턴을 하나의 진수로 표현해서 만약 121 이라는 숫자가 있을 때, 문자로 하면 3개의 문자가 되듯이, 어떤 문자의 패턴이 있을때 그것을 각각의 문자로 할당해서 진수 표현하는 것이다. 이것을 “수치화 과정”이라고 한다.
수치화 (Numerical Transformation): 문자열의 각 문자를 숫자로 매핑하고, 해당 숫자를 특정 진수(base)로 표현하여 문자열을 숫자로 변환하는 과정
이 과정을 통해 빠르게 문자열의 패턴을 비교할 수 있게 된다.

✅수치화의 예 (Example of Numerical Transformation)

주어진 데이터: 알파벳 집합 Σ={a,b,c,d,e}, 크기 ∣Σ∣=5

매핑: a=0, b=1, c=2, d=3, e=4

진수로 변환:
- c = 2⋅5^2 : 2×25 = 50
- a = 0 ⋅ 5^1 : 0×5 = 0
- d = 3 ⋅ 5^0 : 3×1 = 3
합산:
- 50 + 0 + 3 = 53

💡라빈-카프 알고리즘의 중요포인트

문자열을 숫자로 변환하여, 숫자 비교로 문자열 패턴 매칭을 수행한다.
효율성: 숫자 비교는 문자열 비교보다 빠르며, 해시 값을 이용하여 중복 계산을 줄일 수 있게된다.

✅수치화 계산 방법 기본 개념 (Basic Numerical Calculation)

문자를 숫자로 변환 (Mapping Characters to Numbers): 문자열의 각 문자를 고유한 숫자로 매핑한다.

진수로 표현 (Representing in Base ddd): 문자열의 각 문자를 진수 기반으로 표현하여 수치화한다.

예) 10진수(기본 진수 d=10)를 사용하여 P[m] 다음과 같이 표현:
p=P[m] + 10⋅P[m−1] + 102⋅P[m−2] + ⋯ + 10m−1⋅P[1]
- P[1]: 가장 왼쪽 문자 → 가장 큰 자리수.
- P[m]: 가장 오른쪽 문자 → 가장 작은 자리수.

일반적인 계산 방식 (Basic Calculation): 텍스트에서 부분 문자열 A[i⋯i+m−1]를 수치화하는 계산:

시간 복잡도 O(m): 각 문자마다 10을 곱하면서 반복 계산.
전체 텍스트에 대해 비교: n개의 부분 문자열에 대해 계산 → 총 O(mn)
- 문제점: 원시적 매칭과 성능 차이가 없음.

✅수치화 계산의 개선 방법 (Optimization in Calculation)

점화식 사용 (Using Recursive Formula):

이전 계산값 ai−1을 재활용하여 다음 값 ai를 계산에 활용한다.
첫 번째 문자 영향 제거: −10m−1 ⋅ A[i−1]
새로운 문자 추가: +A[i + m−1]

💡🤔최적화 효과가 있을까? 그렇다.

각 부분 문자열 계산이 O(1)로 줄어듦.
전체 텍스트 비교 시간: O(n)
미리 계산된 10m−1는 반복 사용 가능.

💡중요 포인트:

수치화 개념 (Numerical Transformation): 문자열을 숫자로 변환해 효율적으로 비교한다.
기본 계산 방식의 한계 (Limitations of Basic Calculation): 시간 복잡도 O(mn)으로 원시적 매칭과 동일한 점
점화식을 통한 개선 (Optimization Using Recursive Formula): 이전 계산값을 활용하여 불필요한 연산 제거한다. O(n)시간 복잡도로 최적화하였다.
효율성 (Efficiency): 덧셈과 곱셈 각각 2회로 계산 가능 → 이뜻은 실행 속도가 대폭 증가되었다는 뜻이다.

✅라빈-카프 알고리즘 수치화를 이용한 매칭 예제 (Example of Matching Using Numerical Transformation in the Rabin-Karp Algorithm)

💡 요약 (Summary)

수치화 계산 방법 (Numerical Calculation):
- 문자열의 각 문자를 진수로 변환하여 수치값을 계산한다.
- P[]: 패턴 문자열 → 고유 수치값 계산.
- A[]: 텍스트의 부분 문자열 → 고유 수치값 계산 후 패턴과 비교.
점화식을 사용한 계산 (Using Recursive Formula):
- 이전 계산값을 활용해 현재 값을 효율적으로 계산한다.
- 수치값이 같으면 문자열 매칭 성공하였다는 뜻이다
효율성: 점화식을 통해 한 칸씩 이동하며 효율적으로 비교할 수 있다.

❤️ P[] 패턴 문자열의 수치화 (Numerical Transformation of Pattern): eeaab

각 문자에 고유한 값을 부여한다. e=4, a=0, b=1
e=4, 5^4=625, a = 0, b = 1 결과: p=300

❤️A[] 텍스트 문자열의 수치화 (Numerical Transformation of Text): acebb

처음 5개의 문자를 가져와 수치 계산.
결과: a1 = 356
매칭 실패 (Does not match p = 3001)

❤️문자열의 수치화 대신 점화식을 사용한 효율적 계산 (Efficient Calculation Using Recursive Formula)

ai−1: 이전 계산값.
5m−1⋅A[i−1]: 빠지는 첫 번째 문자.
A[i+m−1]: 추가되는 마지막 문자.

❤️ 계산 과정 (Calculation Process)

a2: cebbc

결과: a2 = 1782
매칭 실패 (Does not match p=3001).

a3: ebbce

결과: a3 = 2664
매칭 실패 (Does not match p=3001).

a7: eeaab

결과: a7 = 3001
매칭 성공 (Matches p=3001).

💡중요한 포인트 (Key Points)

수치화의 효율성 (Efficiency of Numerical Transformation):
- 점화식을 통해 이전 계산값을 재활용한다.
- 계산 복잡도를 O(mn)에서 O(n)으로 줄였다.
정확성 (Accuracy):
- 동일한 문자열은 동일한 수치값을 가진다.
- 수치값이 일치하면 문자열 매칭 성공하게 된다.
효율적 매칭 (Efficient Matching):
- 한 칸씩 이동하며 계산 → 빠르고 정확한 비교가 가능하다.

💡왜 수치화를 하는 걸까? (Why Use Numerical Transformation?)

문자열 비교는 한 문자씩 순서대로 확인해야 하므로 시간이 오래 걸리게 된다.
라빈-카프는 문자열을 숫자로 변환하고 숫자끼리 비교하므로 더 빠르게 매칭 여부를 확인할 수 있다.

✅라빈-카프 알고리즘 수치화를 이용한 의사코드 표현 (Pseudocode for Rabin-Karp Algorithm Using Numerical Transformation)

💡요약: 아래 예제 코드는 **패턴 문자열 P[]**를 텍스트 문자열 A[]에서 찾아내는 과정을 보여준다. 문자열을 숫자로 변환(수치화)하고, 이 숫자값을 비교하여 매칭 여부를 확인한다.

1. 초기화 (Initialization)

p ← 0; a₁ ← 0

p: 패턴 문자열 P[]의 수치값을 저장하는 변수이다.
a₁: 텍스트의 첫 번째 부분 문자열의 수치값을 저장하는 변수이다.

2. 패턴과 첫 번째 부분 문자열의 수치값 계산

for i ← 1 to m:
    p ← d * p + P[i]
    a₁ ← d * a₁ + A[i]

i: 현재 패턴과 텍스트의 문자를 계산하는 위치.
패턴 수치값 계산:
- p = d⋅p+P[i] : 기존 값에 d를 곱하고 현재 문자의 값을 더한다.
- 예: eeaab → p = 4⋅5^4 + 4⋅5^3 + 0⋅5^2 + 0⋅5^1 + 1⋅5^0 = 3001
첫 번째 텍스트 부분 문자열 수치값 계산: a1 = d⋅a1+A[i]
- 예: acebb → a1 = 0⋅5^4 + 2⋅5^3 + 4⋅5^2 + 1⋅5^1 + 1⋅5^0 = 356

3. 나머지 부분 문자열의 수치값 계산 (점화식 사용)

for i ← 1 to n - m + 1:
    if (i ≠ 1):
        aᵢ ← d * (aᵢ₋₁ - dᵐ⁻¹ * A[i-1]) + A[i+m-1]

i≠1: 두 번째 부분 문자열부터 계산한다.
점화식 설명:
- aᵢ: 새로운 부분 문자열의 수치값.
- aᵢ₋₁: 이전 부분 문자열의 수치값.
- dm−1: 패턴 길이에 따라 계산된 상수.
- A[i−1]: 빠지는 문자.
- A[i+m−1]: 추가되는 문자.

예제 계산:

a1 = 356
a2 = 5⋅(a1−0⋅625)+2 = 1782
a3 = 5⋅(a2−2⋅625)+4 = 2664

4. 매칭 여부 확인

if (p = aᵢ): A[i] 자리에서 매칭이 되었음을 알린다.

패턴의 수치값 p와 현재 부분 문자열의 수치값 aᵢ를 비교.
값이 같다면 패턴이 텍스트의 해당 위치에서 매칭됨을 의미.

예제:

패턴 p=3001
a₇ = 3001 → 매칭 성공

💡중요포인트

수치화: 문자열을 숫자로 변환하여 비교한다.
점화식: 이전 계산값을 재활용해 효율적 계산을 한다.
매칭 성공 조건: 패턴 수치값 p와 부분 문자열 수치값 aᵢ가 같으면 매칭이 성공된다.
효율성: 반복 계산을 줄이고 O(n) 시간 안에 패턴 매칭을 완료한다.

✅라빈-카프 알고리즘 수치화를 이용한 매칭의 문제점

💡요약: 문자열 패턴과 텍스트의 수치화 과정에서 계산 결과가 너무 커져 **오버플로우 (Overflow)**가 발생할 수 있다. 이를 해결하기 위해 **모듈러 연산 (Modulo operation)**을 사용하여 값을 제한한다.

라빈-카프 알고리즘 (Rabin-Karp Algorithm):
- 문자열 패턴과 텍스트를 **해시값 (Hash value)**으로 변환한 뒤, 빠르게 비교한다.
- 모듈러 연산을 적용하여 효율적으로 수치화된 값 비교 수행할 수 있게 된다.
장점: 빠른 매칭 가능하고 모듈러 연산을 통해 값의 크기를 제한하여 오버플로우 방지한다.

❤️ 문제점과 해결책 (Challenges and Solutions)

문제점:
- 문자 집합 Ω와 패턴 길이 m에 따라 ai가 매우 커질 수 있음.
  예: a_i = d(a_{i-1} - d^{m-1}A[i-1]) + A[i+m-1]
- 너무 큰 숫자는 레지스터의 크기를 초과해 **오버플로우 (Overflow)**가 발생한다.

해결책: **모듈러 연산 (Modulo operation)**을 적용하여 값을 제한한다.
- 식 변경: bi = (d(bi−1−(dm−1 modq) ⋅ A[i−1]) + A[i+m−1])
- q 값: 충분히 큰 소수를 선택해 충돌을 최소화한다.
- 이 방법은 해시테이블의 해시값 계산과 유사하다.

❤️ 라빈-카프 알고리즘의 작동 원리 (How Rabin-Karp Works)

패턴 해시값 계산 (Hash Calculation for Pattern):
- P[]=e,e,a,a,b
- 계산: p = (4⋅5^4+4⋅5^3+0⋅5^2+0⋅5^1+1) mod 113 = 63
원문 해시값 계산 (Hash Calculation for Text):
- 원문 A[]: a, c, e, b, b, c, e, e, a, b, c, e, e, d, b…
- 처음 5개의 해시값 a1 = 17
- 슬라이딩 윈도우 방식으로 다음 해시값 계산: a2 = 87

매칭 과정 (Matching Process): 패턴 해시값 p=63 과 원문 해시값 ai를 비교하여 해시값이 같으면 실제 문자열을 확인하여 매칭 여부 판단한다.
결과: a7 = 63에서 매칭 성공하였다.

💡중요포인트: 라빈-카프 알고리즘은 빠르고 효율적인 문자열 매칭 알고리즘이다.

문제: 문자열 수치화 값이 커져 오버플로우 발생 가능.
해결: 모듈러 연산으로 값 크기를 제한하여 문제 해결.
라빈-카프 알고리즘 작동 원리:
- 패턴과 텍스트를 해시값으로 변환해 비교.
- 해시값이 같으면 매칭 성공(문자열도 추가 확인).
효율성: 슬라이딩 윈도우 방식으로 이전 계산 재활용 → 빠른 연산.
결과: 패턴 해시값 p와 텍스트 해시값 ai가 동일한 경우 매칭 확인하게 된다.

❤️ 라빈-카프 알고리즘 의사코드 요약 (Rabin-Karp Algorithm Pseudocode Summary)

모듈러 연산 사용 (Use of Modulo Operation):
- 해시값 계산 중 값이 커지는 것을 방지하기 위해 모든 계산에 모듈러 연산이 적용되었다.
- p, bi, h 계산 시 mod q를 사용한다.
시간복잡도 (Time Complexity):
- 전체 비교에 O(n + km), 평균적으로 O(n)의 복잡도를 갖는다.
작동 원리 (How it Works):
- 패턴 P의 해시값 p와 텍스트 A의 해시값 bi를 계산한다.
- 해시값이 일치하면, 실제 문자열도 비교하여 매칭 여부 확인하게 된다.

**💡중요포인트: 모듈러 연산 (Modulo Operation)**을 통해 수치화 과정에서 값이 너무 커지지 않도록 계산마다 사용하여 오버플로우를 방지한다. 패턴 해시값 p와 텍스트 해시값 bi를 비교하여 매칭 여부를 빠르게 판단하고 슬라이딩 윈도우를 사용하여 효율적으로 계산한다. 문자열 매칭을 빠르게 수행이 가능하고 큰 데이터를 처리할때 효율적이다. 또한 해시값 충돌 가능성 때문에 해시값이 같을 경우 문자열을 추가로 비교하여 정확성을 보장한다.

5️⃣KMP 알고리즘과 실패 함수 (KMP Algorithm and Failure Function)

💡 요약 (Summary)

KMP 알고리즘의 핵심 (Core Idea of KMP):
- 매칭 실패 시 처음으로 돌아가지 않고, 중간 지점으로 복귀하여 이전 비교 결과를 활용한다. 이를 통해 비교 연산 횟수가 감소하게 된다.
실패 함수 (Failure Function):
- 매칭 실패 시 돌아갈 중간 지점을 미리 계산하고 저장하는 함수이다.
- 패턴의 중복 정보를 활용하여 불필요한 연산을 줄일 수 있다.
KMP vs. 오토마타: 오토마타와 유사한 방식으로 동작하지만, 준비 작업이 더 단순하고 빠르다는 장점이 있다. (오토마타는 상태함수 준비시 좀 복잡하다)

✅ KMP 알고리즘 vs. 오토마타 매칭 비교 (KMP Algorithm vs. Automata Matching)

원시적 매칭과의 차이 (Compared to Naive Matching):
- 원시적 매칭: 매칭 실패 시 처음으로 돌아가 다시 비교.
- 오토마타와 KMP: 실패 시 중간 상태로 이동해 이전 결과를 재활용.
KMP와 오토마타의 공통점 (Similarities):
- 둘 다 문맥 활용: 매칭 실패 시 중간 상태로 이동해 불필요한 연산 제거.
KMP와 오토마타의 차이점 (Differences):
- 오토마타: 상태 전이 함수를 상태 다이어그램으로 정의. 준비 과정이 복잡하지만 체계적이다.
- KMP: 실패 함수를 이용해 중간 지점을 계산한다. 준비 과정이 단순하고 빠르다.

✅KMP 알고리즘 작동 원리 (How KMP Works)

패턴 매칭:
- 텍스트와 패턴을 비교하면서 실패 시 중간 지점으로 이동한다.
- 실패 함수 π[i]를 사용하여 복귀 지점을 결정한다.
실패 함수의 역할 (Role of Failure Function):
- 매칭 실패 시 처음으로 돌아가지 않고, 이미 매칭된 부분을 활용해 비교를 이어간다.
- 예: π[8] = 4 → 8번째 자리에서 실패하면 4번째 자리로 복귀하게 된다.
실패 함수 계산 (Failure Function Calculation):
- 패턴 내에서 부분 문자열의 중복 정보를 활용하여 재활용한다.
- 패턴의 각 위치에서 실패 시 복귀할 위치를 저장한다.

✅예제 분석 (Example Analysis)

매칭 실패 예시: 텍스트: A[], 패턴: P[] = abcdabcwz

텍스트에서 abcdabcd 까지 매칭 후 d≠w에서 실패가 발생하였다.
실패 함수로 인해 중복된 abc를 살리고, d부터 비교를 재개한다.

실패 함수 예시: P [1:9] = abcdabcwz

π[8]=4: 8번째 자리에서 실패 시 4번째 자리 d로 이동한다.
결과적으로 4칸 점프하여 불필요한 연산 제거하게 된다.

✅KMP의 장점 (Advantages)

효율적 비교: 실패 함수로 중복 비교를 줄여 평균적으로 O(n+m)시간 복잡도를 가진다.
중복 활용: 패턴 내부 중복을 활용해 이전 결과를 재활용한다.
실제 응용: 텍스트 검색, 네트워크 패턴 매칭, 데이터 분석 등에 사용된다.

✅ KMP 알고리즘과 실패 함수 준비 의사코드 (KMP Algorithm and Failure Function Pseudocode)

💡 요약 (Summary)

실패 함수 준비 (Failure Function Preparation):
- 목적: 패턴 내 중복 정보를 활용해 매칭 실패 시 돌아갈 위치를 미리 계산.
- 시간복잡도: Θ(m), 패턴의 길이 m에 비례.
KMP 알고리즘 (KMP Algorithm):
- 매칭 실패 시 실패 함수 π[j]를 이용해 중간 지점으로 점프.
- 원문의 각 문자에 대해 비교 진행 → Θ(n) 시간 소요.
- 전체 시간복잡도: Θ(n+m)

k: 패턴의 접두사 길이를 나타냄.
π[j]: 매칭 실패 시 복귀 위치를 저장.
if (k = 0 or A[j] = P[k]) j++, k++; k = 0이거나 원문의 값이 A[j] = P[k]패턴의 값과 일치하면 계속 증가하다가 π[j] <- k 실패함수 값을 업데이트해준다.

else j <-π[j] 매칭 실패 시 j를 π[j]로 업데이트하여 중간 지점으로 복귀한다.
매칭 성공 시, 매칭된 위치를 출력하고 다시 π[j]를 이용해 다음 비교 시작.
먼저 패턴에 대한 실패 함수를 준비한다. preprocessing(p) 그다음엔 while문을 돌며 실패가 일어났을 경우에 해당 지점으로 점프를 해나가게 된다. else j <-π[j] 그러다가 전체적으로 매칭이 일어나면 매칭이 되고 처음으로 다시 돌아가게된다. j <-π[j]

✅시간복잡도 분석 (Time Complexity Analysis)

실패 함수 준비: Θ(m), 패턴의 길이에 비례
KMP 알고리즘: Θ(n), 원문의 길이에 비례.
전체 시간복잡도: Θ(n + m), 효율적 문자열 매칭 가능.
실패 함수를 준비하는데 패턴의 길이만큼의 시간이 필요해서 preprocessing() = Θ(m)이고 while 루프는 원문의 문자를 하나씩 보기때문에 Θ(n)이다.

6️⃣보이어-무어 알고리즘(Boyer-Moore Algorithm)

기존의 라빈 카프 알고리즘이나 kmp알고리즘은 텍스트 문자열을 처음부터 적어도 1번씩은 검색하기 때문에 세타n의 수행시간이 필요한다. 텍스트를 왼쪽에서 오른쪽으로 비교하기때문에 최선의 경우에도 n만큼의 시간이 필요하다. 보이어 무어는 생각의 전환을 하였다. 텍스트 문자열을 전부 보지 않고 점프를 하며 일부만 보는것이다 또한 패턴을 왼쪽이 아닌 오른쪽부터 비교하게 된다. 긴문자열의 패턴이 5개라면 5번째부터 본다. 매칭이 되지 않는다면 앞에있는 4개는 보지 않고 뛰어넘게된다. 이렇게 반복하는 방식이다. 실무적으로 높은 성능을 보이는 알고리즘으로 많이 사용되고 있다.

✅ 보이어-무어 알고리즘 작동 원리 (How it Works)

비교 방향: 전체 텍스르로 보면 앞으로 진행하지만 패턴으로 보면 반대방향으로 진행한다.
- 패턴: 오른쪽에서 왼쪽으로 비교.
- 텍스트: 여전히 왼쪽에서 오른쪽으로 진행.
불일치 처리:
- KMP알고리즘이 실패함수를 사용하는 것처럼 보이어무어 함수도 불일치 시 “점프 테이블”을 참고해 특정 칸 수를 건너뛴다.
- 검색하는 텍스트보다 패턴이 훨씬 짧고 여러번 찾을 때 유리하다.
- 실무적으로 문자열 매칭 알고리즘을 사용할때 보이어 무어 알고리즘이 벤치 마크 표준으로 사용될 정도로 많이 사용되고 있다.

✅ 예제 분석 (Example Analysis)

텍스트: A[] = ...btiger.., 패턴: P[] = tiger
비교 과정:
- 마지막 문자 r부터 비교 → 불일치 (b ≠ r).
- 점프 테이블에 따라 5칸 점프 → 다음 비교 시작.

텍스트: A[]=...tigertiger... 패턴: P[]="tiger"
비교 과정:
- i ≠ r에서 불일치 발생.
- 패턴에서 i가 세 번째 위치에 있으므로 3칸 점프.
- 앞의 2개 문자는 재활용하여 비교 연산 감소.

✅점프 테이블 1 (Jump Table in Boyer-Moore Algorithm)

💡요약 (Summary)

점프 테이블이란? (What is a Jump Table?)
- 패턴의 각 문자에 대해 매칭 실패 시 몇 칸 점프할지를 정의한 테이블이다.
- 패턴의 오른쪽에서 왼쪽으로 비교하기 때문에 점프 테이블도 반대 방향으로 정의한다.
점프 테이블의 목적 (Purpose of Jump Table):
- 매칭 실패 시 효율적으로 불필요한 비교를 건너뛰기 위해 사용한다.
- 특정 문자가 패턴에 없는 경우 패턴 길이만큼 점프한다.

"heyhibyez"에서 "bye"를 매칭.
1. y ≠ e: 점프 테이블에 따라 1칸 점프.
2. h ≠ e: 기타 문자 → 3칸 점프.
3. y ≠ e: 다시 1칸 점프.
4. 마지막 문자 e일치 → 매칭 성공.
문자별 이동 거리 계산:
- 패턴의 각 문자에 대해 끝에서의 거리를 기준으로 이동 거리를 설정.
- 패턴에 없는 문자 → 패턴의 길이만큼 점프.
- 동일한 문자가 여러 번 나타나면 가장 작은 이동 거리 선택.

마지막으로 b, y, e 순서로 비교 → 모두 매칭 → 매칭 성공.

✅점프 테이블 2 (Jump Table in Boyer-Moore Algorithm)

❤️ 패턴: tiger의 점프 테이블

오른쪽 끝 문자 기준으로 이동 거리 계산:
- t: 마지막에서 4번째 → 이동 거리 = 4
- i: 마지막에서 3번째 → 이동 거리 = 3
- g: 마지막에서 2번째 → 이동 거리 = 2
- e: 마지막에서 1번째 → 이동 거리 = 1
- r: 마지막에서 0번째 → 이동 거리 = 0
- 기타 문자: 패턴 길이(5)만큼 점프

❤️패턴: `rational`의 점프 테이블

같은 문자가 여러 번 나타나는 경우:
- 가장 오른쪽의 위치를 기준으로 이동 거리를 계산하지만, 가장 작은 이동 거리를 선택.
- 예: a는 1번째와 6번째에 나타남 → 이동 거리 = 1 (6번째 기준).
오른쪽 끝 문자 기준으로 이동 거리 계산:
- r: 마지막에서 7번째 → 이동 거리 = 7
- a: 마지막에서 6번째 → 이동 거리 = 1 (작은 값 선택)
- t: 마지막에서 5번째 → 이동 거리 = 5
- i: 마지막에서 4번째 → 이동 거리 = 4
- o: 마지막에서 3번째 → 이동 거리 = 3
- n: 마지막에서 2번째 → 이동 거리 = 2
- l: 마지막에서 1번째 → 이동 거리 = 0
- 기타 문자: 패턴 길이(8)만큼 점프.

✅원시적 매칭 vs. 개선된 매칭 의사코드 비교 (Naive Matching vs. Improved Matching Pseudocode)

원시적 매칭 의사코드와 비교하면 크게 다른점은 없다. 하나 다른 점은 computeJump(P, jump) 점프하는 테이블을 준비하는 것과 실제 점프시 점프테이블을 이용해 점프한다는 점이다. i<- i + jump[A[i+m-1]]

공통점 (Similarities):
- 텍스트와 패턴을 비교하며 매칭 여부 확인.
- while 루프를 통해 텍스트를 순차적으로 탐색.
차이점 (Differences):
- 원시적 매칭 (Naive Matching):
  - 매칭 실패 시 무조건 한 칸씩 이동.
- 개선된 매칭 (Improved Matching):
  - 점프 테이블을 사용하여 불일치 시 여러 칸 점프.
  - 점프 테이블은 computeJump(P,jump)를 통해 미리 생성.

✅보이어-무어-호스풀 알고리즘 (Boyer-Moore-Horspool Algorithm)

개선된 매칭 알고리즘의 확장된 형태를 보이어-무어-호스풀 알고리즘이라고 한다. 개선된 매칭은 단순한 점프 방식을 따르고, 보이어-무어-호스풀은 점프 전략이 더 정교하다. 따라서 두 알고리즘은 유사하지만, 비교 방식과 점프 전략에서 차이가 있다.

computejum(P.jump) 점프 테이블을 만들고 그 자리에서 매칭이 발견되지 않았을때는 점프 테이블에 저장된 위치만큼 점프하며 문자열매칭을 수행하게 된다.

✅ 수행시간 분석

최악의 경우: Θ(mn)
- 텍스트 전체가 동일한 문자로 구성된 경우, 한 칸씩만 점프 → 원시적 매칭과 동일.
- 이 경우가 실제적인 상황은 거의 없고 실무적으로 보았을땐 매우 빠르게 동작한다.
일반적인 경우:
- 실제 텍스트와 패턴이 다양한 경우에는 불필요한 비교를 줄이므로, 평균적으로 Θ(n)보다 빠름.
최선의 경우: Θ(n/m)
- 긴 텍스트에서 패턴이 나타나지 않는 경우, 패턴 길이만큼 점프.
- 예: 패턴 길이가 20인 경우, 한 번의 비교 후 20칸 점프 가능.

Advanced Study of Greedy Algorithms and Matroids

Heesu Noh — Sat, 07 Dec 2024 14:45:30 GMT

Contents

1️⃣그리디 알고리즘(탐욕법)의 특징 (Overview of Greedy Algorithms)
2️⃣그리디 알고리즘의 예 (Example of Greedy Algorithms)
3️⃣그리디 알고리즘의 최적해 조건 (Conditions for Optimal Solution with Greedy Algorithm)
4️⃣ 그리디 알고리즘 문제 (Example of Greedy Algorithms)
5️⃣매트로이드 (Matroid)
6️⃣매트로이드의 확장(Matroid Expansion)
7️⃣ 문제 공간 탐색 (Problem Space Exploration)

오늘은 그리디 알고리즘에 대해서 배워볼 예정이다. 그리디 알고리즘의 특징에 대해서 먼저 알아본다. 그리디는 “탐욕”이라는 뜻으로 빠른시간에 높은 성능을 보이도록 하기 위해 이것저것 따지지 않고 그대로 현재 시점에서 가장 좋은 옵션을 찾아 나가는 방식이다. "현재 시점" 만 보기떄문에 시야가 좁은 알고리즘이다. 대부분 "최적해"와는 거리가 있다. 하지만 드물게 최적해가 보장되는 경우도 있다. 그 예로는 프림, 크루스칼알고리즘이 있다. 다익스트라 알고리즘도 마찬가지로 최소의 비용을 찾아가는 과정에서 최적해를 보장한다.

1️⃣그리디 알고리즘(탐욕법)의 특징(Overview of Greedy Algorithms)

✅ 그리디 알고리즘의 정의와 특징

그리디 (Greedy)는 탐욕이라는 뜻으로, 매 순간 가장 좋아 보이는 옵션을 선택하는 방식이다.
이 알고리즘은 현재 시점 (current state)만 고려하며, 시야가 좁다 (limited view)는 특징을 가진다.
따라서 대부분의 경우 전체 최적해 (global optimum)를 놓칠 가능성이 높다.
하지만 특정 상황에서는 최적해를 보장할 수 있는데, 그리디 알고리즘의 빠른 성능 (high performance)이 강점으로 작용한다.
어떤 특정 상황인지는 아래 장단점에서 언급하였다.

✅작동 방식: 그리디 알고리즘은 다음과 같이 작동한다.

  do {
      우선 가장 좋아보이는 선택을 한다.
  } until (해 구성 완료)

현재 시점 (current state)에서 반복적으로 가장 좋은 선택을 수행한다.
모든 선택이 완료되면 알고리즘이 종료된다.

✅장점과 단점

장점 (Advantages):
- 빠른 결과를 도출할 수 있다.
- 특정 문제(예: 프림 알고리즘 (Prim Algorithm), 크루스칼 알고리즘 (Kruskal Algorithm), 다익스트라 알고리즘 (Dijkstra Algorithm))에서 최적해를 보장 (guarantee optimal solution)할 수 있다. 성능도 빠르고 최적해를 찾을 수 있기 때문에 위 알고리즘들이 좋은 알고리즘으로 평가되는 이유이다.
단점 (Disadvantages):
- 그래프로 표현시, 전체적인 시야 (global perspective)를 보지 못해서 중간의 작은 봉우리 (local peak)나 골 (valley)에서 이를 최적해로 인식하기 때문에 전체 최적해를 놓치게 된다.
- 예를 들어, 그래프에서 전체 최적해 대신 국소적인 최적해를 선택할 위험이 있다.

✅ 실제 예시

프림 알고리즘 (Prim Algorithm)과 크루스칼 알고리즘 (Kruskal Algorithm): 최소 비용 신장 트리(MST, Minimum Spanning Tree) 를 찾는 과정에서 그리디 방식을 사용하며 최적해를 보장한다.
다익스트라 알고리즘 (Dijkstra Algorithm): 그래프에서 최소 비용 경로(minimum cost path)를 찾는 문제에서 사용되며, 최적해를 제공한다.

✅그리디 알고리즘의 국소적 최적화 (Greedy Algorithm - Local Optimization)

💡정리: 그리디 알고리즘은 매 단계에서 가장 좋아 보이는 선택을 하지만, 국소적 최소값을 선택하여 전체 최적 해(전체 최소값)를 보장하지 못할 수 있다. 단, 문제가 전역 최적 해와 국소 최적 해가 동일한 구조(예: 포물선 형태)라면 효과적다.

그래프는 여러 봉우리(peak)와 골짜기(valley)를 가진 곡선을 보여주며, 이는 가능한 해(solution)를 나타낸다.
- 전체 최대값 (Global Maximum): 그래프에서 가장 높은 지점.
- 국소 최대값 (Local Maximum): 주변 지점들보다 높지만 전체에서 가장 높지는 않은 봉우리.
- 전체 최소값 (Global Minimum): 그래프에서 가장 낮은 지점.
- 국소 최소값 (Local Minimum): 주변 지점들보다 낮지만 전체에서 가장 낮지는 않은 골짜기. 그리디 알고리즘은 이 단계에서 최적이라고 판단되는 선택을 하기 때문에 전체 최적해를 놓치는 경우가 많다.
그래프의 맨 아래에 있는 전체 최소값 (global minimum)가 전체 최적해이다.
전체 최적해(Global Optimal Solution)란, 문제의 모든 가능한 해 중에서 가장 최적(최대 또는 최소)의 값을 가지는 해를 의미한다.
최적해 (optimal solution)를 보장하려면 전체 해 공간이 단순해야 하며, 포물선 형태 (parabolic space)인 경우 그리디 알고리즘이 올바른 최적화를 보장하 게 된다.
혹은 골 (valley)이 하나뿐이거나, 해 공간이 단일 봉우리 (single peak)로 이루어진 문제는 그리디 알고리즘으로 해결 가능하다.

💡 중요한 포인트

빠른 성능 (High performance): 이것저것 따지지 않고 빠르게 해를 구할 수 있음.
시야 제한 (Limited view): 전체 최적해를 놓칠 가능성이 큼.
최적해 보장 조건 (Conditions for optimality): 해 공간이 단순한 구조일 때 가능.

그리디 알고리즘 #1: 과정 (Steps in Greedy Algorithm)

💡핵심은 현재 순간의 최적 해를 선택하고, 이를 점진적으로 전체 해로 확장하는 방식이다.

1. 해 선택 (Select Solution)

현재 시점에서 가장 최선이라고 판단되는 해를 선택한다.
이는 위에서 언급된 국소 최적화(Local Optimization) 접근 방식으로, 문제 해결의 첫 번째 단계이다.
예를 들어, 다익스트라 알고리즘에서는 출발점에서 각 지점까지의 최소 비용을 계산하며 가장 작은 비용을 가진 노드를 선택하게 되는데 현재 시점에서 가장 최선이라고 판단하여 선택하게 되었다.

2. 적절성 검사 (Feasibility Check)

선택한 해가 전체 문제의 제약 조건(constrains)에 맞는지를 검사하게 된다.
제약 조건을 통과하면 다음 단계로 진행하는데, 만약 제약 조건에 맞지 않으면 선택한 해를 제외하거나 다시 수정해야 한다.

3. 해 검사 (Solution Validation)

현재까지 선택한 해가 전체 해의 일부가 되는지 확인한다.
조건에 맞으면 해를 계속 추가하여 다음 단계로 진행한다.
만약 맞지 않다면 (조건 불충족) 다시 이전 단계로 돌아가 반복하게 된다.
예:
- 다익스트라 알고리즘 (Dijkstra Algorithm): 출발점에서 특정 지점까지의 비용을 계산하며, 새로운 노드를 선택하고 비용을 업데이트하는 경우
- 크루스칼 알고리즘 (Kruskal Algorithm): 최소 비용 간선을 선택하며, 사이클을 생성하지 않는지 확인 후 간선을 추가한다.
- 프림 알고리즘 (Prim Algorithm): 현재까지 선택된 정점 집합에서 최소 비용으로 연결되는 간선을 추가한다.

4. 종료 (Termination)

모든 정점이나 노드를 포함하면 알고리즘이 종료된다. 그 전까지는 새로운 해를 선택하고 적절성을 검사하며 최종 해를 찾는다.
최종 해 (final solution)를 출력한다.
예:
- 다익스트라 알고리즘: 모든 노드에 대한 최소 비용이 계산되면 종료된다.
- 크루스칼/프림 알고리즘: 모든 정점이 연결된 최소 신장 트리 (minimum spanning tree)가 완성되면 종료된다.

핵심 포인트 (Key Points)

해 선택 (Solution Selection): 현재 시점에서 가장 좋은 옵션을 선택한다.
적절성 검사 (Feasibility): 제약 조건에 맞는지 확인한다.
해 추가 (Solution Expansion): 조건을 만족하면 해를 계속 확장한다.
종료 (Termination): 모든 요소가 포함되면 전체 해를 출력하고 종료하게 된다.

✅ 그리디 알고리즘의 전형적인 구조 의사코드 (Typical Structure of Greedy Algorithm)

초기화 (Initialization):
- S ← ∅ 해집합 S를 공집합으로 설정한다. C는 선택 가능한 원소들의 집합이.
- S는 선택된 원소들의 집합으로, 초기에는 비어 있게된다(공집합 상태)
반복 조건(while statement condition): 알고리즘은 다음 두 조건을 만족할 때까지 반복하게 된다.
- C ≠ ∅ (원소를 선택할 수 있는 집합이 비어 있지 않음).
- S가 아직 온전한 해(완전한 솔루션)가 아님.
현재 최선의 원소 선택 (Element Selection):
- x ← C에서 원소 하나 선택: C에서 현재 시점에서 가장 좋은 해 (best solution)로 판단되는 x를 선택한다.
- 이 선택은 현재 시점에서 가장 유리한 선택을 의미한다.
- 예: 최소 비용 노드, 최대 이익 간선 등.
원소 제거 (Remove element): C ← C - {x}
- 현재 시점에서 가장 좋은 해 (best solution)로 판단되는 x를 집합 C에서 제거한다.
조건 검사 및 해 집합 추가: S ← S ∪ {x}
- 만약 S에 x를 더해도 문제가 없으면, x를 해 집합 S에 추가한다.
검사와 종료 (Validation and Termination):
- S가 온전한 해(전체 문제의 해결)를 구성하면 S를 반환한다.
- C에 더 이상 선택할 원소가 없거나 반복문이 끝났음에도 온전한 해를 찾지 못하면 "해 없음!"을 반환하게 된다.

💡중요한 부분 (Key Highlights)

현재 시점에서 최선의 선택 (Best Choice at Current State) a.k.a 탐욕적 선택 (Greedy Choice)
- 핵심은 반복문 안에서 x를 선택할 때, 현재 시점에서 가장 최선의 옵션을 고르는 것
집합 이동의 의미 (Transition Between Sets):
- C는 아직 방문하지 않은 원소 집합, S는 이미 선택된 원소 집합으로 간주된다.
- 선택한 원소는 C에서 S로 옮겨진다.
반복 (Iteration):
- 이 과정을 계속 반복하며 S가 전체 해가 될 때까지 진행한다.

2️⃣그리디 알고리즘의 예

이제 그리디 알고리즘의 최적해를 보장할때 사용하는 프림 알고리즘의 작동원리에 대해 알아본다.

그리디 알고리즘의 예 #1 : 최적해 보장하는 프림 알고리즘 (Example of Optimal Solution in Greedy Algorithm - Prim Algorithm)

✅프림 알고리즘의 특징 (Characteristics of Prim Algorithm)

탐색 범위 제한 (Limited Search Scope):
- 현재 연결된 노드와 인접한 간선만 탐색한다.
- 그래프가 크거나 복잡해도 효율적으로 작동한다.
- 이렇게 함으로써 최적해를 구성하는 선택이 각 단계에서 국소적으로도 최적임을 보장한다.
- 즉, 부분적으로 최적이면 점진적으로 전체 최적해를 구성할 수 있다는 뜻이다.
가중치 기준 선택 (Weight-Based Selection):
- 각 단계에서 가장 낮은 가중치를 가진 간선을 선택하여 진행한다.
최적해 보장 (Guarantee of Optimal Solution):
- 모든 노드를 연결할 때, 최소 가중치로 연결된 신장 트리를 생성하게 된다.

시작 노드: 0번 노드
연결된 노드와 가중치: 8번(8), 9번(9), 11번(11)
선택된 노드: 가장 낮은 가중치를 가진 8번 노드
다음 단계: 8번 노드가 현재 시점이 되며, 인접 노드의 가중치를 업데이트.

1. 초기화 (Initialization):

시작 노드(예: 0번 노드)에서 출발한다.
모든 노드의 가중치를 무한대(∞)로 초기화하였다.
시작 노드와 연결된 노드들의 가중치를 확인한다.

2. 해 선택 (Selection of Solution):

시작 노드(0번)와 연결된 8, 9, 11번 노드의 가중치를 탐색한다.
가중치: 8번 노드: 8, 9번 노드: 9, 11번 노드: 11
이 중에서 가장 낮은 가중치를 가진 8번 노드를 선택한다.

3. 반복 (Iteration):

선택된 8번 노드가 현재 시점 (current state)이 된다.
8번 노드에서 연결된 다른 노드를 탐색한다.
기존에 무한대(∞)로 초기화된 가중치 값을 업데이트하게 된다.
- 8번 노드에서 연결된 노드의 가중치가 10으로 변경된다.

4. 종료 조건 (Termination):

모든 노드가 선택되어 연결되면 알고리즘이 종료된다.
결과는 최소 신장 트리 (Minimum Spanning Tree)가 된다.

그리디 알고리즘의 예 #2 : 프림 알고리즘 의사코드 (Example of Optimal Solution in Greedy Algorithm - Prim Algorithm)

💡요약: 프림 알고리즘은 탐욕적 접근 방식을 기반으로, 그래프에서 최소 신장 트리(MST, Minimum Spanning Tree)를 점진적으로 구성한다. 이 의사코드는 정점 중심(
vertex center) 으로 작동하며, 각 단계에서 비용이 가장 작은 정점을 선택하고, 이를 통해 최적의 MST를 보한다. ExtractMin 연산과 비용 갱신이 알고리즘의 핵심이다.

입력값 정의(Input definition):
- G = (V, E): 정점 집합 V와 간선 집합 E로 이루어진 그래프.
- r: 시작 정점(알고리즘이 시작되는 지점).
- 정점(node): 위치라는 개념. (node 라고도 부름)
- 간선(edge): 위치 간의 관계. 즉, 노드를 연결하는 선 (link, branch 라고도 부름)
초기화 (Initialization):
- S ← ∅: MST에 포함된 정점 집합을 초기화(비어 있음).
- 각 정점 v의 비용(u.cost)을 무한대로 설정: u.cost←∞
- 시작 정점 r의 비용을 0으로 설정: r.cost←0
반복 (Iteration):
- 조건: S≠V (모든 정점이 MST에 포함될 때까지 반복).
- ExtractMin: S에 포함되지 않은 정점 중에서 비용(u.cost)이 가장 작은 정점 u를 선택한다.
- 선택된 정점 u를 정점 집합 S에 추가한다. S←S∪{u}
인접 정점 업데이트(Adjacent Vertex Update)
- 선택한 정점 u와 연결된 모든 인접 정점 v에 대해:
  - v가 아직 S에 포함되지 않았고, 간선 (u,v)의 가중치 wuv가 v.cost보다 작으면:
    - v.cost←wuv: 비용 갱신.
    - v.tree←u: v의 트리에 연결된 정점을 u로 설정.

선택된 정점과 인접한 노드의 비용을 업데이트한다.

종료 (Termination):
- 모든 정점이 S에 포함되면 종료된다.
- 최소 신장 트리 (Minimum Spanning Tree)가 완성된다.

💡중요포인트

탐욕적 접근(Greedy Choice Property):
- 각 단계에서 S에 포함되지 않은 정점 중에서 비용이 가장 작은 정점을 선택(ExtractMin)한다.
- 이러한 선택은 항상 최적해(MST)의 일부가 되도록 보장되는 역할을 한다.
ExtractMin:
- 우선순위 큐(Heap)를 사용하여, 비용(u.cost)이 가장 작은 정점을 효율적으로 선택한다.
- 이는 알고리즘의 핵심 연산이며, 프림 알고리즘의 시간 복잡도를 결정짓는 요소이다. O (ElogV)
MST(Minimum spanning tree)의 점진적 확장:
- 선택한 정점을 기반으로 연결된 간선을 탐색하고, 비용이 더 작은 간선을 선택하여 MST를 점진적으로 확장한다.

👀✅ 다음으로 이진트리의 최적합 경로에 대해서 알아본다. 그리디 알고리즘으로 최적해를 구할수없는 대표적인 예이다. 이로써 최적해가 보장되는 예 vs 보장되지 않는 예를 비교하며 어떤 경우에 알맞는 알고리즘을 적용할 것인가를 배우는 것이 이번시간의 목표이다.

그리디 알고리즘의 예 #3: 이진트리 - 최적합 경로 (Optimal Path in Binary Tree)

💡 주제 개요 (Topic Overview)

이진트리의 최적합 경로란 루트에서 리프노드까지 방문할때 각 노드의 가중치를 최대화하는 경로를 찾는 것이다. 그리디 알고리즘이 최적해가 되려면 이전에 선택한 결정이 이후에 영향을 주면 안된다. 현재 시점만 생각하기 때문에 그 다음 시점은 고려하지 않기 때문이다. 그런데 만약 현재 시점이 그 다음 결정에 영향을 준다고 하면 현재 시점에서 더 먼 범위를 바라봐야하므로 이 경우엔 그리디 알고리즘 적용이 불가능하게 된다.

✅ 그리디 알고리즘이 실패하는 이유 (Why Greedy Fails for Binary Tree Paths)

의존성 문제 (Dependency Issue):
- 그리디 알고리즘은 현재 시점에서만 최선의 선택을 한다.
- 그러나 이진트리의 최적합 경로에서는 현재 선택이 이후 선택에 영향을 미치기 때문에, 전체 경로를 고려해야 최적해를 구할 수 있다.
국소적 최적화와 전체 최적화의 차이 (Local vs Global Optimization):
- 그리디 알고리즘은 국소적 최적화 (local optimization)를 목표로 한다.
- 이진트리 문제에서는 전체 최적화 (global optimization)가 필요하므로, 그리디 알고리즘은 부적합하다.

✅ 최적해가 보장되는 예 vs 보장되지 않는 예

(Guaranteed vs Non-Guaranteed Examples)

💡 최적해 보장되는 경우 (Guaranteed Examples):

프림 알고리즘 (Prim Algorithm): 최소 신장 트리를 찾을 때, 간선 가중치만 고려하므로 그리디 방식이 적합하다.
다익스트라 알고리즘 (Dijkstra Algorithm): 특정 노드에서 모든 다른 노드까지의 최소 비용 경로를 찾는 경우이다.
크루스칼 알고리즘 (Kruskal Algorithm): 사이클 없이 최소 비용으로 모든 노드를 연결하는 경우이다.

💡 최적해 보장되지 않는 경우 (Non-Guaranteed Examples):

이진트리의 최적합 경로 (Optimal Path in Binary Tree): 현재 시점에서 최선의 선택이 이후에 더 나은 선택을 방해할 수 있다.
배낭 문제 (Knapsack Problem, 일반형): 물건의 이익/무게 비율만 고려하면 전체 최적해를 놓칠 수 있다.

문제를 해결하기 전에, 문제의 구조를 파악해 독립적인 결정인지, 의존적인 결정인지 분석하는 것이 중요하다. 분석 후에는 적합한 알고리즘 (그래디 vs 동적 프로그래밍)을 선택한다.

그리디 알고리즘의 예 #4 : 그리디 알고리즘이 실패하는 예 (Optimal Path in Binary Tree - Example of Greedy Failure)

1. 문제 정의 (Problem Definition):

트리의 루트 노드에서 리프 노드까지 이동하며 각 경로의 가중치 합이 최대가 되는 경로를 찾는 문제이다.

2. 그리디 알고리즘의 작동 방식 (How Greedy Works):

루트(10)에서 출발하여, 인접 노드 중 가장 큰 값을 가진 60을 선택한다.
60에서 리프 노드 2로 이동하며 경로가 종료된다.
이 과정에서 전체 경로의 가중치 합은 10 + 60 + 2 = 72 이다.

3. 전체 최적경로 (Global Optimal Path):

루트(10)에서 15를 선택한 뒤, 30 → 45 → 67 → 38 → 33 경로를 선택하는 경우, 경로의 가중치 합은 10+15+30+45+67+38+33=238로 그리디 경로보다 훨씬 큰 결과가 된다.

✅ 그리디 알고리즘의 한계 (Limitations of Greedy Algorithm)

현재 시점만 고려 (Focus on Current State Only):
- 그리디 알고리즘은 항상 현재 시점에서 가장 큰 값(최선의 선택)을 따른다.
- 이로 인해 다음 단계에서 발생할 선택의 기회를 놓치게 된다.
전역 시야 부족 (Lack of Global View):
- 전체 트리를 보지 않고 인접한 두 노드만 비교하기 때문에 전체 경로의 최적해를 구할 수 없게 된다.
의존성 문제 (Decision Dependency):
- 루트에서 특정 노드를 선택한 결과가 이후 경로 선택에 제한을 가하게 된다.
- 예: 루트에서 60을 선택하면, 이후에는 리프 노드 2로 제한되며 다른 경로를 탐색할 수 없다.

✅이진 트리 최적합 경로의 적합한 알고리즘

동적 프로그래밍 (Dynamic Programming)이 적합하다.
전체 경로를 탐색하며, 각 단계에서의 최적 경로를 저장하고 재활용하기 때문이다.

그리디 알고리즘의 예 #5: 동전 바꾸기 문제 (Coin Change Problem - Limitations of Greedy Algorithm and Optimal Solution)

동전바꾸기 문제는 쉽게 생각하면 거스름돈을 자판기에서 계산해서 출력할때 거스름돈이 600원인데 이 잔돈을 10원짜리로 60개가 나온다면 이용자는 당황할 것 이다. 거스름돈은 최소한의 동전이 되도록 계산해서 배출이 되게 된다. 이러한 상황을 알고리즘으로 구현하면 최적해가 보장되는 상황이 된다.

💡요약 (Summary)

동전 바꾸기 문제 정의 (Coin Change Problem):
- 주어진 금액을 가장 적은 수의 동전으로 교환하는 문제이다.
그리디 알고리즘의 성공 조건 (When Greedy Works):
- 동전 액면이 모두 아래 액면의 배수여야 최적해를 보장하게 된다.
- 예) 500원 = 100원 × 5, 100원 = 50원 × 2
그리디 알고리즘의 한계 (Limitations of Greedy):
- 액면 간 배수 관계가 없을 경우, 선택이 이후 값 계산에 영향을 미쳐 최적해를 보장할 수 없음.

✅그리디 알고리즘이 성공하는 경우

예제: 3,256원을 만들기 (Example: Making 3,256 KRW):
- 동전 액면: 500원, 100원, 50원, 10원, 5원, 1원.
- 그리디 알고리즘 동작:
  - 500원 × 6개 = 3,000원
  - 100원 × 2개 = 200원
  - 50원 × 1개 = 50원
  - 5원 × 1개 = 5원
  - 총 동전 수: 11개.
- 액면이 배수 관계를 유지하므로, 그리디 알고리즘이 최적해를 보장하게 된다.

✅그리디 알고리즘이 실패하는 경우

예제 : 1,300원을 만들기 (Example: Making 1,300 KRW):
- 동전 액면: 500원, 400원, 100원, 75원, 50원.
- 그리디 알고리즘 동작:
  - 500원 × 2개 = 1,000원
  - 100원 × 3개 = 300원
  - 총 동전 수: 5개.
  - 자판기 거스름돈 계산과 같은 빠른 계산, 단순 구현 상황에서 적합하다.
- 최적해 동작:
  - 500원 × 1개 = 500원
  - 400원 × 2개 = 800원
  - 총 동전 수: 3개.
  - DP 배열을 이용하여 각 금액에 대해 최소 동전 수를 계산할 때 적합하다.
- 위의 첨부된 예제에서도 설명하듯이 동전이 아래 액면의 배수가 아닌 경우 최적해를 보장하지 않게 된다. 나머지 값이 어떤 동전을 선택했느냐에 따라 나눌수있는 값이 달라지기 때문에 최적해가 보장되지 않게 되는 것이다. 위의 문제의 경우 그리디알고리즘이라면 500원 2개 + 100원 3개 = 1300원이되어 총 5개의 동전으로 만들게 된다. 최적해의 경우엔 500원 1개 + 400원 2개 = 1300원 이렇게 3개의 동전만으로 값을 완성 시키게 되는데 이는 가장 큰 값인 500원을 제일 많이 사용하지 않았으므로 그리디 알고리즘에서 나올 수 없는 결과이다. 그렇기 때문에 최적해가 보장되지 않게 된다.

그리디 알고리즘의 예 #6: 최적해가 보장되는 조건 (Conditions for Guaranteeing Optimal Solution with Greedy Algorithm)

그리디 알고리즘이 전체 최적해 (global optimal solution)를 보장하려면 다음 두 가지 조건이 반드시 만족되어야 한다.

✅ 탐욕 선택 조건 (Greedy Choice Property)

현재 시점에서 가장 최선의 선택을 했을 때, 이 선택이 이후의 선택에 영향을 미치지 않아야 한다. 즉 현재 시점에서 내린 선택이 이후의 선택에 독립적이어야 하며, 이는 현재 선택만으로 최적해를 만들 수 있음을 보장한다.
프림 알고리즘 (Prim Algorithm): 최소 가중치를 가진 간선을 선택해도 이후의 간선 선택 과정에 영향을 미치지 않음.
다익스트라 알고리즘 (Dijkstra Algorithm): 현재 최소 비용 노드를 선택해도 이후 노드 탐색에 독립적.
반대의 경우로는 이진트리 최적합 경로: 현재 선택한 노드가 이후 선택 가능한 경로를 제한하므로, 탐욕 선택 조건이 성립하지 않게 된다.

✅ 최적 부분 구조 (Optimal Substructure)

문제를 부분 문제 (subproblem)로 나누었을 때, 부분 문제의 최적해가 전체 문제의 최적해에 포함되어야 한다.
이는 동적 프로그래밍에서도 중요한 성질로, 문제를 재귀적으로 나누고 각 부분의 최적해를 조합해 전체 최적해를 보장할 수 있게 된다.
동전 거스름돈 문제 (Coin Change Problem): 3,256원을 만드는 문제를 500원, 100원, 50원 등으로 나누어 각각의 최적해를 구하고 조합한다.
크루스칼 알고리즘 (Kruskal Algorithm): 최소 신장 트리를 구성할 때, 부분적으로 선택된 간선들의 최적해가 전체 트리의 최적해에 포함한다.
반대의 경우, 1300원 동전 문제: 부분적으로 500원을 최대한 많이 사용하는 선택이 전체 최적해를 구성하지 않음.

4️⃣ 그리디 알고리즘 문제 (Example of Greedy Algorithms)

❤️동전 개수 최솟값 구하기 문제 (Coin Change Problem)
❤️카드 정렬하기 (Card Sorting Problem)
❤️회의실 배정하기 (Meeting Room Allocation Problem)
❤️최솟값을 만드는 괄호 배치 찾기(Finding Minimum Value with Proper Parentheses)

❤️ 동전 개수 최솟값 구하기 문제 (Coin Change Problem - Minimizing Coin Count)

✅ 문제 정의:

일반적으로 자판기에서 상품의 거스름 돈을 받을때 동전 개수가 최소가 되도록 하는 알고리즘이다.
동전은 총 N 종류가 있고 각 동전의 개수는 충분히 많다고 가정한다.
동전의 종류는 미리 정해져있다.
주어진 금액 K 를 동전개수가 최소가 되도록 채워야한다.

✅ 입출력의 예제

오름차순으로 동전의 액면이 입력으로 주어졌다 1원 , 5원 , 10원이렇게해서 50000원까지 포함하였다.

예제 1: 목표 금액 4200원을 만들기 위한 최소 동전 개수 = 6개
예제 2: 목표 금액 4790원을 만들기 위한 최소 동전 개수 = 12개

✅손으로 풀어보기 (Step-by-Step Example)

목표 금액 4200원(K = 4200)

동전 리스트: [1,5,10,50,100,500,1000,5000,10000,50000]

첫 번째 선택

동전리스트 A를 보면 5000, 10000, 50000은 목표 금액보다 크기 때문에 선택 할수없다. 결국 1000원짜리 부터 선택이 된다.
K=4200, 가장 큰 동전은 1000원
4200 ÷ 1000=4 (몫 = 4개, 나머지 = 200).
동전 개수 += 4.

두 번째 선택

200원을 가지고 다시 진행하게 된다. 동전리스트에서 500원은 200원보다 크므로 선택할수없다. 결국 100원짜리를 선택하게 된다.
K(목표금액) = 200, 가장 큰 동전은 100원
200 ÷ 100=2 (몫 = 2개, 나머지 = 0).
동전 개수 += 2.

세 번째 선택

k = 0이 되며 알고리즘이 종료하게 된다. 4+2 의 계산결과 6이 최소한의 동전개수가 되는 것이다.
K = 0, 최소 동전 개수 4 + 2= 6

동전 개수 최솟값 구하기 문제 #2: 의사코드 (Pseudocode - Coin Change Problem)

주어진 금액 K를 가장 적은 수의 동전으로 만들기 위해 큰 동전부터 사용한다.
동전 액면값은 내림차순(Descending Order)으로 정렬되어 있다고 가정한다.

`내림차순 (Descending Order)` 값이 큰 것부터 작은 것으로 정렬되는 순서

2. 동전 액면값 저장

for N만큼 반복:
    A 리스트 저장

목적: N개의 동전 액면값을 입력받아 A리스트에 저장한다.
A는 동전의 액면값(예: 500, 100, 50, 10, 5, 1)을 포함하는 배열이다.

3. 큰 동전부터 사용

for N만큼 반복 (N - 1 → 0으로 역순으로 반복): # N: 사용할 동전의 종류(개수).
    if 현재 K보다 동전 가치가 작거나 같으면:
        동전 수 += 목표 금액 // 현재 동전 가치
        목표 금액 = 목표 금액 % 현재 동전 가치

탐색 순서: A(동전 데이터 리스트)를 큰 값부터 작은 값으로 탐색한다.
조건 확인: 현재 동전 A[i]의 가치가 K(목표 금액)보다 작거나 같으면, 해당 동전을 사용할 수 있게 된다.
동작:
- 동전 사용 개수 계산: K ÷ A[i] (목표 금액에서 해당 동전을 최대한 사용 가능한 개수).
- 남은 금액 계산: K % A[i] (현재 동전을 사용하고 남은 금액).

4. 결과 출력

반복이 종료되면, 사용한 동전의 총 개수를 출력한다.

동전 개수 최솟값 구하기 문제 #3: 파이썬 코드 (Coin Change Problem - Python Code)

1. 입력 처리

N, K = map(int, input().split())  # 동전 개수 N, 목표 금액 K 입력
A = [0] * N                      # 동전 리스트 초기화 (N개의 0으로 시작)

입력값: N: 사용할 동전의 종류 수, K: 만들고자 하는 목표 금액.
A: 동전의 액면값을 저장할 리스트로, 초기값은 0으로 설정한다.

2. 동전 액면값 저장

for i in range(N): # N개의 동전 액면값을 입력받아 리스트 A에 저장
    A[i] = int(input())

N개의 동전 액면값을 입력받아 리스트 A에 저장한다.
입력된 액면값은 오름차순 정렬된 상태로 저장된다.
오름차순 (Ascending Order) 값이 작은 것부터 큰 것으로 정렬되는 순서를 말함

3. 동전 개수 계산 (큰 동전부터 사용)

count = 0                    # 사용된 동전 개수를 기록하는 변수 초기화

for i in range(N - 1, -1, -1):# 큰 동전부터 반복 (리스트를 역순 탐색)
    if A[i] <= K:             # 현재 동전 가치가 목표 금액보다 작거나 같으면:
        count += K // A[i]    # 현재 동전의 최대 개수를 추가
        K = K % A[i]          # 남은 금액 계산

탐색 순서: 리스트 A를 역순으로 탐색하여 큰 동전부터 처리한다.
조건: 현재 동전의 액면값 A[i]이 목표 금액 K보다 작거나 같을 경우, 해당 동전을 사용할 수 있다.
동전 개수 계산:
- K//A[i]: 현재 목표 금액에서 사용할 수 있는 최대 동전 개수.
- count: 사용한 동전의 총 개수를 누적한다.
남은 금액 계산:
- K = K % A[i] : 현재 동전을 사용하고 남은 금액을 업데이트한다.

4. 결과 출력

print(count)  # 사용된 동전의 총 개수 출력

모든 반복이 끝난 후, 사용된 동전의 최소 개수를 출력한다.

💡 동전 개수의 최솟값 문제와 우선순위 큐 학습의 연계

✅ 우선순위 큐는 그리디 알고리즘에서 자주 사용되는 도구

그리디 알고리즘의 핵심은 현재 시점에서 가장 최적의 선택을 반복적으로 수행하는 것이다. 이를 구현하기 위해 가장 적합한 자료구조 중 하나가 우선순위 큐이다.

동전 문제에서는 단순히 "가장 큰 동전부터 탐색"하는 방식으로 해결되지만,
복잡한 문제에서는 "가장 비용이 낮은 옵션"을 빠르게 찾기 위해 우선순위 큐가 필요하다.
- 예: Prim 알고리즘, Dijkstra 알고리즘 등.

따라서, 동전 문제를 통해 그리디 알고리즘의 개념을 배우고, 우선순위 큐를 활용할 수 있게 학습하는 것이다.

✅동전 문제는 간단한 예제, 우선순위 큐는 복잡한 문제로 확장 가능

동전 문제는 비교적 단순한 그리디 문제로, 우선순위 큐 없이도 효율적으로 해결된다.
하지만 복잡한 문제(예: 그래프 탐색, 작업 스케줄링 등)로 확장하면, 최적의 선택을 효율적으로 관리하기 위해 우선순위 큐가 필요하게 된다.
만약 동전의 리스트가 동적으로 변경되거나, 조건이 추가된다면?
모든 후보 중 최적의 선택을 효율적으로 관리해야 한다면?

✅우선순위 큐의 학습이 그리디 알고리즘 확장에 도움된다.

우선순위 큐는 그리디 알고리즘을 구현하거나 확장하는 데 유용한 도구가 된다. 동전 문제에서 단순히 "최적의 선택"을 반복적으로 수행하는 과정을 확장하여, 다음과 같은 문제로 적용할 수 있기 때문이다.

Prim 알고리즘 (최소 신장 트리):
- 우선순위 큐를 사용해, 현재 가장 낮은 비용의 간선을 빠르게 선택.
Dijkstra 알고리즘 (최단 경로):
- 우선순위 큐를 사용해, 현재 가장 짧은 거리의 노드를 효율적으로 탐색.
Huffman 코딩 (압축 알고리즘):
- 우선순위 큐를 사용해, 최소 비용으로 이진 트리를 구성.

✅정리: 동전 개수의 최솟값 문제는 그리디 알고리즘의 단순하고 직관적인 예제이다. 우선순위 큐는 이 문제를 직접적으로 최적화하지 않더라도:

현재 시점에서 최적 선택을 관리하는 도구로서 그리디 알고리즘의 본질과 연결된다.
복잡한 문제로 확장할 때, 우선순위 큐의 효용성을 이해하도록 돕는다.
학습의 흐름으로, 간단한 문제에서 출발해 알고리즘의 기본 개념과 도구 활용법을 배우게 된다.

동전 개수 최솟값 구하기 문제 #4: 우선순위 큐(Priority Queue - Coin Change Problem)

💡 우선순위 큐란?

우선순위 큐는 "가장 중요한 데이터"를 먼저 꺼내도록 설계된 구조를 뜻한다.

예시: 만약 은행에서 VIP 고객이 일반 고객보다 먼저 처리되길 원한다면, 우선순위를 VIP > 일반 고객으로 설정할 수 있다.
큐에 넣는 순서와 상관없이, 우선순위가 높은 것부터 꺼내는 것이다.

Python에서 우선순위 큐를 만드는 두 가지 방법

✅ `PriorityQueue` (스레드-안전 큐)

Python에서 PriorityQueue는 멀티스레드 환경에서도 안전하게 작동하는 우선순위 큐이다.

사용법 간단 요약:
1. PriorityQueue를 우선순위 큐로 생성한다.
2. put(data): 데이터를 큐에 넣는다.
3. get(): 가장 우선순위가 높은 데이터를 꺼낸다.
4. qsize(): 큐사이즈를 가져온다.
5. empty(): 큐가 비어 있는지 확인한다.

    from queue import PriorityQueue

    # 우선순위 큐 생성
    myque = PriorityQueue()

    # 데이터 넣기
    myque.put(5)  # 숫자 5 추가
    myque.put(1)  # 숫자 1 추가
    myque.put(3)  # 숫자 3 추가

    # 우선순위 높은 데이터 꺼내기
    print(myque.get())  # 출력: 1 (가장 작은 숫자가 우선)
    print(myque.get())  # 출력: 3
    print(myque.get())  # 출력: 5

✅`heapq` (간단한 힙 구조 큐)

heapq는 Python에서 우선순위 큐를 구현하는 또 다른 방법이다. 리스트를 최소 힙(Min Heap) 구조로 바꿔서 데이터를 관리한다.

사용법 간단 요약:
1. 리스트를 생성한다.
2. heappush(): 데이터를 힙구조로 삽입하며, 자동으로 정렬한다.
3. heappop(): 가장 작은 값을 꺼내고 힙을 유지한다.
4. heapify(): 기존 리스트를 힙으로 변환한다.

    import heapq

    # 빈 리스트 생성
    mylist = []

    # 데이터 넣기
    heapq.heappush(mylist, 1)  # 숫자 1 추가

    # 우선순위 높은 데이터 꺼내기
    heapq.heappush(mylist, data)  # 힙에 데이터 추가
    heapq.heappop(mylist)         # 힙에서 가장 작은 데이터 꺼내기
    heapq.heapify(mylist)         # 일반 리스트를 힙 구조로 변환

보통 빠른 성능을 원할 때는 heapq를 많이 사용한다.

❤️카드 정렬하기(Card Sorting Problem)

✅ 문제 정의 (Problem Definition)

정렬된 두 묶음 A와 B를 하나로 합치려면 A+B만큼의 비교가 필요하다.
정렬된 여러 묶음의 숫자 카드가 있을 때, 이들을 두 묶음씩 골라 서로 합쳐 나가는 과정을 반복한다.
- 이 과정에서 합치는 순서에 따라 비교 횟수가 달라지게 된다.
목표 (Goal): 묶음의 순서를 잘 조정하여, 최소한의 비교 횟수를 구하는 것이다.

✅입출력의 예

세 가지의 묶음을 두 개씩 합쳐서 전체적으로 하나의 묶음으로 만들때 최소한의 비교횟수를 출력하는 것이 이 문제이다. 이 예제에서는 100이 출력되었다.

입력 형식 (Input Format)

첫째 줄: 숫자 카드 묶음의 개수 N (1 ≤ N ≤ 100,000)
다음 N줄: 각 줄에 카드 묶음의 크기 K가 주어짐 (1 ≤ K ≤ 1,000,000)

✅입출력의 예제 문제 분석하기

Case 1 : 10장과 20장을 먼저 합치는 경우

첫 번째 합침: 10 + 20 =30
두 번째 합침: 합쳐진 묶음 30과 40을 합침. 30 + 40 = 70
총 비교 횟수: 30 + 70 = 100

Case 2: 10장과 40장을 먼저 합치는 경우

첫 번째 합침: 10 + 40 = 50
두번째 합침: 합쳐진 묶음 50과 20을 합친다. 50 + 20 = 70
총 비교 횟수: 50 + 70 = 120

✅ 분석 결과

비교 횟수는 선택 순서에 따라 달라진다.
- Case 1 (10 + 20 먼저 합침): 총 비교 횟수 = 100.
- Case 2 (10 + 40 먼저 합침): 총 비교 횟수 = 120.
초기 선택이 중요한 이유:
- 처음 선택된 묶음은 이후에 계속 포함되어 추가 비교가 발생한다.
- 따라서, 처음 선택하는 묶음은 카드 개수가 작은 것이 유리하다.
- 한 번 합쳐진 묶음은 다음 합칠 때 더 많은 비교에 포함되기 때문이다.
- 매번 가장 작은 두 묶음을 선택하는 방식이 최적의 선택인데 이는 그리디 알고리즘이다.

✅손으로 풀어보기 (Step-by-Step)

현재 카드 묶음 중 가장 작은 두 묶음을 선택해 합친다.
합쳐진 묶음을 다시 카드 묶음 집합에 추가한다.
위 두 과정을 카드 묶음이 하나만 남을 때까지 반복한다.
합쳐진 묶음의 비교 횟수를 모두 더하여 최소 비교 횟수를 구한다.

우선순위 큐 작동 방식:
- 카드 묶음을 우선순위 큐에 삽입한다.
- 우선순위 큐는 항상 가장 작은 값이 먼저 나오도록 정렬 상태를 유지한다.
구체적인 동작 과정:
- 초기 상태: [40,20,10]
  - 큐에서 가장 작은 두 묶음 10과 20을 꺼냄.
  - 합침: 10 + 20 = 30
  - 합친 결과 30을 다시 큐에 삽입: [40,30]
  - 순서가 중요한 것은 아니지만 현재 가장 작은 두 묶음을 선택해 합치는 방식이 전체적으로 최적의 결과를 보장하는 그리디 알고리즘의 개념이 구현된 부분이다.
- 두 번째 반복: [40,30]
  - 큐에서 가장 작은 두 묶음 30과 40을 꺼냄.
  - 합침: 30 + 40 = 70
  - 합친 결과 70을 다시 큐에 삽입: [70]
최종 상태:
- 큐에 카드 묶음이 하나만 남으면 종료.
- 사용된 모든 비교 횟수를 합산: 30 + 70 = 100

카드 정렬하기 #1: 의사코드 (Pseudocode in Card Sorting Problem)

초기화: N: 카드 묶음의 개수, pq: 우선순위 큐(작은 값이 우선적으로 처리됨).
데이터 입력: N만큼 반복하며, 각 카드 묶음의 크기를 우선순위 큐에 저장한다.
알고리즘 동작:
- 반복 조건: 우선순위 큐의 크기가 1이 될 때까지 진행.
- 동작 과정:
  1. 우선순위 큐에서 가장 작은 두 개의 카드 묶음을 꺼냄.
  2. 두 카드 묶음을 합치는 데 필요한 비교 횟수를 결과 값에 더함.
  3. 합친 카드 묶음의 크기를 다시 우선순위 큐에 넣음.
최종 결과: 큐의 크기가 1이 되면 최소 비교 횟수가 계산 완료.

✅주요 특징

우선순위 큐: 각 단계에서 가장 작은 두 값을 쉽게 꺼낼 수 있어 효율적이다.
그리디 전략: 매번 가장 작은 두 카드 묶음을 합치는 국소 최적 선택을 통해 전체 최적 결과를 도출한다.

이 알고리즘은 허프만 코딩(Huffman Coding)과 유사한 방식으로, 주어진 문제에서 최소 비용으로 작업을 수행하기 위해 설계되었다.

카드 정렬하기 #2: 파이썬코드 (Python in Card Sorting Problem)

우선순위 큐 초기화:
- from queue import PriorityQueue를 사용하여 Python의 우선순위 큐 모듈을 가져온다.
- pq = PriorityQueue()를 통해 우선순위 큐 객체를 생성한다.
입력 처리:
- N = int(input()): 카드 묶음의 개수를 입력받는다.
- 반복문 for _ in range(N)::
  - 각 카드 묶음의 크기를 입력받아(date = int(input())) 우선순위 큐에 저장한다.(pq.put(date)).
초기 변수 설정:
- data1, data2, sum 변수를 0으로 초기화한다.
- sum은 최종 최소 비교 횟수를 저장하는 변수이다.
그리디 알고리즘 실행:
- while pq.qsize() > 1: 우선순위 큐의 크기가 1보다 클 때까지 반복.
  - 두 카드 묶음 꺼내기: data1 = pq.get(), data2 = pq.get().
  - 두 카드 묶음 합치기:
    - temp = data1 + data2: 두 카드 묶음의 크기를 더한다.
    - sum += temp: 비교 횟수를 합산.
  - 합친 묶음을 다시 큐에 삽입: pq.put(temp).
결과 저장:
- sum에 모든 비교 횟수의 합이 저장된다.

✅ 작동 방식 요약

작은 두 카드 묶음을 반복적으로 합치며 비교 횟수를 최소화하는 방식이다.
우선순위 큐를 사용해 매번 가장 작은 두 값을 빠르게 선택한다.
이는 허프만 코딩 문제의 원리와 동일하며, 최소 비용으로 작업을 수행할 수 있는 대표적인 그리디 알고리즘이다.

❤️회의실 배정하기 (Meeting Room Allocation Problem)

💡요약: 이 알고리즘은 종료 시간이 빠른 순서대로 회의를 선택하여 최대한 많은 회의를 배치하는 그리디 알고리즘의 대표적인 사례이다. 이 알고리즘을 이용하면 강의실, 세미나실 대여와 같은 실제 상황에서 효과적으로 활용할 수 있게 된다.

✅ 문제 정의 (Problem Definition)

목표 (Goal): 하나의 회의실에서 겹치지 않게 최대한 많은 회의를 배정하는 스케줄을 작성한다.
입력 조건 (Input Conditions)
- 첫 번째 줄에 회의의 개수 n이 주어진다.
- 두번째 줄부터는 N+1줄까지 각 회의의 시간과 끝 시간이 주어진다.
- 이후 각 회의의 시작 시간과 종료 시간이 주어진다.
출력 조건 (Output Conditions)
- 겹치지 않고 진행 가능한 최대 회의 수를 출력하게 된다.

✅ 문제 분석 및 해결 방법 (Problem Analysis and Solution Approach)

종료 시간 기준 정렬 (Sort by End Time)
- 회의를 가장 많이 개최하려면 종료 시간이 빠른 회의를 먼저 선택해야 한다.
- 종료 시간이 같을 경우, 시작 시간을 기준으로 다시 정렬한다.
그리디 알고리즘 전략 (Greedy Algorithm Strategy)
- 종료 시간이 가장 빠른 회의를 먼저 선택하고, 이후로 겹치지 않는 회의만 추가한다.
구현 순서 (Implementation Steps)
1. 종료 시간을 기준으로 회의를 정렬한다.
2. 첫 번째 회의를 선택한다.
3. 이후로 순차적으로 겹치지 않는 회의를 추가한다.

입력으로 회의 개수가 먼저 주어진다. 그 다음 줄엔 회의의 시작 시간과 종료 시간이 주어지고, 결과적으로 4개의 회의가 가능함을 예제 출력을 통해 보여주고 있다.

✅ 손으로 풀어보기 (Step-By-Step)

위에서 주어진 입력 값으로 0 시에서 13시까지 배정하는 예제이다.

각 회의의 시작 시간과 끝시간이 컬러 블록으로 표시되어 정렬되어 있다. 정렬의 기준은 "끝시간"이된다. 그 다음, 순서대로 회의를 하나씩 스케줄화 하여 등록하게 된다.

가장 처음에 배치되어있던 배열이 우선순위가 먼저이므로 첫 번째 회의 (1, 4) 선택한다
두번째 세번째는 배정 순위가 겹치기 때문에 배정할 수 없다. (3,5) & (0,6)
겹치는 다음 배열은 모두 무시하고 겹치지 않으면서 종료시간이 빠른 회의를 선택한다. (5, 7), (8, 11), (12, 14)
최종적으로 총 4개의 회의가 배치되었다.
(Select meetings in order: (1, 4), (5, 7), (8, 11), (12, 14).)

회의실 배정하기: 의사코드 (Pseudocode in Meeting Room Allocation Problem)

✅ 입력 데이터

N: 회의의 개수 (총 몇 개의 회의가 있는지).
A: 각 회의의 시작 시간과 종료 시간 정보가 저장된 리스트.

✅정렬 과정 A리스트 정렬 수행

회의의 종료 시간 기준으로 회의를 정렬한다.
만약 종료 시간이 같다면 시작 시간 기준으로 정렬한다.
- 이유: 종료 시간이 빠른 회의를 우선적으로 배정해야 최대한 많은 회의를 배치할 수 있기 때문이다.

✅ 회의 배정

이전에 선택한 회의의 종료 시간을 기준으로 잡고, 그 이후에 시작할 수 있는 회의를 선택한다.
선택된 회의를 진행 가능하다고 판단하고, 종료 시간을 업데이트한다.

✅결과 계산: 반복문이 끝날 때까지 배정된 회의의 개수를 출력한다.

회의실 배정하기: 파이썬 코드 (Python Code in Meeting Room Allocation Problem)

✅입력 데이터 초기화

N = int(input())  # 회의의 개수 (총 N개의 회의)
A = [[0] * 2 for _ in range(N)]  # 각 회의의 [종료 시간, 시작 시간]을 저장할 리스트

N: 총 몇 개의 회의가 있는지 사용자로부터 입력받습니다.
A: 각 회의의 정보를 저장할 2차원 리스트이다.
- [종료 시간, 시작 시간] 형식으로 저장한다.

✅ 회의 데이터 입력받기

for i in range(N):
    S, E = map(int, input().split())  # 각 회의의 시작 시간(S)과 종료 시간(E)을 입력받음
    A[i][0] = E  # 종료 시간을 첫 번째 값으로 저장
    A[i][1] = S  # 시작 시간을 두 번째 값으로 저장

입력 형식:
- 각 회의의 시작 시간과 종료 시간이 공백으로 구분되어 주어진다.
- 예를 들어, 1 4는 시작 시간이 1, 종료 시간이 4인 회의를 의미한다.
A 리스트:
- 종료 시간을 첫 번째, 시작 시간을 두 번째 값으로 저장한다.
- 종료 시간이 우선 정렬의 기준이 되기 때문이다.

✅ 회의 정렬

A.sort()

정렬 기준:
- 기본적으로 Python의 리스트 정렬은 첫 번째 값을 기준으로 정렬한다.
- 따라서, 종료 시간을 기준으로 오름차순 정렬된다. (오름차순: 작은 것부터 큰 것의 순)
- 만약 종료 시간이 같다면, 시작 시간을 기준으로 정렬한다.

✅ 회의 배정 과정

count = 0  # 배정된 회의 개수
end = -1   # 현재 진행 중인 회의의 종료 시간 (-1로 초기화)

count: 최종적으로 배정된 회의의 개수를 저장하는 변수.
- 초기에 0으로 설정.
end: 현재 선택된 회의의 종료 시간을 저장한ㄷ,.
- 초기값은 -1로 설정하여 어떤 회의도 배정되지 않았음을 나타낸다.

✅ 반복문을 통해 회의 배정

for i in range(N):
    if A[i][1] >= end:  # 현재 회의의 시작 시간이 이전 회의의 종료 시간 이후라면
        end = A[i][0]   # 현재 회의의 종료 시간으로 업데이트
        count += 1      # 배정된 회의 개수 증가

조건: 현재 회의의 시작 시간이 이전 회의의 종료 시간 이후라면, 현재 회의를 배정할 수 있다.
동작: 현재 회의를 배정한 뒤, 종료 시간을 업데이트한다.
- count를 1 증가시켜 배정된 회의의 개수를 기록한다.

✅ 결과 출력

print(count)

최종적으로 배정된 최대 회의 개수를 출력한다.

❤️ 최솟값을 만드는 괄호 배치 문제 (Finding Minimum Value with Proper Parentheses)

✅ 문제 정의 (Problem Definition)

입력 형식: 0~9 사이의 숫자와 +, - 연산자로 이루어진 수식이 주어지게 된다.
- 예제 입력: 100-40+50+74-30+29-45+43+11
출력 형식: 수식의 값을 최소로 만드는 결과를 출력해야 한다.
- 예제 출력: -222

✅ 풀이 과정 (Step-by-Step Solution)

1. 문제 분석

최솟값을 구하려면:
- 덧셈(+) 부분을 먼저 계산해서 최대한 큰 값을 만든다.
- 이 값을 - 연산과 함께 한꺼번에 빼준다.
예를 들어)

100 − (40+50+74) − (30+29) − (45+43+11)

계산하면 100 − 164 − 59 − 99 = −222

💡중요포인트:

핵심 아이디어: - 뒤의 값을 모두 더한 후 한꺼번에 빼주는 방식이 가장 작은 값을 만든다.
풀이 전략: 수식을 -로 분리하고, 각 부분의 덧셈을 계산한 뒤 첫 번째 값에서 차감한다.
시간 복잡도: 효율적으로 O(N)의 시간에 계산 가능하다.

최솟값을 만드는 괄호 배치 문제 - 의사코드 (Pseudocode in Finding Minimum Value with Proper Parentheses)

answer 변수: 정답을 저장하는 변수이고 초기값은 0으로 설정한다.
A 리스트: 입력된 수식을 - 기호를 기준으로 나눈다.
- 예: 100 - 40+50+74 - 30+29 → ['100', '40+50+74', '30+29'].
mySum(string) 함수: 입력받은 문자열에서 + 기호를 기준으로 나누고 split 수행, 각각의 숫자를 더한 값을 반환한다.
- 예: mySum('40+50+74') → 40 + 50 + 74 = 164

for 반복문:
- A 리스트의 각 요소를 순회하며, 값을 더하거나 뺀다.
- 첫 번째 요소는 항상 더하기 연산.
- 두 번째 요소부터는 모두 빼기 연산.
최종 출력:
- 반복문이 종료되면 answer에는 최솟값이 저장되며, 이를 출력하게 된다.

💡이 의사코드에서 중요포인트:

그리디 접근: 첫 번째 값은 더하고, 나머지 값은 모두 빼주는 방식으로 최소값을 구하는 전형적인 그리디 알고리즘의 형태이다.
시간 복잡도:
- 입력된 수식의 길이를 N이라 할 때, split과 mySum 함수 모두 O(N)에 처리된다.
- 따라서, 전체 알고리즘의 시간 복잡도는 O(N)가 된다.
장점: 코드가 간결하며, 효율적이다. 수식이 길어져도 빠르게 처리 가능하다.

최솟값을 만드는 괄호 배치 문제 - 파이썬 코드 (Pseudocode in Finding Minimum Value with Proper Parentheses)

A 리스트 생성: 입력된 수식을 -로 나눈다.
mySum 함수: 이 함수는 + 연산을 처리한다. -로 나뉜 그룹들(문자열) 안의 값을 모두 더해 반환하는 함수이다.
최종 계산: 첫 번째 그룹은 항상 더하기 연산을 수행한다. 나머지 그룹은 모두 빼기 연산을 수행한다.

💡중요포인트

코드 흐름:
- mySum 함수는 그룹 내부의 덧셈을 처리한다.
- for 반복문은 그룹 간 연산(더하기/빼기)을 처리한다.
동작 원리: 첫 번째 그룹은 무조건 더하고, 나머지 그룹은 모두 빼준다.
최소값을 만드는 이유: 가장 큰 값을 빼기 위해 - 뒤에 있는 모든 값을 더한 후 한꺼번에 빼준다.

5️⃣매트로이드(Matroid)

✅ 매트로이드란? 그리디 알고리즘으로 최적해(optimal solution)가 보장되는 공간 구조(spatial structure)를 의미한다.

✅ 매트로이드와 그리디 알고리즘의 관계

매트로이드의 중요성: 매트로이드로 정의된 문제는 항상 그리디 알고리즘으로 최적해를 구할 수 있다.
반례:
- 그리디 알고리즘으로 최적해를 구할 수 있다고 해서 모든 문제가 매트로이드가 되는 것은 아니다.
- 예: 다익스트라 알고리즘은 매트로이드 이론과 직접적으로 관련되지 않지만 최적해를 보장한다.
포인트: 매트로이드 구조가 성립하면 그리디 알고리즘의 적용이 쉽고, 최적화 문제를 효율적으로 풀 수 있게 된다.

위의 그림을 예로 들면, 매트로이드는 그리디 알고리즘으로 최적해가 보장되는 부분집합이다. 즉, 전체 집합의 일부로, 최적화 문제를 해결하기 위한 성질을 만족한다.

✅ 실무에서의 활용: 매트로이드 이론을 이해하면, 그리디 알고리즘을 적용할 수 있는 문제를 더 잘 구분하고 최적의 결과를 도출할 수 있게 된다.

매트로이드(Matroid) #1: 상속성과 증강성 = 독립성

✅ 독립성 (Independence)

매트로이드의 두 가지 성질인 상속성 (Hereditary Property)과 증강성 (Exchange Property)을 합쳐 독립성이라고 부른다. 즉 부분해들이 독립적으로 최적 조건을 만족하여, 이들이 합쳐져도 전체 최적해를 이루어야 한다는 조건을 뜻한다.

💡 독립성의 핵심

매트로이드 공간에서 모든 부분해는 독립적으로 최적 조건을 만족한. 이를 통해, 매트로이드 공간에서는 그리디 알고리즘을 안전하게 사용할 수 있게된다.

✅ 상속성 (Hereditary Property)

정의: 해집합의 모든 부분집합은 여전히 해집합에 속해야 한다.

만약 𝐴 ∈ 𝐼 이고 𝐵 ⊆ 𝐴 라면 𝐵 ∈ 𝐼야 한다.
즉, 해집합에 속한 집합 A의 모든 부분집합 B도 해집합에 속해야 한다는 뜻이다.
- ❓🤔당연한 것 같은데 왜 이 조건이 언급된 것 일까? 앞에서 그리디 알고리즘의 특성에 대해 생각해보자. 전체적인 골이 하나가 있을때 그리디 알고리즘을 적용하기 좋은 상태가 된다. 이에 따라 집합도 정의가 되는데 실제 그래프의 경우는 항상 포물선의 형태가 아닌 여러개의 골짜기가 생기는 경우도있다. 이럴 경우엔 B가 A에 속하지만 해집합이 아니게 되게 된다. 그래서 “상속성”이 당연히 모든 상황에 일어날 것 같지만 항상 당연한 경우는 아니기 때문에 매트로이드가 수행되려면 상속성이 보장되어야한다.
∈ (원소 포함, "belongs to") : 어떤 요소가 특정 집합의 원소임을 나타낸다.
⊆ (부분 집합, "subset of") 한 집합의 모든 원소가 다른 집합에 포함될 때, 부분 집합 관계를 나타낸다.
해집합 (Solution Set) 문제나 방정식을 만족하는 값들의 집합

그리디 알고리즘과의 연관성:

큰 해인 A가 작은 해인 B를 품고 있는 형태는 그리디 알고리즘에서 언급된 "최적 부분 구조(Optimal Substructure)"와 밀접한 연관이 있다.
- "최적 부분 구조"란, 전체 해의 최적해가 포함하는 모든 부분도 최적해라는 성질을 의미한다.
Key Point: 매트로이드의 성질인 상속성에서 그리디 알고리즘의 최적 부분 구조를 이해하는 것이 핵심이다.

✅ 증강성 (Exchange Property)

정의: 작은 집합을 확장해도 여전히 해집합에 속해야 한다.
- 두 집합 A,B ∈ I가 주어지고, ∣A∣ < ∣B∣일 때:
  - B∖A에 속하는 원소 x를 A에 추가해도 A ∪ {x} ∈ I 가 되어야 한다.
- 즉, 작은 집합 A를 확장해도 여전히 전체 해집합에 속할 수 있어야 한다.
- ∈ (원소 포함, "belongs to") : 어떤 요소가 특정 집합의 원소임을 나타낸다.
- ∣A∣: 집합 A의 원소 개수
- B ∖ A 집합 B에서 A에 속하지 않는 원소들만 포함하는 집합
- A ∪ {x} A에 x를 추가한 새로운 집합
- ∪ {x}: 합집합. 집합에 원소 x를 추가.
그리디 알고리즘과의 연관성:
- 증강성은 "탐욕 선택 조건(greedy choice property)"과 밀접하게 연관된다.
  - "탐욕 선택 조건"이란, 이전에 선택된 해가 이후에 추가된 해에 영향을 주지 않아야 함을 의미한다.
- Key Point: 매트로이드의 성질인 증강성(Exchange Property)에서 그리디 알고리즘의 탐욕 선택 조건(Greedy Choice Property)를 이해하는 것이 핵심이다.

매트로이드(Matroid) #2: 그래픽 매트로이드 (Graphic Matroid)

✅ 그래픽 매트로이드란?

그래픽 매트로이드는 간단하게 "숲(Forest)"으로 정의된다.
숲: 사이클이 없는 간선 집합 또는 여러 개의 트리로 이루어진 집합이다.

숲 집합 𝐹: 모든 가능한 사이클이 없는 부분 집합의 집합을 뜻한다.
2^E : 그래프E에서 나올 수 있는 모든 부분 집합을 뜻한다.
𝐹 ⊆ 2𝐸 : 그래프의 모든 간선 중 사이클이 없는 부분 그래프들만 선택된 집합이라는 뜻
F가 매트로이드라는 것은, 숲 집합이 독립성 조건 (independence condition)을 만족한다는 것을 뜻한다.

✅ 숲(Graphic Matroid)의 예제

그래프와 숲:
- 주어진 그래프 G는 정점과 간선으로 이루어져 있다.
- 이 그래프에서 간선의 부분집합을 선택하여 숲을 구성하게 된다.
- 숲은 사이클이 없는 간선 집합으로 이루어진 부분 그래프이다.
- 여러 개의 트리로 구성되므로 이를 "숲(Forest)"이라 부른다.
숲의 집합:
- 그래프에서 간선들을 선택하여 사이클이 없는 트리들로 이루어진 집합이 된다.
- 이 숲의 집합은 매트로이드의 조건을 만족한다.

✅ 숲이 매트로이드임을 증명하기

상속성 (Hereditary Property): 숲의 모든 부분집합도 숲이어야 한다.
- 트리는 사이클이 없는 간선 집합이다.
- 트리의 간선을 일부 선택한 부분집합도 사이클이 없으므로 숲이다.
- 결론: 숲의 상속성이 성립하기 때문에 매트로이드이다.
증강성 (Exchange Property):
- 정의: 두 숲 A, B에서 ∣A∣ < ∣B∣ 일 때, B∖A 속하는 간선 e를 A에 추가해도 A∪{e}는 숲이어야 한다.
- 설명: 숲 A와 B가 있다고 가정하자. 작은 숲에 간선을 추가해도 여전히 숲이다.
- 결론: 숲의 증강성이 성립하기 때문에 매트로이드이다.

6️⃣매트로이드의 확장(Matroid Expansion)

매트로이드의 확장과 포화 (Extension and Maximal Set of Matroid)

💡요약: 확장 (Extension)은 집합 A에 원소 x를 추가해도 독립적이라면, x는 A를 확장한다.포화 (Maximal Set)란 A가 확장되지 않는 상태라면, A는 포화 상태임을 뜻한다.

✅ 매트로이드의 확장 (Extension)

매트로이드 I는 "독립적인 집합"의 모음이다.
집합 A가 이미 독립적이고, x라는 새 원소를 추가해도 여전히 독립적이라면,
이 원소 x는 A를 확장할 수 있다는 뜻이다.
의미:
- 그리디 알고리즘에서 해를 하나씩 확장하여 최적해를 구성하는 과정과 동일합니다.
- 원소를 추가해가며 전체 해집합으로 확장하는 과정을 나타냅니다.
예제:
- 최소 신장 트리 (Minimum Spanning Tree) 문제에서 간선 e를 현재 트리에 추가할 때 사이클이 없으면, 이는 트리를 확장하는 과정에 해당한다.
- S={1,2,3}, I={∅,{1},{2},{1,2}}라고 가정하고 A = {1}일 때, 원소 x = 2를 추가하면 A∪{x} = {1,2}이고, {1,2} ∈ I 이므로 x는 A를 확장할 수 있다.

✅ 매트로이드의 포화 (Maximal Set)

집합 A에 새로운 원소 x를 추가함으로써 독립성을 잃는다면, A는 더 이상 확장될 수 없다. 즉, 모든 가능한 원소를 추가해 더 이상 확장할 수 없는 상태이다.
이 상태의 A를 포화 집합이라고 한다.
의미:
- 그리디 알고리즘의 종료 조건과 유사하다.
- 최적해를 찾고 모든 해를 포함하면 알고리즘이 종료된다.
예제:
- 최소 신장 트리에서 모든 정점을 연결하고 더 이상 추가할 간선이 없으면, 이는 포화 상태를 나타낸다.
- 위 예제에서S = {1,2,3}, I = {∅,{1},{2},{1,2}} 에서 A = {1,2}일 때, 새로운 원소 x = 3를 추가하면 A∪{x} = {1,2,3}이 되고, 결과적으로 {1,2,3} ∉ I 이므로 AAA는 더 이상 확장되지 않는다. 따라서 A={1,2}는 포화 집합이다.

✅ 매트로이드의 확장 정리

매트로이드의 포화 집합 (Maximal Set of a Matroid)
- 매트로이드 I⊆2S 의 모든 포화 집합은 항상 같은 크기를 가진다는 것을 의미한다.
- 포화 집합이란, 더 이상 확장될 수 없는 독립 집합이다.
숲 집합의 경우 (Forest Set Example)
- 그래프 이론에서 숲 집합은 사이클이 없는 간선 집합(즉, 트리 또는 트리들의 집합)이다.
- 숲 집합 F ⊆ 2E 의 포화 집합은 트리(Tree)가 되며, 이는 항상 ∣V∣−1 개의 간선을 포함한다. 정점이 5개라면 ∣V∣−1로 인해 트리는 4개의 간선을 가지게 된다.
설명: 최소 신장 트리 (MST) 문제를 예로 들면:
- 서로 다른 방법으로 트리를 구성하더라도, 선택된 간선의 수는 항상 ∣V∣−1로 동일하다. 포화된 집합의 크기는 항상 일정하다.

✅ 가중치 매트로이드와 그리디 알고리즘 (Weighted Matroid & Greedy Algorithms)

💡요약: 그리디 알고리즘의 최적해를 보장하는 매트로이드 구조는 있을까? 있다. 가중치 매트로이드(Weighted Matroid) 이다.

가중치 매트로이드란?
- 매트로이드에서 원집합 S의 원소들이 가중치(weight)를 가지는 매트로이드이다.
- 매트로이드 I ⊆ 2S에서 각 원소에 가중치(양의 값)가 부여된 구조를 말한다.
- 즉, 원소마다 중요도나 값어치를 나타내는 숫자가 매겨져 있다.
- 예를들어 그래픽 매트로이드에서 간선의 가중치의 합이 최대가 되는 숲을 찾는 경우 → 최대 신장 트리 (Maximum Spanning Tree) 문제가 된다.
목표:
- 가중치 매트로이드에서, 가중치의 합을 최대화하거나 최소화하는 해를 찾는것이 목표이다.

✅ 최대 가중치 합 구하기 (그리디 알고리즘)

알고리즘:
- 초기 해 집합 A=∅
- 원소들을 가중치 w기준으로 내림차순 정렬한다.
- 각 원소 x ∈ S에 대해:
  - A ∪ {x} 이면, A ← A∪{x}
설명:
- 가중치가 가장 큰 원소부터 하나씩 선택한다.
- 선택된 원소들이 매트로이드의 독립 조건을 만족해야 한다.
- 결과적으로 최대 가중치를 가지는 해를 구성하게 된다.

7️⃣ 문제 공간 탐색 (Problem Space Exploration)

💡요약: 매트로이드에서 문제 공간 탐색은 주어진 문제의 가능한 모든 독립 집합(independent sets)을 효율적으로 탐색하여 최적의 해를 찾는 과정이다. 매트로이드의 독립성 조건과 그리디 알고리즘의 구조적 특성은 이러한 탐색을 효율적으로 찾게 한다. 문제 공간 탐색 문제를 해결하는 방법은 크게 두 가지 알고리즘 유형으로 나뉜다.

✅ 구축형 알고리즘 (Constructive Algorithm)

공집합에서 시작하여 해를 하나씩 추가하며 최적해를 구축해나가는 방식이다.
기존에 배웠던 알고리즘과 비슷하다. 첫번째 해 선택, 두번째 해 선택, 세번째 해 선택하며 전체적으로 종료 조건을 만족할 때 해집합이 완료되고 종료된다.
"건설자적인 개념"으로 접근하는 방식이다.

✅ 구축형 알고리즘 과정(Process of Constructive Algorithms)

공집합 → 최적해 원소 추가 → 온전한 해 완성
매 단계마다 탐색 공간이 줄어들고, 최적해를 향해 나아간다. 각 단계에서 최적의 원소를 선택하게 된다.

✅ 구축형 알고리즘의 활용 예제

최소 신장 트리 (MST): 프림 알고리즘, 크루스칼 알고리즘
Shortest Path: 다익스트라 알고리즘
- 트리의 간선을 하나씩 추가하며 최적해를 구축해나간다.

💡중요포인트: 최적해를 구할 때, 그리디 알고리즘을 활용하여 효율적으로 해를 구성한다.

✅ 개선형 알고리즘 (Iterative Improvement Algorithm)

처음부터 어떤 해에서 시작하는데, 이 해는 조건은 만족하지만 최적해는 아닌 해이다. 이 해를 조금씩 바꿔가며 최적해를 찾아간다.
여행자의 관점으로 해를 조금씩 바꿔가며 최적해로 이동하는 알고리즘이다.

✅ 개선형 알고리즘 과정(Process of Improvement Algorithms)

초기해 → 해를 수정 → 최적해 도달.
초기해는 임의의 값일 수 있으며, 점진적으로 개선하여 최적해에 도달하게 된다.

✅ 개선형 알고리즘 활용 예제

인공지능 (AI)에서 개선형 알고리즘이 많이 사용되고 있다.
AI 알고리즘은 최적해 탐색 과정에서 매트로이드 구조를 활용하여 효율성을 높인다.

💡중요포인트: 개선형 알고리즘은 탐색 과정에서 다양한 해를 경험하며 최적해를 탐색합니다.

✅ 매트로이드 구조와 두 유형의 알고리즘의 연관성

구축형 알고리즘: 매트로이드 구조는 구축형 알고리즘에서 최적해를 보장한다.
- 예: 프림 알고리즘, 크루스칼 알고리즘.
개선형 알고리즘: 매트로이드 구조는 개선형 알고리즘에서도 최적해를 보장한다.
- 이는 탐욕적 접근법과 독립적 조건을 이용해 효율적으로 최적해를 탐색할 수 있음을 의미한다.
- 개선형 알고리즘의 유형은 아래에 설명할 예정이다.

문제 공간 탐색 (Problem Space Exploration) #1:

매트로이드 구조의 개선형 알고리즘 의사코드(Pseudocode of Improvement Algorithms in Matroid Structure)

초기 상태
- I: 매트로이드의 독립 집합.
- A: 초기 해 집합 (온전한 해지만 최적해는 아님).
- w[]: 각 원소의 가중치 배열.
조건 검사:
- w(a) < w(x) 기존 원소 a의 가중치가 새 원소 x의 가중치보다 작을 때.
- A ∪ {x} - {a} ∈ I 원소 a를 제거하고 x를 추가한 새로운 집합이 매트로이드 조건을 만족할 때
동작:
- A ← A U {x} - {a} 조건을 만족하면 a를 제거하고 x를 추가한 새로운 해를 구성한다.
- 조건을 더 이상 만족하지 않으면 반복 종료한다.
반환: 최적화된 해 A를 반환한다.

💡중요 포인트 기존 원소 a를 제거하고, 새로운 원소 x를 추가한다. 변경된 해가 여전히 매트로이드 조건을 만족하면 이 변경을 유지한다.

문제 공간 탐색 (Problem Space Exploration of Matroid) #2:개선형 알고리즘에 필요한 개념(Concept you should know in Improvement Algorithms)

💡중요 포인트: 인접성과 지역 최적해는 개선형 알고리즘의 작동 방식과 한계를 이해하는 핵심 개념이다. 개선형 알고리즘이 "인접성"을 통해 탐색을 진행하지만, "지역 최적해"에 머물 수 있는 한계를 가진다는 점이다. 이를 탈출하거나 전체 최적해로 나아가기 위한 추가 전략이 필요하다.

✅ 인접성 (Adjacency)

한 원소를 추가하거나 제거하여 다른 해로 이동 가능할 경우, 두 해가 인접 관계에 있다고 한다.
인접성은 개선형 알고리즘에서 현재 해를 다른 해로 이동하는 기준이 된다.

✅ 지역 최적해 (Local Optimum, 끌개)

개선형 알고리즘에서, 현재 해를 계속 개선해 나가다 보면 더 이상 나아갈 수 없는 지점에 도달하게 되는데, 이 지점이 지역 최적해(Local Optimum)이다.
앞서 배운 그리디 알고리즘의 포물선 예제에서 A에서 출발해 인접한 해로 이동시에 더 나은 품질(더 작은 비용)을 가진 해로 계속 이동한다. 더 이상 나아갈 수 없을 때, 이는 지역 최적해가 된다.
지역의 의미: 다양한 골짜기(최소값)가 존재할 경우,
- 현재 위치한 골의 최저점이 지역 최적해가 된다.
- 지역 최적해는 전체 최적해(global optimum)에 도달하지 못할 수 있다.
- 전체 최적해와 달리 지역 최적해는 다양한 골(최소값)이 존재할 경우, 하나의 골에 머물게 된다.

문제 공간 탐색 (Problem Space Exploration of Matroid) #3: 개선형 알고리즘과 매트로이드 공간 정리 (Improvement Algorithms & Matroid Space Overview)

💡 핵심 개념

매트로이드 공간 (Matroid Space) 에서 개선형 알고리즘은 국소 탐색 (Local Search) 만으로도 전역 최적해 (Global Optimum) 를 찾을 수 있다.
이는 매트로이드 공간이 하나의 봉우리 (Peak) 또는 골 (Valley) 만 가지는 구조이기 때문에 가능한 것이다.

✅ 정리 1: 품질 좋은 해의 존재 (Existence of High-Quality Solution)

매트로이드 공간에서, 어떤 품질 좋은 해 (High-Quality Solution) a가 존재한다면,
a와 인접한 해 (Adjacent Solution) 중에서도 품질 좋은 해가 반드시 존재한다.

의미 (Meaning): 개선형 알고리즘이 해를 변경하면서도 품질을 유지할 수 있는 근거가 된다.

✅ 정리 2: 국소 최적해와 전역 최적해의 관계 (Relationship Between Local and Global Optima)

특정 해 a 주변에 더 나은 품질의 해가 없다면, a는 전역 최적해 (Global Optimum)이다.

의미 (Meaning): 매트로이드 공간에서는 지역 최적해 (Local Optimum) 가 곧 전역 최적해가 된다. 이는 매트로이드 공간이 봉우리 (Peak) 나 골 (Valley) 이 하나만 있는 구조이기 때문이다.

✅ 정리 3: 전역 최적해의 가중치 합 동일성 (Weight Consistency in Global Optima)

매트로이드 공간에서, 만약 두 개의 전역 최적해 (Global Optima) 가 존재한다면,
두 해의 가중치 합 (Sum of Weights) 은 항상 동일하다.

예시 (Example): 프림 알고리즘과 크루스칼 알고리즘은 서로 다른 방식으로 최소 신장 트리를 찾는다. 두 알고리즘이 구한 트리는 다를 수 있지만, 가중치 합 (Sum of Weights) 은 동일하다.
의미 (Meaning): 개선형 알고리즘에서도 최적해의 형태는 다를 수 있지만,
결과적으로 "최적 비용 (Optimal Cost)" 은 동일하게 보장된다는 뜻이다.

✅ 정리 4: 동일한 가중치를 가진 해의 연결성 (Connectivity Between Equal-Weight Solutions)

매트로이드 공간에서, 동일한 가중치 (Weight) 를 가진 두 해는 인접 관계 (Adjacency Relationship) 를 따라 서로 연결될 수 있다.

의미 (Meaning): 개선형 알고리즘은 인접 관계를 통해 해를 이동하며, 동일한 품질의 해 사이에서도 연결성 (Connectivity) 을 보장한다.

✅ 매트로이드의 특징

봉우리 (Peak) 또는 골 (Valley) 이 하나인 구조:
- 매트로이드 공간은 하나의 봉우리 또는 골만 가지는 단순한 구조를 가진다.
- 이로 인해, 개선형 알고리즘이 지역 최적해에서 멈출 필요 없이 바로 전역 최적해를 찾을 수 있게된다.
그리디 알고리즘의 적용 가능성 (Applicability of Greedy Algorithms):
- 매트로이드 공간은 탐욕적 선택 조건 (Greedy Choice Property) 을 만족하므로,
  개선형 알고리즘으로 최적해를 보장한다.

From Dijkstra to A: A Deep Dive into Graph Algorithms 2

Heesu Noh — Fri, 06 Dec 2024 07:52:13 GMT

Contents

1️⃣위상 정렬 (Topological Sorting)
2️⃣최단 경로 알고리즘 (Shortest Path Algorithm)
3️⃣다익스트라 알고리즘 (Dijkstra's Algorithm)
4️⃣벨만-포드 알고리즘 (Bellman-Ford Algorithm)
5️⃣모든 쌍 최단 경로 (All-Pairs Shortest Path)
6️⃣강연결 요소 구하기 (Finding Strongly Connected Components, SCC)
7️⃣ A* 알고리즘 (A* Search Algorithm)

Summary: Graph Algorithms Review (그래프 알고리즘 리뷰)

1. Topological Sorting (위상 정렬)

Description (설명): Orders vertices in a Directed Acyclic Graph (DAG) such that no vertex points back to its predecessors (DAG에서 모든 간선의 방향과 위배되지 않도록 정점을 정렬).
Key Points (핵심 포인트):
- Achievable using Depth-First Search (DFS) (DFS를 이용하여 구현 가능).
- 간선이 순방향으로만 흐르도록 보장하며 순환 구조가 없음.

2. Shortest Path Algorithms (최단 경로 알고리즘)

Description (설명): Calculates the path with the minimum cost between two vertices (두 정점 사이의 경로들 중 간선의 가중치 합이 최소가 되는 경로를 구함).
Key Points (핵심 포인트):
- DAG에서는 가중치가 양수여야 함.
- 음의 가중치 합계가 포함된 사이클이 있으면 계산 불가.

3. Cycle-Free Shortest Path (사이클이 없는 최단 경로)

Description (설명): Uses topological sorting to ensure paths are acyclic (DAG에서의 최단 경로는 위상 정렬을 이용하여 수행 가능).
Key Points (핵심 포인트) 각 정점들마다 거리값을 업데이트하여 최소값을 구함.

4. Dijkstra's Algorithm (다익스트라 알고리즘)

Description (설명): Computes shortest paths from a single source vertex, only for non-negative weights (단일 시작점의 최단 경로 알고리즘으로 음의 가중치를 허용하지 않음).
Key Points (핵심 포인트): 출발 정점과 현재 정점 간의 최단 거리를 저장.

5. Bellman-Ford Algorithm (벨만-포드 알고리즘)

Description (설명): Handles graphs with negative weights, unlike Dijkstra’s (다익스트라 알고리즘과 달리 음의 가중치를 허용하는 최단 경로 알고리즘).
Key Points (핵심 포인트): 이중 for문으로 다익스트라 알고리즘에 비해 시간 복잡도가 높음.

6. All-Pairs Shortest Path (모든 쌍 최단 경로)

Description (설명): Finds shortest paths between all pairs of vertices (그래프의 모든 정점들 간의 상호 최단거리를 구하는 문제).
Key Points (핵심 포인트): 네비게이션 또는 네트워크 경로 탐색에서 유용.

7. Floyd-Warshall Algorithm (플로이드-워샬 알고리즘)

Description (설명): Uses dynamic programming to solve all-pairs shortest path problems (정점 집합을 하나씩 증가시키면서 동적 프로그래밍으로 최단 거리 구함).
Key Points (핵심 포인트): 음수 가중치도 허용되며, 밀집 그래프에서 효율적임.

8. Strongly Connected Components (강연결 요소 구하기 알고리즘)

Description (설명): Identifies maximal subgraphs where every pair of vertices is mutually reachable (강하게 연결된 부분 그래프를 DFS를 응용하여 구하는 알고리즘).
Key Points (핵심 포인트): 그래프와 역방향 그래프를 순차적으로 탐색하여 요소를 분리.

9. *A Algorithm**

Description (설명): Combines path cost and heuristic estimates to find the shortest path efficiently (출발점에서 도착점까지 가는 최단 거리를 추정값을 활용하는 알고리즘).
Key Points (핵심 포인트):
- 평가 함수: f(n) = g(n) + h(n)
  - g(n): 출발점에서 현재 노드까지의 실제 비용.
  - h(n): 목표 노드까지의 추정 비용.
- 탐색과 최적화를 균형 있게 수행.

1️⃣위상 정렬

그래프 고급 알고리즘을 배우기에 앞서 우선 위상 정렬 (Topological Sorting)의 개념과 이를 설명하기 위한 DAG (Directed Acyclic Graph)에 대해 알아본다. DAG는 사이클이 없는 유향 그래프 (Directed Acyclic Graph)로, 방향이 있는 그래프이며 특정 정점 간 순서 관계를 나타낸다. “위상 정렬”은 이 DAG의 모든 정점을 일렬로 배치하는 알고리즘으로, 모든 간선의 방향이 위배되지 않도록 정렬하는 것을 목표로 한다.

topological: the study of shapes that can be stretched and moved while points on the shape continue to stay close to each other.

✅ DAG (Directed Acyclic Graph)란?

DAG는 방향이 있는 그래프 (Directed Graph)로, 사이클이 없는 (Acyclic) 구조를 가진다.
즉, 정점에서 출발하여 간선을 따라가다 보면 다시 원래 정점으로 돌아올 수 없는 그래프이다.
DAG의 정점들은 특정 간선의 방향에 따라 정렬된다.
- 예) 정점 A → 정점 B라면, A는 반드시 B보다 먼저 나오게 된.

✅ 위상 정렬 (Topological Sorting)이란?

위상 정렬은 DAG에서 모든 정점을 순서대로 정렬하는 알고리즘 (Algorithm to order all nodes in sequence in a DAG)이다. 즉 DAG의 간선 방향에 따라 정렬된 순서를 만들게 된다.
정렬된 순서에서 간선의 방향이 모두 올바르게 유지되도록 보장한다.

위상정렬 #1: 필요한 개념 (Key Concepts for Topological Sorting)

위상 정렬 (Topological Sorting)을 이해하기 위해 필수적인 두 가지 개념을 설명한다. 위상정렬을 실행 하려면 간선들이 들어오고 나가는지 파악해야 하는데 이는 u를 기준으로 한다.

✅ 진입 간선 (Incoming Edges)

진입 간선은 특정 정점으로 들어오는 방향의 간선 (Edges pointing toward a node)이다.
- 예: 정점 u에 도달하는 화살표가 “진입 간선(Incoming Edges)”이다.

✅ 진출 간선 (Outgoing Edges)

진출 간선은 특정 정점에서 나가는 방향의 간선 (Edges pointing away from a node)이다.
- 예: 정점 u에서 다른 정점으로 나가는 화살표가 진출 간선(Outgoing Edges)이다.

✅진입 차수 (In-degree)

진입 차수는 정점으로 들어오는 간선의 개수 (The number of incoming edges)이다.
- 예: 정점 u에 들어오는 화살표가 3개라면, u의 진입 차수는 3이 된다.
진입 차수가 0이면:
- 이 정점은 다른 정점에 의존하지 않고 가장 먼저 처리될 수 있다.
진입 차수가 1 이상이면:
- 이전에 처리해야 할 정점이 존재하며, 제약 조건이 발생하게 된다.

위상정렬 #2: 라면 끓이기 작업의 선후 관계 (Dependency Graph for Cooking Instant Ramen)

라면 끓이기 작업을 선후 관계에 따라 그래프로 표현한 것이다. 각 작업은 노드 (Nodes)로, 작업 간 관계는 간선 (Edges)으로 나타낸다. 위상 정렬이 어떻게 실생활 작업에 적용될 수 있는지를 보여준다.

선후 관계(Precedence Relationship): 어떤 사상 A가 일어난 뒤에 사상 B가 관측되는 관계를 뜻한다. the specific order in which certain components or joints of a product must be disassembled, based on their relationship to each other.

✅ 그래프 설명:

초기 상태 (Step a): 진입 차수가 0인 노드 선택 (Choose Nodes with In-degree 0):
- "냄비에 물 붓기"와 "라면 봉지 뜯기"는 진입 차수가 0이다.
- 예제에서는 "냄비에 물 붓기"를 먼저 선택하였다.
진출 간선 제거 (Step b): "냄비에 물 붓기"와 연결된 진출 간선 (Outgoing Edges)을 제거하였다. 이 작업으로 "점화"가 진입 차수 0이 되어 선택된다.
점화 이후 (Step c): "점화"와 연결된 노드 3개가 활성화된다: "라면 넣기", "수프 넣기", "계란 풀어넣기". 하지만 이들은 여전히 다른 간선과 연결되어 있어 바로 선택할 수 없다.
다음 작업 선택 (Step d): "라면 봉지 뜯기"는 더 이상 연결된 간선이 없으므로 선택되었다.

라면과 수프 (Step e): "라면 넣기"와 "수프 넣기" 중에서 임의로 "수프 넣기"를 선택하여 제거한다. 순서에 상관없이 추가할 수 있다.
마지막 단계 (Step f): 이제 남은 노드는 "계란 풀어넣기"로, 이를 제거하면 정렬이 완료된다.

✅ 위상 정렬의 활용:

위상 정렬을 사용하면 선후 관계를 유지하면서 작업을 순서대로 수행할 수 있게 된다.
- 예: 불 켜기 → 라면 봉지 뜯기 → 수프 또는 라면 추가.
실생활 예시 (Real-world Analogy):
- 서류를 작성해야 프린트를 할 수 있는 것처럼, 작업 간 의존성을 정리할 때 사용된다.

💡 중요 포인트 (Key Points)

작업 간 선후 관계 (Dependency Relationship)를 이해하는 것이 중요하다.
점화 (Start the Stove)는 모든 작업의 시작점이다.
특정 작업을 수행하려면 필요 조건 (Prerequisites)을 먼저 완료해야 한다.
위상 정렬은 이러한 순서를 결정해주는 효율적 알고리즘 (Efficient Algorithm)이다.

위상정렬 #3: 의사코드 (Pseudocode for Topological Sorting)

💡요약: 위상 정렬은 진입 차수가 0인 정점을 반복적으로 선택하여 결과 배열에 추가하고, 해당 정점과 연결된 간선을 제거하는 방식으로 진행된다. DAG 구조에서는 진입 차수가 0인 정점이 시작점이 되며, 정렬이 끝날 때까지 반복한다. 복수의 정렬이 가능한 이유는 진입 차수가 0인 정점이 동시에 여러 개 있을 수 있기 때문이다.

G: 그래프, v: 정점
진입 차수 0인 정점 선택 (Choose a Node with In-degree 0): 그래프에서 진입 차수가 0인 정점 u를 선택한다. 그 이유는 DAG에서 a, g는 들어오는 간선이 없기 때문에 진입 차수가 0이 된다.
결과 배열에 추가 (Add to Result Array): A[i] ← u 정점 u를 결과 배열 A[i]에 추가한다.
간선 제거 (Remove Outgoing Edges): 정점 u에 진출한 간선을 모두 제거한다.연결된 다른 정점들의 진입 차수를 업데이트한다.
반복 (Repeat): 정점의 개수만큼 위 과정을 반복하여 모든 정점을 정렬하게 된다.
복수의 정렬 가능성: 만약 진입 차수 0인 정점이 여러 개 (Multiple Nodes with In-degree 0)라면, 임의로 하나를 선택해도 된다. 선택 순서에 따라 여러 위상 정렬 결과가 나올 수 있다.

✅ 중요 포인트 (Key Points)

진입 차수 0인 정점 (Nodes with In-degree 0): 그래프에서 시작점으로 사용할 수 있는 정점.
진출 간선 제거 (Remove Outgoing Edges): 정렬된 정점의 간선을 제거하며, 연결된 다른 정점들의 진입 차수를 업데이트
복수의 정렬 가능 (Multiple Sorting Orders): 선택 순서에 따라 결과가 달라질 수 있다.
시간 복잡도 (Time Complexity): O(E + V), 여기서 E는 간선 수, V는 정점 수를 뜻한다.

위상정렬 #4 : 자료구조 (Topological Sorting with Data Structures)

✅ 진입 차수 계산하기 (Calculate In-degree): 노드의 진입 차수를 계산하여 배열에 저장

노드 1에서 2, 3으로 연결된 상태이다. D[2]와 D[3]의 진입 차수를 각각 1씩 증가시킨다.
결과: 진입 차수 리스트 D[N]는 다음과 같이 설정된다
- 1번 노드: 0 (들어오는 간선 없음).
- 2번, 3번 노드: 1 (노드 1에서 들어오는 간선).
- 4번, 5번 노드: 2 (다른 두 노드에서 들어오는 간선).

✅진입 차수 0인 노드 선택 (Select Nodes with In-degree 0)

진입 차수(In-degree) 0인 노드를 선택하여 정렬 리스트에 저장한다.
- 처음에는 노드 1이 진입 차수 0이므로 선택되었다.
- 노드 1과 연결된 간선을 제거하면서, 2번과 3번의 진입 차수를 각각 1 감소시킨다.
- 결과: 2번, 3번 노드의 진입 차수가 0이 된다.

✅ 과정 반복 (Repeat Until All Nodes are Processed)

진입 차수 0이 된 2번 노드를 선택하고, 정렬 리스트에 추가한다.
- 노드 2와 연결된 노드 4와 5의 진입 차수를 각각 1 감소시킨다.
그다음 3번 노드를 선택하고, 진입 차수를 업데이트한다.
4번 노드와 5번 노드가 순차적으로 선택되며, 정렬이 종료된다.
2와 3의 순서는 임의로 선택할 수 있으므로 복수의 정렬 결과가 가능하다.

💡 중요포인트

진입 차수 계산 (Calculate In-degrees): 각 노드의 진입 차수를 미리 계산하여 정렬 과정을 준비한다.
진입 차수 0인 노드 선택 (Select Nodes with In-degree 0): 진입 차수 0인 노드를 우선 선택하여 처리한다. 연결된 간선을 제거하며 다른 노드들의 진입 차수를 업데이트한다.
복수 정렬 가능성 (Multiple Valid Orders): 동일한 진입 차수를 가진 노드 순서는 임의로 선택할 수 있으므로, 여러 정렬 결과가 가능하다.

위상정렬 #5 : 의사코드 - DFS 버전 (Topological Sorting with DFS)

위상 정렬의 이 버전은 깊이 우선 탐색 (Depth-First Search, DFS) 알고리즘을 사용한다.

DFS는 끝까지 탐색한 후 뒤로 돌아오는 재귀적인 구조를 가집니다.
DFS 기반 위상 정렬은 깊이 우선 탐색을 사용하여, 노드의 방문 여부를 확인하며 모든 노드를 탐색하개 된다.
따라서, 위상 정렬에서는 가장 마지막에 도달한 노드부터 정렬이 시작된다.
거꾸로 정렬 (Reverse Order): DFS는 결과를 거꾸로 쌓으므로, 위상 정렬 리스트는 DFS의 반환 순서대로 앞에서부터 정렬되게 된다.

초기화 (Initialization)
- 모든 노드를 방문 하지 않았다는 v.visited←NO 로 초기화한다.
- 정점의 순서는 중요하지 않으므로, 임의의 정점 (Arbitrary Node)부터 탐색을 시작한다.
DFS 호출 (DFS-TS): 아직 방문하지 않은 노드라면, DFS를 호출한다.
깊이 우선 탐색 (Perform DFS):
- 현재 노드를 방문했음을 표시한다: v.visited←YES
- 현재 노드의 인접한 노드들 (Adjacent Nodes)을 탐색한다.
  - 방문하지 않은 인접 노드가 있다면, 해당 노드에서 다시 DFS를 호출한다.
결과 리스트 삽입 (Insert into Result):
- 모든 인접 노드의 탐색이 끝난 후, 현재 노드를 결과 리스트의 맨 앞에 삽입한다.
- DFS의 특성상, 가장 마지막에 방문한 노드부터 정렬이 시작된다.

💡 중요 포인트 (Key Points):

깊이 우선 탐색 (Depth-First Search): DFS를 사용해 모든 노드를 탐색하며, 재귀적으로 호출한다.
정렬 순서 결정 (Order of Sorting): 가장 나중에 방문한 노드가 정렬 리스트의 앞에 위치한다.
재귀 호출 (Recursive Calls): 인접한 노드를 모두 탐색할 때까지 DFS를 계속 호출한다.
시간 복잡도 (Time Complexity):
- O(V + E) 여기서 V는 정점 수, E는 간선 수이다.

위상정렬 #6 : 작동방식 - DFS 버전 (Topological Sorting with DFS)

(b) 수프 넣기 선택: DFS를 시작하며 "수프 넣기" 노드가 선택되었다.
(c) 계란 풀어넣기 발견:
- DFS를 수행하며 "계란 풀어넣기"까지 탐색한다.
- "계란 풀어넣기"는 정렬의 마지막 자리로 설정된다.
(d) 수프 넣기로 돌아가기:
- 계란 풀어 넣기는 갈 곳이 없으므로 DFS 경로를 되돌아가며 "수프 넣기"를 정렬 리스트의 두 번째 자리에 설정하게 된다.
(e) 임의 선택 (냄비에 물 붓기):
- DFS를 수행할 새로운 시작 노드로 "냄비에 물 붓기"를 선택하였다.

(f) 점화로 이동:
- "냄비에 물 붓기"에서 DFS를 통해 "점화"로 이동한다.
(g) 라면 넣기 탐색:
- "점화"에서 연결된 "라면 넣기"로 이동하며, "라면 넣기"가 DFS 종료 지점으로 설정된다.
- "라면 넣기"는 정렬의 세 번째 자리를 차지한다.
(h) DFS 경로 거슬러 올라가기:
- DFS를 되돌아가며 "점화"를 네 번째 자리로 설정한다.

(i) 냄비에 물 붓기 정렬:
- "냄비에 물 붓기"가 정렬의 다섯 번째 자리에 설정된다.
(j) 라면 봉지 뜯기:
- 마지막 남은 노드인 "라면 봉지 뜯기"가 정렬의 첫 번째 노드로 설정된다.

최종 순서 (Final Order):

"라면 봉지 뜯기"→"계란 풀어넣기"→"수프 넣기"→"라면 넣기"→"점화"→"냄비에 물 붓기"

💡 중요 포인트 (Key Points):

DFS 특성 활용: 깊이 우선 탐색의 특성을 통해 가장 깊은 노드부터 역순으로 정렬되었다.
거꾸로 정렬 (Reverse Order): DFS 종료 시점부터 정렬 리스트에 추가하므로, 결과가 역순으로 쌓이게 된다.
임의 선택 가능 (Arbitrary Selection): 시작할 노드가 여러 개일 경우, 임의로 선택할 수 있으며 결과적으로 복수의 정렬 순서가 가능하다.

2️⃣최단 경로 알고리즘 (Shortest Path Algorithm)

💡정리: 최단 경로 (Shortest Path)란 두 정점 사이의 경로들 중 간선의 가중치 합 (Sum of Edge Weights)이 가장 작은 경로를 말한다. 예를 들어, 노드 A → 노드 B로 가는 여러 경로 중 가중치가 최소인 경로가 최단 경로가 된다.

최단 경로 알고리즘은 그래프 이론에서 중요한 개념으로, 실생활의 네트워크 경로 탐색, 물류 최적화, 길 찾기 등에서 널리 활용되고 있다.

✅ 최단 경로 알고리즘을 위한 조건

방향성 (Direction):
- 그래프가 방향성을 가지면, A → B와 B → A의 경로가 달라진다.
- 따라서 방향성을 명확히 정의해야 한다.
가중치 (Weights):
- 각 간선은 특정 비용(가중치)을 가진다
- 가중치는 경로의 총 비용을 계산하는 데 사용된다.

✅ 음수 가중치와 사이클

음수 가중치 (Negative Weights):
- 간선의 가중치가 음수인 경우에도 최단 경로를 구할 수 있다.
- 단, 사이클이 없는 경우에만 적용 가능하다.
음의 사이클 (Negative Cycle):
- 사이클(순환)이 존재하며, 그 가중치 합이 음수일 경우 문제가 발생한다.
  - 예: 음수 가중치를 가진 간선을 따라 계속 순환하면 비용이 무한히 줄어드는 현상이 발생하게 된다.
- 이럴 경우엔, 최단 경로를 정의할 수 없게 되며, 알고리즘이 무한 루프에 빠질 수 있다.
- 하지만 음수 가중치가 있어도문제가 되지 않는다. 알고리즘을 통해 최단경로를 구할 수 있다. 사이클이 생기는 경우만 안되는 것이다.
음의 사이클 탐지 (Detecting Negative Cycles):
- 이따 배울 벨만-포드 알고리즘 (Bellman-Ford Algorithm)은 음의 사이클 여부를 탐지할 수 있는 알고리즘이다. 최단 경로 문제를 해결하는 데 사용된다.

✅ 무방향 그래프와 방향 그래프

무방향 그래프는 양방향 간선을 포함하는 방향 그래프로 변환하여 최단 경로 알고리즘을 적용할 수 있다. 예) A — B → A → B로 변환

최단 경로 알고리즘 #1 : 분류 (Type of Shortest Path Algorithm)

💡정리: 최단 경로 알고리즘은 크게 두 가지로 나뉜다. 알고리즘은 문제 상황에 맞게 선택되며, 다음 강의에서는 각 알고리즘의 구현 및 세부 원리를 배울 예정이다.

단일 시작점 최단 경로: 하나의 출발점에서 모든 노드까지의 최단 경로를 계산.
모든 쌍 최단 경로: 모든 노드 간의 최단 경로를 계산.

✅단일 시작점 최단 경로 (Single-Source Shortest Path)

개념: 그래프에서 하나의 시작점에서 출발하여 모든 노드까지의 최단 경로(최솟값)를 구하는 알고리즘이다.
- 시작점에서 종료점을 정하지 않고, 시작점에서 그래프의 모든 정점으로 가는 경로를 계산한다.
특징: 좀 복잡해 보이는 과정이다. 비교적 복잡도가 낮은 알고리즘을 사용할 수 있지만, 그래프의 크기에 따라 여전히 복잡도가 높은 편이다.
주요 알고리즘:
1. 다익스트라 알고리즘 (Dijkstra's Algorithm):
  - 간선 가중치가 모두 양수일 때 사용한다.
  - 시간 복잡도: O(V log V + E) (우선순위 큐 활용 시).
2. 벨만-포드 알고리즘 (Bellman-Ford Algorithm):
  - 음수 가중치를 포함한 그래프에서도 사용 가능하다.
  - 음의 사이클 탐지 가능하다.
  - 시간 복잡도: O(V⋅E)
3. 사이클이 없는 그래프 (Acyclic Graph):
  - 그래프에 사이클이 없을 경우, 더 간단하고 효율적인 알고리즘 사용이 가능하다.
  - 위상 정렬 기반으로 구현한다.
  - Acyclic : Not displaying or forming part of a cycle.

✅ 모든 쌍 최단 경로 (All-Pairs Shortest Path)

개념: 그래프 내 모든 정점 쌍 사이의 최단 경로를 계산하는 알고리즘이다.
- 단일 시작점보다 복잡도가 당연하게도 훨씬 높다.
특징: 모든 노드와 모든 노드 사이의 경로를 고려하므로, 그래프의 크기가 클수록 계산량이 기하급수적으로 증가하게 된다.
주요 알고리즘:
1. 플로이드-워샬 알고리즘 (Floyd-Warshall Algorithm):
  - 모든 쌍 최단 경로를 계산하는 대표적인 알고리즘이다.
  - 시간 복잡도: O(V³)
2. 존슨 알고리즘 (Johnson's Algorithm):
  - 음수 가중치를 포함한 그래프에서도 사용 가능한 알고리즘이다.
  - 다익스트라 알고리즘과 벨만-포드 알고리즘을 조합하여 효율적으로 계산할 수 있다.
  - 시간 복잡도: O(V² log V + V ⋅ E)

💡 알아야 할 핵심

최단 경로 알고리즘의 두 가지 분류:
- 단일 시작점 최단 경로 (Single-Source Shortest Path):
  시작점에서 모든 노드까지의 최단 경로.
- 모든 쌍 최단 경로 (All-Pairs Shortest Path):
  그래프의 모든 정점 쌍 사이의 최단 경로.
각 알고리즘의 목적에 따른 사용:
- 시작점과 종료점이 명확하면 단일 시작점 알고리즘을 사용.
- 전체 그래프의 모든 정점 쌍 경로가 필요하면 모든 쌍 최단 경로 알고리즘을 선택.
주요 알고리즘 목록:
- 단일 시작점: 다익스트라, 벨만-포드, 위상 정렬 기반 알고리즘.
- 모든 쌍: 플로이드-워샬.

최단 경로 알고리즘 #2 : 알아야 할 개념, 완화 (Concept you should know in Single-Source Shortest Path Algorithms)

“완화(relaxation)”란?

최단 경로 알고리즘에서 현재 계산된 거리 값을 더 작은 값으로 갱신하는 과정이다. 이 과정은 새로운 정점이나 경로를 추가적으로 고려했을 때 더 짧은 거리 경로를 발견할 가능성이 있을 때 수행된다.

현재 상태:
- 시작점 r에서 특정 정점 v까지의 거리 A는 이미 계산되어 있다.
- u는 현재 탐색 중인 정점이다.
- w_u,v는 u에서 v로의 간선 가중치를 의미한다.
새로운 경로 탐색:
- r → u 까지의 거리 B가 이미 계산되어 있다고 가정한다.
- u → v 를 거쳐 r → u 의 거리를 계산하면, 총 거리는 B+w{u,v}가 된다.
완화 조건:
- 기존 거리 A와 새로운 경로 거리w_{u,v}를 비교한다.
- 만약 새로운 경로의 거리가 기존 거리보다 작다면:
  - A > B + w{u,v}
  - v까지의 최단 거리 값을 A에서 B + w_{u,v}로 업데이트한다.
업데이트:
- 갱신된 거리 값은 A ← B + w{u,v} 로 저장된다.
- 이 과정을 모든 가능한 간선에 대해 반복하여 최적의 경로를 찾아가게 된다.

✅ 완화의 목적

완화는 기존에 계산된 거리 값이 정확하지 않을 가능성을 염두에 두고, 새로운 경로를 통해 더 짧은 거리 값을 찾아내는 과정이다. 최종적으로 모든 간선을 완화하여 최단 경로를 도출할 수 있다.

💡 이미지의 주요 포인트

기존 거리 A: 시작점 r에서 정점 v까지의 거리.
새로운 거리 B + w_{u,v}: r에서 u를 거쳐 v까지 가는 거리.
비교 조건: 새로운 경로가 더 짧으면 기존 값을 갱신.
결과: 최단 거리 리스트에서 정점 v의 값을 업데이트.
다익스트라(Dijkstra) 및 벨만-포드(Bellman-Ford) 알고리즘에서 이 완화 과정을 활용하여, 인접 노드 간의 최단 거리를 반복적으로 갱신해 나간다.

최단 경로 알고리즘 #4 : 최단 경로 알고리즘의 동적 프로그래밍 적용

✅ DP 테이블

정의: 시작점 r로부터 특정 정점 v까지의 거리를 저장한다. 이를 v.dist로 표시한다.

✅ 점화식(Recurrence relation)

점화식의 목적은 현재까지 계산된 거리 값을 새로운 경로를 통해 최소화하는 것이.
- v.dist : 현재까지 계산된 시작점에서 정점 v까지의 거리.
- u.dist : 시작점에서 정점 u까지의 거리.
- w_{u,v}: 정점 u에서 v로 이동하는 간선의 가중치.

현재 상태:
- 시작점 r에서 u까지의 거리 u.dist는 이미 계산된 값이다.
- u에서 v까지의 간선 가중치 w_{u,v}를 추가로 고려한다.
새로운 거리 계산:
- 시작점 r에서 v까지의 기존 거리 v.dist와 새로운 경로를 통해 계산된 거리 u.dist + w_{u,v}를 비교한다.
최소 거리 선택:
- 기존 거리 v.dist와 새로운 거리 u.dist + w{u,v} 중 더 작은 값을 v.dist로 업데이트한다.
결과:
- u를 거친 새로운 경로가 더 짧다면 DP 테이블에서 v.dist 값을 갱신하게 된다.

최단 경로 알고리즘 #5 : 사이클이 없는DAG(Directed Acyclic Graph) 최단 경로를 구하는 알고리즘

사이클이 없는(DAG) 최단 경로 알고리즘은 DFS 기반 위상 정렬과 같이 위상 정렬 기반으로 동작하는 알고리즘이다. 양수 및 음수 가중치를 모두 허용하며, 음수 사이클이 없는 DAG에서 안전하게 동작한다.

초기화 (Initialization):
- 모든 정점 u ∈ V의 초기 거리 값을 무한대 (∞)로 설정한다. u.dist ← ∞ u
- 시작점 r의 거리 값은 0로 설정한다 : r.dist ← 0
위상 정렬 (Topological Sort):
- 그래프 G의 정점들을 위상 정렬 순서로 나열한다.
- 위상 정렬은 DAG에서 모든 정점의 선행 관계를 유지하며 정렬한다.
거리 갱신 (Relaxation):
- 위상 정렬된 순서대로 각 정점 u를 순회한다.
- 각 정점 u의 인접 리스트에 포함된 모든 정점 v ∈ u.adjlist에 대해:
  - 만약 u.dist + w{u,v} < v.dist 이면, v.dist 값을 u.dist + w{u,v}로 업데이트 한다.
  - v.prevv를 u로 설정하여 최단 경로를 추적 가능하도록 유지한다.
최종 결과:
- v.dist는 시작점 r에서 각 정점 v까지의 최단 거리를 저장한다.
- v.prev 를 통해 각 정점에 도달하는 최단 경로를 역추적할 수 있게 된다.

최단 경로 알고리즘 #6: 사이클이 없는DAG(Directed Acyclic Graph) 최단 경로를 구하는 알고리즘의 동작 예

1단계: 그래프 입력 및 시작 정점 설정

DAG(Directed Acyclic Graph)가 주어졌다.
시작 정점은 r로 설정되었다.
모든 간선은 가중치가 있으며, 정점 간의 방향이 지정되어 있다.

2단계: 위상 정렬 수행

DAG에서 위상 정렬을 수행하여 정점들의 처리 순서를 정한다.
위상 정렬 순서는 r → 원형 정점 → 하트 정점 → 마름모 정점 → 삼각형 정점으로 결정된다.

3단계: 거리 초기화

시작 정점(r)의 거리는 0으로 설정되고, 나머지 정점의 초기 거리는 무한대(∞)로 설정된다.
초기 상태: r: 0 , 나머지 정점: ∞

4단계: 정점 r의 거리 갱신

정점 r(0)에서 인접 정점으로 거리 값을 갱신한다.
- r → 원형 정점: 거리 3, r → 하트 정점: 거리 7
- 결과: r: 0, 원형: 3, 하트: 7, 나머지 정점: ∞

5단계: 원형 정점의 거리 갱신

정점 원형에서 인접 정점으로 거리 값을 갱신한다.
- 원형 → 하트 정점: 기존 거리 7과 새로운 거리 3 + 4 = 7 비교 → 갱신 필요 없음.
- 결과: r: 0, 원형: 3, 하트: 7, 나머지 정점: ∞

6단계: 하트 정점의 거리 갱신

정점 하트에서 인접 정점으로 거리 값을 갱신한다.
- 하트 → 마름모 정점: 거리 7 + (-2) = 5
- 결과: r: 0, 원형: 3, 하트: 7, 마름모: 5, 삼각형: ∞

7단계: 마름모 정점의 거리 갱신

정점 마름모에서 인접 정점으로 거리 값을 갱신한다.
- 마름모 → 삼각형 정점: 거리 5 + (-3) = 2
- 결과: r: 0, 원형: 3, 하트: 7, 마름모: 5, 삼각형: 2

8단계: 최종 결과

모든 정점의 최단 거리가 계산되었다. 최단 경로는 아래와 같다:
- r → 원형(3)
- r → 하트(7)
- r → 하트 → 마름모(5)
- r → 하트 → 마름모 → 삼각형(2)
굵은 간선으로 최단 경로를 표시하였다.

💡시작점이 있는 최단 경로 알고리즘의 유형

단일 시작점 최단 경로(Single Source Shortest path)란?

그래프에서 하나의 시작점 (Single Source)에서 출발하여, 모든 정점 (All Nodes)까지의 최단 경로를 구하는 알고리즘이다.
이번 시간에는 두 가지 주요 알고리즘을 배우게 된다:
1. 다익스트라 알고리즘 (Dijkstra's Algorithm)
2. 벨만 포드 알고리즘 (Bellman-Ford Algorithm)

✅ 다익스트라 알고리즘 (Dijkstra's Algorithm)

특징: 음의 가중치 (Negative Weights)를 허용하지 않음 (Not Allowed).
- 예: 간선의 가중치가 -2인 경우 사용 불가.
- 시작점에서 가장 가까운 정점부터 탐색하며 비용을 갱신 (Update Costs)하는 알고리즘이다..
수행 시간 (Time Complexity):
- O(E log V) 여기서 E는 간선의 개수, V는 정점(vertex, node)의 개수이다.
- 매우 빠른 알고리즘이다.
- 정점(vertex) node를 의미한다. 간선(edge) 노드 간에 연결되어 있는 선을 의미
사용 조건:
- 모든 간선의 가중치가 양수일 때 (All Positive Weights) 사용한다.
- 대규모 그래프에서 효율적이다.

✅ 벨만 포드 알고리즘 (Bellman-Ford Algorithm):

특징: 음의 가중치 (Negative Weights)를 허용한다.
- 예: 간선의 가중치가 -2인 경우에도 사용 가능하다.
- 모든 간선을 반복적으로 확인하며 비용을 갱신 (Update Costs)한다.
수행 시간 (Time Complexity):
- O(E ⋅ V), 상대적으로 느린 알고리즘이다.
사용 조건:
- 그래프에 음의 가중치 (Negative Weights)가 포함되어 있을 경우 사용한다.
- 음의 가중치 사이클(무한히 비용을 감소시키는 경로)을 탐지할 수도 있다.

3️⃣다익스트라 알고리즘(Dijkstra's Algorithm)

다익스트라 알고리즘의 과정 (Detailed Analysis of Dijkstra's Algorithm)

1. 초기화 (Initialization):

시작점에서 출발하기 위해, 시작점을 제외한 모든 정점의 거리 (Distance)를 무한대 (Infinity)로 설정한다.
- 이 단계는 알고리즘이 최단 경로를 찾기 시작하기 위한 준비 단계이다.

2. 신규 정점 선택 (Select New Node):

방문하지 않은 정점 중에서 거리가 가장 짧은 정점 (Node with the Shortest Distance)을 선택한다.
- 예: 시작점 A에서 출발하여 인접한 노드 B, C 중에서 가장 비용이 작은 노드를 선택한다.
- 이 과정을 "신규 정점 탐색 (New Node Selection)"이라고 한다.

3. 거리 완화 (Distance Relaxation):

선택된 정점(노드)의 인접한 노드들 (Adjacent Nodes)을 탐색하며 거리를 업데이트 (Update Distance)합니다.
- 만약 새로운 경로를 통해 도달하는 비용이 더 작다면, 그 값을 갱신한다.
- 예: A → B의 비용이 5이고, A → C → B를 통해 비용이 4라면, B의 거리 값을 4로 갱신한다.

4. 반복 (Repeat Until All Nodes Are Visited):

모든 정점이 방문될 때까지 위 과정을 반복한다
- 새로운 정점을 선택 → 인접한 정점의 거리 업데이트

5. 종료 (Termination):

모든 정점의 최단 경로가 계산되었으면 알고리즘이 종료되게 된다.
- 결과적으로, 시작점에서 모든 정점으로 가는 최단 경로가 계산된다.

✅중요 포인트 (Key Points)

다익스트라 알고리즘은 단계별로 정점과 거리를 갱신 (Update Step by Step)하며 최단 경로를 계산하는 방식이다.
매 단계마다 최단 경로를 확정하기 때문에 효율적이고 빠르다. 단순한 구조 덕분에 널리 사용된다.
음의 가중치가 없을 때만 사용 가능하며, O(E log V)의 시간 복잡도를 가진다.

다익스트라 알고리즘 #1: 의사코드 분석 (Analysis of Dijkstra's Algorithm Pseudocode)

1. 알고리즘 초기화 (Initialization):

방문한 정점 집합 S (Visited Nodes Set):
- 초기 상태에서는 아무 정점도 방문하지 않았으므로, S는 공집합 (Empty Set)으로 설정된다. S←ϕ
- ϕ는 수학에서 공집합 (empty set)을 나타내는 기호이다. 이는 비어 있는 집합을 의미하며, 집합 안에 어떤 요소도 포함되지 않은 상태를 나타낸다. 알고리즘이 진행되면서 방문한 정점들은 S에 추가되어, 점점 채워지게 된다.
거리 초기화 (Distance Initialization):
- 시작점을 제외한 모든 정점의 거리를 무한대 (Infinity)로 초기화한다. u.dist ← ∞
- 시작점의 거리를 0으로 설정한다. r.dist ← 0

2. 반복문 (Main Loop):

반복문은 모든 정점이 방문될 때까지 실행되게 된다.
- 방문한 정점의 집합 S가 전체 정점 집합 V와 같아질 때 종료된다. while (S ≠ V)

3. 방문하지 않은 정점 중 최소 거리 정점 선택 (ExtractMin):

방문하지 않은 정점 중에서 최단 거리 값을 가진 정점 u를 선택한다: u ← extractMin(V−S)
- 이 과정은 앞에서 배운 그리디 알고리즘 (Greedy Algorithm)의 특성을 따른다.

4. 방문 집합에 추가 (Add to Visited Set):

선택된 최단 거리 값을 가진 정점 u를 방문한 집합 S에 추가한다: S ← S∪{u}
S: 이미 방문한 정점들의 집합
{u}: 새롭게 방문한 정점 u를 포함하는 집합.
S ∪ {u}: 현재 집합 S에 새 정점 u를 합쳐 새로운 방문 집합을 만든다는 뜻.
특정 정점 u를 방문한 것으로 기록하는 과정을 수학 기호로 표시하였다.

5. 거리 완화 (Distance Relaxation):

u와 인접한 노드들 (Adjacent Nodes)의 거리를 업데이트한다.
- 새로운 경로를 통해 비용이 더 작아지는 경우, 거리 값을 갱신하게 된다:
  - v.dist ← u.dist + wuv
  - v.prev ← u
- 이 과정을 완화 (Relaxation)이라고 부른다.
💡 v.dist ← u.dist + wuv 추가 설명
- 현재 정점 u를 통해 정점 v로 가는 새로운 경로를 고려하여, v까지의 거리 값(v.dist)을 갱신한다는 뜻이다.
- u.dist: 시작점에서 정점 u까지의 최단 거리이다.
- w{uv}: 정점 u에서 v로 가는 간선의 가중치이다.
- u.dist + w{uv}: 시작점에서 u를 거쳐 v까지 가는 경로의 총 거리를 뜻한다.
- v.dist: 기존에 저장된 시작점에서 v까지의 최단 거리 값이다.
💡v.prev ← u 추가 설명
- 정점(노드) v까지의 최단 경로에서 바로 이전 정점(노드)이 u임을 기록하는 곳이다.
- v.prev는 최단 경로를 추적하기 위해 사용된다.
- 갱신된 경로로 v.dist가 더 짧아졌다면, u가 v의 직전 노드임을 저장하는 곳이다.

💡중요 포인트 (Key Points)

다익스트라 알고리즘은 그리디 알고리즘 (Greedy Algorithm)으로 작동한다.
- 현재 상태에서 최적의 선택(최소 거리 정점)을 반복적으로 수행하게 된다.
시간 복잡도 (Time Complexity):
- O(E log ⁡V)(우선순위 큐를 사용할 경우).
음의 가중치 허용 불가 (Negative Weights Not Allowed):
- 음수 가중치가 있는 그래프에서는 사용 불가능하다.

다익스트라 알고리즘 #2: 그래프 예제 (Dijkstra's Algorithm in Action)

1. 초기화 (Initialization): 시작점 노드 0에서 알고리즘이 시작된다.

노드 0의 거리 값은 0 (0.dist = 0)으로 설정되고, 나머지 노드들의 거리 값은 무한대 (Infinity)로 초기화된다.

2. 첫 번째 단계 (Step 1): 0에서 인접한 노드들(8, 9, 11)의 거리 값을 업데이트한다.

업데이트 결과: 노드 8: 0 + 8 = 8, 노드 9: 0 + 9 = 9, 노드 11: 0 + 11 = 11
방문한 노드(0)는 고려하지 않는다. 가장 작은 거리 값(8)을 가진 노드 8이 선택된다.

3. 두 번째 단계 (Step 2): 노드 8에서 인접한 노드를 탐색하여 거리 값을 업데이트한다.

8 → 10 : 8 + 10 = 18
- 업데이트 결과: 노드 10의 거리 값이 무한대에서 18로 설정되었다.
- 다음으론 나머지 노드에서 가장 작은 거리 값(9)을 가진 노드 9가 선택된다.

4. 세 번째 단계 (Step 3): 노드 9에서 인접한 노드를 탐색한다.

9 → 10: 기존 거리 값(18)보다 새로운 값(9 + 1 = 10)이 더 작으므로 업데이트한다.
기존의 값이 더 작은 경우에는 업데이트 되지 않으므로 11은 변하지 않고 그대로이다.
- 업데이트 결과: 노드 10: 18 → 9+1 = 10
- 다음으로 가장 작은 거리 값(10)을 가진 노드 10이 선택되게 된다.

5. 네 번째 단계 (Step 4): 노드 10에서 인접한 노드를 탐색한다.

10 → 12: 기존 거리 값(무한대)에서 새로운 값(10 + 2 = 12)로 업데이트된다.
- 업데이트 결과: 노드 12: 무한대 → 10 + 2 = 12
- 업데이트 결과를 확인 한 후, 다음으로 가장 작은 거리 값(11)을 가진 노드 11이 선택된다. (이미 방문한 10은 선택 옵션에서 제외되었다)

6. 다섯 번째 단계 (Step 5): 노드 11에서 인접한 노드를 탐색합니다.

11 → 19: 업데이트.
- 업데이트 결과: 11이 바라보는 노드 2개, 19 : 무한대 → 11 + 8 = 19
- 비용값이 최소인 값을 보니 다음으로 가장 작은 거리 값(12)을 가진 노드 12가 선택된다.

7. 여섯 번째 단계 (Step 6): 노드 12에서 인접한 노드를 탐색한다.

12하고 연결된 노드는 16 하나밖에 없으므로 기존의 값보다 더 작은 16으로 설정이 된다.
12 → 16: 기존 거리 값보다 새로운 값(12 + 4 = 16)로 업데이트한다.
- 업데이트 결과: 노드 16: 19 → 12 + 4 = 16
- 다음으로 가장 작은 거리 값(16)을 가진 노드 16이 선택된다.

8. 마지막 단계 (Step 7): 노드 16에서 인접한 노드를 탐색한다.

16 → 19: 이미 방문한 노드로 더 이상 업데이트가 필요없다.
모든 노드를 방문했으므로 알고리즘이 종료된다.

시작점 노드 0에서 모든 노드까지의 최단 경로가 계산되었다.
최적 경로 그래프: 각 노드로 가는 최단 경로가 붉은색 화살표로 표시되었다.

💡 중요 포인트 (Key Points)

다익스트라 알고리즘은 그리디 방식 (Greedy Approach)으로 최적의 해를 단계적으로 선택한다.
거리 값이 작은 노드부터 업데이트를 진행한다.
업데이트(완화) 과정 (Relaxation): 기존 거리 값보다 작은 값이 발견되면 갱신된다.

💡👀다익스트라(Dijkstra)❓

다익스트라는 이를 만든 수학자의 이름에서 유래되었다. 에츠허르 비버 다익스트라(Edsger Wybe Dijkstra)라는 네덜란드의 유명한 컴퓨터 과학자이다. 다익스트라는 컴퓨터 과학 발전에 큰 기여를 한 인물로, 특히 그래프 알고리즘과 소프트웨어 엔지니어링 분야에서 잘 알려져 있다. 1956년에 고안되어 오늘날까지도 다양한 응용 분야에서 사용되고 있다.

다익스트라 알고리즘 #3 : 자료구조 (Dijkstra's Algorithm Data Structure Example)

1. 인접 리스트로 그래프 구현 (Graph Representation Using Adjacency List)

adjacencyThe fact of being very near, next to, or touching something

그래프를 인접 리스트 (Adjacency List)로 표현한다.
- 각 노드는 자신과 연결된 인접 노드와 가중치 (Adjacent Node and Weight)를 저장한다.
  - 노드 1 → [노드 2(8), 노드 3(3)]
  - 노드 2 → [노드 4(4), 노드 5(15)]
  - 노드 3 → [노드 4(13)]
  - 노드 4 → [노드 5(2)]

파이썬에선 리스트이지만 일반적으론 배열이다.
2. 최단 거리 리스트 초기화 (Initialize Distance List): 최단 거리 리스트 (Distance List)를 초기화한다.
- 시작 노드(1번)의 거리는 0으로 설정한다. D[1] = 0
- 나머지 노드의 거리는 무한대 (Infinity)로 초기화한다. ∞

3. 가장 작은 거리 노드 선택 (Select Node with Minimum Distance)

초기화된 리스트에서 가장 작은 값을 가진 노드를 선택한다.
- 첫 번째로 선택되는 노드는 항상 시작점(1번)이다.
- 선택된 노드(1번)와 연결된 인접 노드(2번, 3번)의 거리를 업데이트 (Update)한다.
  - 2번 노드: D [2] = min⁡ (∞, 0 + 8) =8
  - 3번 노드: D [3] = min ⁡(∞, 0 + 3) = 3

4. 거리 완화 (Relaxation): 선택된 노드와 연결된 노드들의 거리를 계속 업데이트한다.

인접 리스트를 탐색하며, 새로운 경로가 더 짧다면 거리 값을 갱신한다.
완화 과정:
- 선택된 노드의 거리 값 + 간선의 가중치가 기존 거리 값보다 작다면 갱신.
- 방문한 노드는 다시 선택되지 않도록 방문 리스트를 관리합니다.

5. 반복 (Repeat Until All Nodes Are Visited)

위 과정을 모든 노드를 방문할 때까지 반복한다.
- 1번 → 3번 → 2번 → 4번 → 5번 순서로 진행된다.
- 각 단계에서 선택된 노드와 인접한 노드의 거리 값을 업데이트한다.
- 모든 노드를 방문하면 알고리즘이 종료된다.

💡중요 포인트 (Key Points)

인접 리스트 사용: 그래프를 효율적으로 표현하고 탐색한다. (리스트 = 배열)
거리 완화: 최단 거리 값을 갱신하며 최적의 경로를 찾는다.
반복: 모든 노드를 방문할 때까지 최소 거리 노드를 반복적으로 선택하고 업데이트한다.

다익스트라 알고리즘 #4 : 상세 의사 코드 (Detailed Pseudocode of Dijkstra's Algorithm)

1. 주요 데이터 구조 (Key Data Structures):

노드와 엣지 (Nodes and Edges):
- V: 노드 개수 (Number of Nodes)
- E: 엣지 개수 (Number of Edges)
- K: 출발 노드 (Start Node)
거리 리스트 (Distance List):
- 각 노드로부터의 최단 거리 (Shortest Distance)를 저장.
- 초기 값은 무한대 (Infinity)로 설정된다.
방문 리스트 (Visited List):
- 각 노드가 방문되었는지 확인하는 체크리스트 (Checklist)이다.
인접 리스트 (Adjacency List):
- 각 노드와 연결된 노드와 가중치를 저장한다.
- 예: 노드 1 → [(노드 2, 가중치 8), (노드 3, 가중치 3)]
우선순위 큐 (Priority Queue):
- 노드의 거리 값을 기준으로 가장 작은 값을 자동으로 선택해주는 큐.
- 예: 거리 값이 작은 순서로 데이터를 출력한다.

2. 알고리즘 실행 과정 (Algorithm Steps):

초기화 (Initialization):
- 거리 리스트: 출발 노드의 거리를 0으로 설정, 나머지는 무한대로 초기화.
- 우선순위 큐: 출발 노드를 큐에 넣는다.
- 위에서 설명했듯이 우선선위 큐는 노드의 거리 값을 기준으로 가장 작은 값을 자동으로 선택해주는 큐이다. 자동으로 거리가 최소인 노드를 선택하는 것이다. 출발노드의 거리는 0이므로 최소인 노드가 된다.
노드 선택 (Select Node):
- 우선순위 큐에서 가장 작은 거리 값을 가진 노드를 선택한다.
- 이미 방문한 노드인지 확인한다.
거리 업데이트 (Distance Update - Relaxation):
- 선택된 노드와 연결된 인접 노드들의 거리 값을 확인한다.
- 현재 노드의 거리 값 + 엣지 가중치와 기존 거리 값을 비교하여 더 작은 값으로 갱신한다.
우선순위 큐 업데이트 (Update Priority Queue):
- 거리 값이 갱신된 인접 노드를 우선순위 큐에 추가한다.
반복 (Repeat): 위 과정을 모든 노드가 방문될 때까지 반복한다.
종료 (Termination): 모든 노드의 최단 거리 값이 계산되면 알고리즘이 종료된다.

💡중요 포인트 (Key Points)

우선순위 큐 (Priority Queue): 노드 선택 시 가장 작은 거리 값을 자동으로 제공하므로 효율적이다.
거리 완화 (Relaxation): 현재 노드와 연결된 노드의 거리를 비교하여, 더 작은 값으로 갱신한다.
시간 복잡도 (Time Complexity): O (E log ⁡V) (우선순위 큐 사용시)

4️⃣벨만-포드 알고리즘 (Bellman-Ford Algorithm)

벨만 포드알고리즘 과정에 대해서 살펴본다. 벨만포드는 음의 가중치를 허용한다는 특징이있다.

1. 초기화 (Initialization): 시작점 외 모든 정점의 거리 (Distance)를 무한대 (Infinity)로 설정한다.

시작점의 거리는 0으로 설정한다.
이 과정은 다익스트라 알고리즘과 동일하다.

2. 모든 간선 완화 (Relax All Edges)

다익스트라에서는 for문을 활용하여 알고리즘을 최적화 하는 방향이있었다. 하지만 벨만 포드는 음의 가중치를 허용하다 보니 같은 방식으로 진행이 불가능하다. 이유는 최소 비용을 선택 하였는데 음의 가중치가 있다면 그것보다 더 작은 값이 나타나게 되므로 음의 가중치를 허용하지 않는 다익스트라의 특성이 적용되기 때문이다 . 이로인해 모든 간선들을 순서대로 거리를 업데이트 하게 된다. 다익스트라보다 더 많은 완화과정이 필요한 단점이 있다.

벨만-포드 알고리즘은 모든 간선을 반복적으로 완화 (Relaxation)한다. 다익스트라처럼 가장 작은 값을 가진 노드를 선택하지 않고, 모든 간선을 하나씩 확인하며 거리 값을 갱신한다
- 이유❓
  - 음의 가중치 (Negative Weight)가 포함된 그래프에서는, 특정 경로를 선택하더라도 더 작은 값이 나올 가능성이 있기 때문이다.
  - 따라서 모든 간선을 탐색하며 거리 값을 업데이트하는 방식으로 작동한다.
완화 과정 (Relaxation Process):
- 각 간선을 탐색하며, 다음을 수행한다.
- 새로운 거리 값 = 현재 거리 + 간선의 가중치
- 만약 새로운 거리 값이 기존 거리 값보다 작다면, 거리 값을 업데이트한다.

3. 음의 사이클 여부 확인 (Detect Negative Cycle):

벨만-포드 알고리즘의 장점은 음의 사이클 (Negative Cycle)을 탐지할 수 있다는 점이다.
- 음의 사이클이란? 반복적으로 탐색할수록 비용이 계속 줄어드는 사이클.
  - 예: 한 사이클을 돌 때마다 비용이 -10씩 감소.
탐지 방법:
- 모든 간선에 대해 반복적으로 완화 과정을 수행한 뒤,
- 또다시 거리 값을 업데이트했는데도 비용이 줄어든다면, 음의 사이클이 존재한다는 뜻이다.
음의 사이클 여부에 따른 결과:
- 사이클 있음 (Negative Cycle Exists): 경로가 무한히 줄어들기 때문에, 최단 경로를 정의할 수 없다.
- 사이클 없음 (No Negative Cycle): 지금까지 계산된 최단 거리를 출력하며 알고리즘이 종료된다.

✅벨만-포드 알고리즘과 다익스트라 알고리즘 비교 (Comparison)

벨만-포드 알고리즘 #1: 의사코드 (Bellman-Ford Algorithm Pseudocode)

1. 초기화 (Initialization):

시작점에서 출발하기 위해, 모든 정점의 거리를 무한대 (Infinity)로 설정한다.
시작점의 거리 (Start Node Distance)는 0으로 초기화한다: r.dist ← 0
이 과정은 다익스트라 알고리즘과 동일하다.

2. 정점 반복 (Vertex Loop):

정점의 수가 V라면, V−1 반복한다 for i ← 1 to ∣V∣−1
- 이유: 그래프에 정점이 4개라면, 최단 경로를 완전히 계산하기 위해 최대 3번의 거리 업데이트가 필요하기 때문이다.

3. 간선 완화 (Relaxation of Edges):

각 정점 반복 내부에서는 모든 간선 (Edges)을 따라가며 거리 값을 업데이트한다: for each (u, v) in E
완화 조건 (Relaxation Condition): if (u.dist + wuv < v.dist)
- - 현재 노드 u를 거쳐 v로 가는 경로가 기존 경로보다 짧다면,
    - v.dist ← u.dist + wuv
    - v.prev ← u (최단 경로의 이전 노드 기록).
이 과정은 다익스트라와 비슷하지만, 모든 간선을 탐색한다는 점에서 차이가 있다. (for each)

4. 음의 사이클 탐지 (Negative Cycle Detection):

정점 반복이 끝난 후, 추가로 한 번 더 모든 간선을 탐색한다.
만약 다음 조건이 참이라면, 음의 사이클 (Negative Cycle)이 존재한다는 뜻이다:
- if (u.dist + wuv < v.dist) 이미 계산이 끝난 상태에서 더 작은 값이 발견되었다면, 이는 음의 사이클로 인해 발생한 것이다.
  - 출력: "음의 사이클 발견, 해 없음 (Negative Cycle Detected, No Solution)".

5. 알고리즘 종료 (Termination):

음의 사이클이 없는 경우, 계산된 최단 거리 값을 출력하며 종료한다.

벨만-포드 알고리즘 #2: 작동 예제 (Bellman-Ford Algorithm in Action)

1. 초기 상태 (Initial State): 노드 0이 시작점이다.

시작점의 거리는 0, 나머지 노드들의 거리는 무한대 (Infinity)로 초기화된다.

2. 첫 번째 라운드 (First Round): 0번 노드와 연결된 모든 노드를 업데이트한다.

0 → 8: 거리 값은 0 + 8 = 8 0 → 9: 거리 값은 0 + 9 = 9 0 → 11: 거리 값은 0 + 11 = 11
결과: D[8] = 8, D[9] = 9, D[11] = 11

3. 두 번째 라운드 (Second Round): 모든 간선을 따라 거리 값을 업데이트 한다.

- 9 → 10: 기존 값(∞)보다 새로운 값 9 + 1 = 10 으로 업데이트 되었다. 그 뒤에
  - 9 → -15: 기존 값(8)보다 작은 9−15 =−6 으로 업데이트 되었다.
  - 9 → 11: 9 + 3 = 12가 되어 더 큰값이므로 작은값인 11이 계속 유지된다.
  - 11 → 19: 기존 값(∞)보다 새로운 값 11 + 8 = 19 두개의 노드가 업데이트 되었다.
결과: D[−6], D[10], D[19] 가 새롭게 계산된다.

4. 반복 과정 (Iterations):

다음 라운드들:
- 모든 간선을 탐색하며 거리 값을 반복적으로 갱신한다.
- 벨만-포드는 정점의 수(V)가 5라면 V−1 = 4번 반복.
- 각 라운드에서 모든 간선의 거리 값을 계산하고 일률적으로 업데이트한다.
- 중요한 점은 다익스트라에서는 가장 작은 값을 선택해서 그것에 인접한 노드를 업데이트하는 반면 벨만포트는 간선들을 모두 일률적으로 업데이트한다는 점이다. 실질적으로 순서가 중요하진 않고 의사코드의 과정이 중요하다.

5. 최종 결과 (Final Result):

각 노드로의 최단 거리가 계산된다.
벨만-포드는 모든 간선을 탐색하며 최단 경로를 확인하므로, 특정 순서에 의존하지 않는다.

✅ 벨만-포드와 다익스트라의 차이 (Difference from Dijkstra):

방식 (Method):

다익스트라는 가장 작은 값을 가진 노드부터 업데이트한다.
벨만-포드는 모든 간선을 반복적으로 탐색하며 업데이트한다.

음의 가중치 (Negative Weights):

다익스트라는 음의 가중치를 허용하지 않는다.
벨만-포드는 음의 가중치를 허용하며 음의 사이클도 탐지할 수 있다.

벨만-포드 알고리즘 #3: 자료구조( Data Structure in Bellman-Ford Algorithms)

1. 초기화 (Initialization): 벨만-포드 알고리즘은 간선(Edge)을 중심으로 작동한다.

간선의 정보를 담기 위해 엣지 리스트 (Edge List)를 사용한다.
- 엣지 리스트는 다음과 같이 구성된다: 출발 노드 (Start Node),종료 노드 (End Node), 가중치 (Weight)
최단 거리 리스트 (Shortest Distance List):
- 시작점(0번 노드)의 거리는 0으로 설정.
- 나머지 모든 노드의 거리는 무한대 (Infinity)로 초기화.
- 초기화된 리스트는 알고리즘의 첫 번째 단계가 된다.

2. 반복적 업데이트 (Iterative Updates): 모든 간선을 확인하며, 각 간선을 따라 거리 값을 업데이트한다.

업데이트 횟수 (Number of Updates):
- 정점의 수(V)가 5라면, V − 1 = 4번 반복.
- 이유: 정점 간 최단 경로는 최대 V−1개의 간선을 거칠 수 있기 때문이다.
업데이트 과정:
- 첫 번째 라운드: 시작점(0)과 연결된 2, 3번 노드의 비용이 갱신된다.
- 두 번째 라운드: 4,5 번 노드와 연결된 노드들의 비용이 갱신된다.
- 이렇게 순차적으로 모든 간선을 탐색하며, 최단 거리 리스트를 갱신한다.

3. 음수 사이클 탐지 (Negative Cycle Detection):

음수 사이클 (Negative Cycle): 특정 경로를 반복해서 탐색할 때, 비용이 계속 줄어드는 경우
탐지 방법:
- V−1번 반복 후, 한 번 더 모든 간선을 탐색한다.
- 만약 추가 업데이트가 발생하면, 음수 사이클이 존재한다는 뜻이다.
- 따라서 아래와 같이 결과를 출력한다.
  - "음수 사이클 존재 (Negative Cycle Exists)"

벨만-포드 알고리즘 #4: 파이썬 코드 (Bellman-Ford Algorithm Python Code Example)

1. 데이터 초기화 (Initialize Data):

노드(N)와 간선(M) 읽기: 그래프의 노드 개수 N과 간선 개수 M을 입력받는다.
```
  N, M = map(int, input().split())
```
간선 리스트와 거리 리스트 선언 (Edge List and Distance Array):
```
  edges = []  
  distance = [sys.maxsize] * (N + 1)
```
- edges: 간선 정보를 저장하는 리스트.
- distance: 각 노드까지의 최단 거리를 저장하는 리스트.
  - sys.maxsize로 초기화하여 무한대 값을 나타낸다.
엣지 데이터 입력받기 (Store Edge Data):
```
  for _ in range(M):  
      start, end, time = map(int, input().split())  
      edges.append((start, end, time))
```
각 간선의 출발 노드, 도착 노드, 가중치를 입력받아 edges 리스트에 저장한다.

3. 음수 사이클 여부 확인 (Negative Cycle Detection):

pythonCopy codemCycle = False  
for start, end, time in edges:  
    if distance[start] != sys.maxsize and distance[end] > distance[start] + time:  
        mCycle = True

반복 이후에도 값이 더 작아지면 음수 사이클(Negative Cycle)이 존재한다는 뜻이다.

4. 결과 출력 (Output Results):

if not mCycle:  
    for i in range(2, N + 1):  
        if distance[i] != sys.maxsize:  
            print(distance[i])  
        else:  
            print(-1)  
else:  
    print(-1)

음수 사이클이 없으면, 각 노드까지의 최단 거리를 출력한다.
음수 사이클이 있으면, -1을 출력한다.

5️⃣모든 쌍 최단 경로(All-Pairs Shortest Path)

✅ 모든 쌍 최단 경로 소개 (All-Pairs Shortest Path):

이전에는 단일 시작점 최단 경로 (Single-Source Shortest Path)를 배웠다. 단일 시작점 최단 경로란 특정 시작점에서 다른 모든 노드까지의 최단 경로를 구하는 방법이다.
이제는 모든 쌍 (All-Pairs)을 대상으로 하는 경로이다.
- 그래프에 있는 모든 노드 쌍 간의 최단 경로를 계산한다.
- 예: A → B, A → C, B → C 등 모든 경로
언제 사용하나요? (When to Use):
- 네비게이션 (Navigation): 도시 간의 최단 경로를 구하는 경우.
- 네트워크 통신 (Network Communication): 데이터가 여러 서버 간에 전송될 때 가장 빠른 경로를 계산하는 경우
특징 (Key Characteristics):
- 이 알고리즘은 모든 노드 쌍 간의 최단 경로를 구하므로 계산량이 많다.
- 복잡도가 높다 (High Complexity): 계산량이 커서 시간 소모가 크다.

✅ 대표 알고리즘 (Representative Algorithm):

플로이드-워셜 알고리즘 (Floyd-Warshall Algorithm):
- 모든 쌍 간의 최단 경로를 구하는 대표적인 알고리즘이다.
- 동적 프로그래밍 (Dynamic Programming) 방식을 사용한다.
- 그래프에 음수 가중치가 있어도 동작 가능하지만, 음수 사이클은 허용되지 않는다.

모든 쌍 최단 경로 #1: 동적 프로그래밍 적용(Dynamic Programming for All-Pairs Shortest Path)

✅ 동적 프로그래밍 (Dynamic Programming) 적용:

즉, 경로를 점진적으로 늘려가며 계산한다.

최소 간선부터 시작해서 처음에는 노드 간의 직접 연결된 거리만 확인한다. m=1
m > 1일 때, 추가 경로를 통해 더 짧아질 수 있는지를 확인한다.

m = 1: 노드 vi 와 vj 간의 직접 거리.
m > 1: 중간 노드 k를 통해 vi → k 로 이동할 때 거리

모든 쌍 최단 경로 #2: 단순 최단 경로 알고리즘(Simple Shortest Path Algorithm)

💡요약 (Summary)

목표: 모든 정점 쌍 간의 최단 경로 계산.
방법: 모든 정점에서 경로를 확장하면서 거리 비용을 업데이트.
문제: 시간 복잡도가 O(n4)로 비효율적.
개선: 플로이드-워샬 알고리즘으로 최적화.

✅ 단순 최단 경로 알고리즘:

그래프에서 모든 정점 쌍 간의 최단 경로를 계산한다.
경로를 점진적으로 확장하면서 최단 거리 비용을 업데이트한다.

✅동작 과정 (How It Works):

초기화 (Initialization):
- 모든 정점 쌍의 가중치를 초기화한다. (dij = wij)
- 직접 연결된 거리를 사용하거나 연결이 없는 경우 무한대 (∞)로 설정한다.
모든 정점 탐색 (All Nodes Exploration):
- 정점 k를 하나씩 추가하면서 i → k → j 로 이동하는 경로를 확인한다.
- 기존 거리 dij 와 dik + wkj를 비교하여 더 짧은 값을 업데이트한다.
점화식 (Recurrence Formula):
- dij: 정점 i에서 j로 가는 최단 거리.
- k: 중간에 추가된 노드

✅ 문제점 및 개선 방법:

단점 (Drawbacks): 성능이 느림 (Slow Performance)
- 이중 루프와 중첩된 계산으로 인해 시간 복잡도가 O(n4)로 매우 비효율적이다.
개선 방법 (Optimized Approach): 플로이드-워샬 알고리즘 (Floyd-Warshall Algorithm)
- 중간 정점 집합을 활용하여 계산을 최적화하고 수행 시간을 단축할 수있다.

모든 쌍 최단 경로 #2: 플로이드-워샬 알고리즘의 동적 프로그래밍 적용 (Floyd-Warshall Algorithm with Dynamic Programming)

플로이드-워샬 알고리즘은 그래프의 모든 정점 쌍 간 최단 경로를 효율적으로 계산하는 알고리즘이다.

✅ 목적 (Goal): 모든 정점 vi에서 vj로 가는 최단 경로를 찾는 것.

✅ 특징 (Feature): 이전 단계에서 계산한 최단 거리 값을 재활용하여 시간 복잡도를 줄인다.

✅DP 테이블 정의 (DP Table Definition)

- n개의 간선이 아닌 vertex set을 명확하게 지정한다. 정점 집합 {v1,v2,⋯ ,vk}만 거쳐 vi에서 vj로 가는 최단 거리이다.
  - 초기값으로 직접 연결된 거리 wij를 사용하거나 연결이 없으면 무한대 (∞)로 설정한다.

✅점화식 (Recurrence Formula):

기본 경우 (Base Case): k = 1 일 때
일반 경우 (General Case): k ≥ 1 일 때
- 해석 (Interpretation):
  - dijk−1: vk 를 거치지 않고 vi 에서 vj 로 가는 최단 거리.
  - dikk−1 + dkjk−1 : vk를 거쳐가는 경로의 거리.
  - 이 둘 중 더 짧은 값을 선택한다.
정점 집합 활용 (Vertex Set Utilization):
- 정점 집합을 점진적으로 확장하며 최단 거리를 업데이트한다.
- 불필요한 계산을 줄여 효율성을 높일 수 있다.

✅주요 특징 및 장점:

효율성 향상 (Improved Efficiency): 이전의 단순 알고리즘보다 O(n4)에서 n³ 로 시간 복잡도로 최적화되었다.
재활용 (Reusability): 이전 단계에서 계산된 거리 값을 활용하여 중복 계산을 줄인다.

모든 쌍 최단 경로 #3: 플로이드-워샬 알고리즘의 점화식 이해 (Understanding the Recurrence Formula in Floyd-Warshall Algorithm)

핵심 아이디어: 플로이드-워샬 알고리즘은 모든 정점 쌍 간의 최단 경로를 찾기 위해 점화식을 사용할 수 있다. 특정 정점 k를 경유할 때와 경유하지 않을 때의 최단 거리를 비교하여 더 짧은 값을 선택한다.

✅ 점화식 해석:

기존 경로 ( i ➡️ j )
- i에서 j까지 k를 경유하지 않은 상태에서의 최단 거리이다.
경유 경로 ( i ➡️ k ➡️ j)
- i에서 k를 거쳐 j로 가는 경로의 거리이다.
- 이 경로는 k−1 단계까지의 계산된 최단 거리를 사용한다.
최종 선택:
- k를 경유하지 않는 기존 경로와 k를 경유하는 새로운 경로를 비교한다.
- 위 두 값 중 더 작은 값을 선택하여 최단 거리를 업데이트하게 된다.
- O(n3)로 효율적이다.

All-Pairs Shortest Path #3: 플로이드-워샬 알고리즘의 의사코드 (Understanding the Pseudocode in Floyd-Warshall Algorithm)

✅알고리즘 구조 설명

초기화 (Initialization):
- 그래프의 각 간선 가중치를 초기 거리 값으로 설정한다. dij = wij
- 정점 i와 j가 직접 연결되어 있지 않은 경우, 초기값을 무한대로 설정한다. ∞
- 간선(Edge) : 노드와 노드를 연결하는 선
중간 정점 추가 (Adding Intermediate Vertices):
- 정점(Vertex) : 각 노드를 뜻한다.

바깥쪽 for문 (k): 중간에 포함할 정점의 범위를 1에서 n−1지 확장한다. for k ← 1 to n-1
중간 for문 (i): 모든 시작 정점 i를 확인한다.
안쪽 for문 (j): 시작 정점 i에서 도착 정점 j로 가는 최단 거리를 점화식으로 업데이트한다.

점화식 (Recurrence Formula):
- k를 포함하지 않은 경로와 k를 포함한 경로의 거리를 비교한다.
- 작은 값을 선택하여 최단 경로를 갱신한다.
결과: 알고리즘이 종료되면, dij에는 i에서 j까지의 최단 경로가 저장된다.

All-Pairs Shortest Path #4: 플로이드-워샬 알고리즘의 자료구조 순 (Floyd-Warshall Data Structure Example)

1️⃣ 초기화 (Initialization)

우선 그래프의 각 노드 쌍에 대해 최단 거리 리스트를 초기화한다.
자기 자신으로 가는 거리는 0으로 설정하고, 나머지 경로는 무한대(∞)로 설정한다.

2️⃣그래프 데이터 저장 (Store Graph Data)

초기 리스트는 가중치로 초기화 된다. 그래프의 경로 정보를 리스트에 저장한다.
예를 들어, 1에서 2로 가는 경로의 가중치가 8이라면 D[1][2] = 8로 설정한다.
D[2][4] = -4에서 볼 수 있듯이 음수 가중치도 처리할 수 있는 장점이 있다.

3️⃣점화식 업데이트 (Update with Recurrence Relation)

모든 경로를 탐색하며, 중간 노드(K)를 거쳤을 때 더 짧은 경로가 있는지 확인한다.
점화식: D[S][E] = Math.min(D[S][E], D[S][K] + D[K][E])
이 과정에서 D[S][E]와 D[S][K] + D[K][E] 중에서 더 작은 값을 리스트를 업데이트한다.

4️⃣결과 출력 (Output the Final List)

모든 경로를 반복적으로 탐색한 후, 최종적으로 완성된 최단 거리 리스트를 출력한다.
이 리스트는 각 노드 쌍 간의 최단 거리를 포함한다.

💡중요한 부분과 요약본

플로이드-워샬은 모든 정점 쌍의 최단 거리를 구한다. (Floyd-Warshall calculates the shortest path between all pairs of nodes.)
초기화, 그래프 저장, 점화식 업데이트, 최종 결과 출력 순으로 진행한다. (It proceeds in the order of initialization, storing graph data, updating with a recurrence relation, and printing the final result.)
점화식을 이용해 더 짧은 경로를 반복적으로 갱신한다. (Uses a recurrence relation to iteratively update to shorter paths.)
음수 가중치를 허용하지만, 음수 사이클은 처리할 수 없다.

All-Pairs Shortest Path #5: 플로이드-워샬 알고리즘의 파이썬 코드 (Floyd-Warshall Algorithm in Python)

💡정리: 첫 번째 이미지는 그래프의 최단 거리를 초기화하고, 플로이드-워샬 알고리즘의 핵심 반복문인 for 루프를 통해 완화 과정을 구현한다. 두 번째 이미지는 최종 결과를 출력하는 부분으로, 각 정점 쌍의 최단 거리를 행렬 형태로 보여주고 있다.

초기화 (Initialization)
- 각 정점에서 자기 자신으로 가는 거리 (distance[i][i])는 0으로 설정한다.
- 두 정점 사이에 간선이 있으면 그 가중치로 초기화한다. 만약 간선이 없으면 초기 값으로 무한대 (infinity)를 설정한다.
- 간선(Edge) : 노드와 노드를 연결하는 선
- 정점(Vertex) : 각 노드를 뜻한다.
반복문을 통한 완화 (Relaxation through Loops):
- 세 개의 for 루프를 사용한다:
  - 가장 바깥쪽 루프는 중간 경유지 (k)를 반복한다.
  - 그 안쪽 두 개의 루프는 출발 정점 (i)과 도착 정점 (j)의 쌍을 반복한다.
- 점화식 (distance[i][j] = min(distance[i][j], distance[i][k] + distance[k][j]))을 통해, 현재 계산된 최단 거리와 경유지를 거쳤을 때의 거리를 비교해 더 작은 값으로 업데이트한다.

결과 출력 (Output):

모든 정점 쌍에 대해 최단 거리 행렬을 출력한다.
만약 두 정점 간 경로가 존재하지 않으면 0으로 출력하게 된다.

6️⃣강연결 요소 구하기 (Finding Strongly Connected Components, SCC)

강연결 요소 구하기(Finding Strongly Connected Components)는 그래프 이론과 알고리즘에서 매우 중요한 개념이다. 방향 그래프에서 강하게 연결된 요소는 같은 부분 집합 내의 모든 정점이 해당 집합의 다른 모든 정점으로 방향 간선을 따라 도달할 수 있는 정점들의 부분 집합을 의미한다. 그래프의 SCC를 찾는 것은 그래프의 구조와 연결성을 이해하는 데 중요한 통찰력을 제공하며, 이는 소셜 네트워크 분석, 웹 크롤링, 네트워크 라우팅 등 다양한 분야에 응용될 수 있다. 또한 이 알고리즘은 Kosaraju's Algorithm의 핵심 아이디어를 따른다.

강연결 요소 (Strongly Connected Components, SCC)는 유향 그래프에서 특정 조건을 만족하는 노드들의 집합이다.

✅ 조건: 강하게 연결된 부분 그래프란, 그래프의 모든 정점 쌍에 대해 양방향으로 이동할 수 있는 경로가 존재하는 경우를 의미한다.

예를 들어) A → B로 가는 경로가 있고, 동시에 B → A로 가는 경로도 존재한다면, A와 B는 강하게 연결되어 있다고 한다.

✅ SCC 찾기의 목표: 그래프를 강연결 요소들로 나누어 각 요소의 경로 특성을 분석한다.

✅ 시간 복잡도: 이 알고리즘은 DFS를 두 번 사용하며, 수행 시간 복잡도는 O(V+E)로 매우 효율적이다.

첫 번째 그래프:
- 노드 11, 12, 13이 서로 강하게 연결(Strongly connected)되어 있다.
- 예) 11에서 12로 가는 경로와 12에서 11로 가는 경로가 존재하기 때문에 양방향 연결이 가능하다다.
두 번째 그래프:
- 전체 그래프는 여러 강연결 요소로 나뉜다.
- 단일 노드의 SCC: 노드 9와 10은 다른 노드들과 강하게 연결되지 않아 독립적인 요소를 형성한다.
- 강하게 연결된 서브그래프:
- 노드 6, 7, 8은 서로 강하게 연결되어 하나의 요소를 만든다.

강연결 요소 구하기 #1: 의사코드 (Pseudocode in Finding Strongly Connected Components, SCC)

그래프 탐색 (DFS 수행):
- 그래프 G에서 DFS(깊이 우선 탐색)를 수행한다.
- 각 정점 v의 완료시간 f[v]를 계산한다.
  - f[v]: DFS 탐색 중 정점 v에서 더 이상 갈 곳이 없을 때 기록되는 시간
  - 이 완료시간은 이후 역방향 그래프에서 탐색 순서를 결정하는 데 중요하다.
  - 간선(Edge) : 노드와 노드를 연결하는 선
  - 정점(Vertex) : 각 노드를 뜻한다.
역방향 그래프 생성 (Reverse the Graph):
- 그래프 G의 모든 간선 방향을 뒤집어 새로운 그래프 Gr를 만든다.
- Gr: G의 모든 간선이 반대로 연결된 그래프.
다시 탐색 시작 (DFS on Gr):
- Gr에서 다시 DFS를 수행한다.
- 이번에는 완료시간 f[v]가 가장 큰 정점부터 탐색을 시작한다.
  - 완료시간이 크다는 것은 G에서 가장 나중에 종료된 정점임을 의미한다.
강연결 요소 반환 (Return SCCs):
- Gr에서 탐색을 통해 분리된 트리(서브그래프)를 구한다.
- 각 트리는 하나의 강연결 요소 (SCC)가 된다.

💡 중요한 점 요약 (Key Points Summary)

DFS 완료시간 (Finish Time): DFS 수행 중 완료된 순서를 기준으로 역방향 탐색을 시작한다.
역방향 그래프 (Reverse Graph): 원래 그래프의 간선을 반대로 뒤집어 새로운 그래프를 생성한다.
DFS 재탐색: 완료시간 f[v]이 큰 정점부터 탐색을 시작하여 SCC를 구한다.
결과: 분리된 트리 형태로 강연결 요소들이 반환된다.

강연결 요소 구하기 #2: 작동 과정 (Components Process in Finding Strongly Connected Components, SCC)

1️⃣ DFS 수행

그래프 G에서 각 정점에 대해 깊이 우선 탐색(DFS, Depth First Search) 을 수행한다.

DFS를 통해 각 정점의 완료 시간 f[v]을 기록한다.
완료된 순서가 1 → 2 → 3 → ... 순으로 설정된다.

2️⃣ 간선 방향 뒤집기

그래프의 모든 간선 방향을 뒤집어 역 그래프(G^R) 를 만든다.
G^R는 G의 모든 연결 방향을 반대로 한 것이다.

3️⃣강연결요소 구하기

GR에서 f[v] 값이 가장 큰 정점부터 시작해 DFS를 다시 수행한다.
한 번의 DFS 탐색으로 묶인 노드들이 하나의 강연결요소(SCC) 를 형성한다.
이 과정을 모든 정점이 방문될 때까지 반복한다.

4️⃣ 결과 출력

G^R에서 DFS로 묶인 각 부분 집합이 강연결요소이다.
위 이미지에서 강연결요소는 서로 다른 색으로 구분된다.

7️⃣ A* 알고리즘 (A* Search Algorithm)

💡요약: A* 탐색 알고리즘은 경로 탐색 및 그래프 순회에서 가장 우수하고 널리 사용되는 기술 중 하나이다. 왜 A 탐색 알고리즘을 사용하는가?라고 묻는다면 A* 탐색 알고리즘은 다른 순회 기법과는 달리 "뇌(brains)"를 가지고 있다. 이는 정말로 똑똑한 알고리즘이라는 의미이며, 이를 통해 다른 전통적인 알고리즘과 차별화된다. 또한 많은 게임과 웹 기반 지도에서 이 알고리즘을 사용하여 매우 효율적으로(근사값으로) 최단 경로를 찾게 된다.

✅ 최단경로 문제의 복잡성

최단경로 구하는 문제는 매우 복잡한 문제로 최적화를 짧은시간에 구하는것은 어렵다.
최단경로를 찾는 문제는 계산량이 매우 많고, 시간이 오래 걸리는 알고리즘이다.
예를 들어, 벨만-포드나 플로이드-워셜 알고리즘은 여러 중첩된 for문으로 구성되어 있으며, 수행 시간이 길어질 수 있다.

✅ A 알고리즘의 접근 방식

네비를 따라 운전하다 보면 좀 이상하다?싶을 때가 있을 것 이다. 특히 이상한 길을 안내할 때 더욱 그렇다. 이렇게 최단 경로를 구하는 것은 매우 어렵기때문에 A* 알고리즘은 정확한 최단경로를 찾는 대신, 효율성을 높이기 위해 근사값(Heuristic) 을 사용한다.
특정 정점에서 목표 정점까지의 비용을 예측하는 함수 h(x)를 활용한다.
이 값은 목표에 "얼마나 가까운가"를 추정하며, 정확하지는 않지만 경로 탐색을 더 효율적으로 만든다.

✅ 활용 예시

네비게이션 시스템에서 최적의 경로를 찾을 때 자주 사용된다.
예를 들어, 도로 상태, 거리, 예상 시간 등을 고려하여 실제 최적의 길을 안내할 때이다.
하지만 휴리스틱 함수 h(x)가 부정확하면, 예상치 못한 "이상한 경로"를 안내할 수도 있다.

💡 중요한 부분 (Key Points)

휴리스틱 함수 h(x): 목표 정점까지의 비용을 추정하며, 탐색의 효율성을 높이는 핵심.
A 알고리즘의 장점: 정확한 비용 계산 없이도 최적 경로에 가까운 결과를 빠르게 제공한다.
단점: h(x)가 부정확하거나 잘못 정의되면 최적 경로를 보장하지 못한다.

A* 알고리즘 #1: 자세한 설명 (Detailed Explanation of the A * Search Algorithm)

💡요약: 네비게이션 시스템은 A* 알고리즘을 활용해 최단경로를 탐색한다.

최단 거리를 기준으로 한다면: h(n)은 "직선 거리"를 기준으로 계산된다.
최단 시간을 기준으로 한다면: h(n)은 "예상 시간"을 기준으로 계산된다. 이처럼 h(n)의 정의가 달라지면 결과도 달라질 수 있다.

A* 알고리즘은 이 두 정보를 결합해, 최단 경로를 찾으면서도 효율적으로 계산하려는 시도이다.

✅ 평가 함수 f(n)

A* 알고리즘은 평가 함수 f(n) = g(n) + h(n)을 사용한다.
g(n): 출발점에서 정점 n까지의 실제 경로 비용
- 예: 지금까지 이동한 거리나 시간.
h(n): 정점 n에서 도착점까지의 추정 경로 비용
- 정확한 값이 아닌, 도착점까지 얼마나 가까운지 "예상"하는 값입니다.
- h(n)은 알고리즘의 성능을 좌우하는 중요한 요소이다.

✅ 작동 과정

f(n) 값을 기준으로 가장 비용이 적은 정점을 선택해 탐색을 진행한다.
탐색은 다음 두 가지 정보를 기반으로 진행된다.
- 지금까지 이동한 거리 g(n)
- 앞으로 남은 예상 거리 h(n)
최적 경로는 f(n) 값을 최소화하는 경로를 찾는 것이다.

✅ h(n)의 역할

h(n)이 정확 할수록 탐색이 효율적이고 빨라진다.
반대로, h(n)이 부정확하면 잘못된 경로로 탐색하거나 효율이 떨어질 수 있다.
예를 들어, 네비게이션 시스템의 A* 알고리즘은 도로 거리나 예상 시간을 h(n)으로 사용한다.

✅ 예시: 네비게이션의 차이점

네비게이션 앱 간의 길 안내가 다른 이유는 h(n)을 정의하는 방식이 다르기 때문이다.
- 어떤 앱은 최단 거리를 기준으로 하고,
- 다른 앱은 최단 시간을 기준으로 h(n)을 정의하기 때문이다.
도로 상황, 교통량, 속도 제한 등을 반영하는 방식이 달라 알고리즘 결과가 조금씩 차이가 나게 된다.

✅ 장점과 단점

장점: 실제 경로 비용과 예상 비용을 함께 고려해 더 효율적이고 현실적인 탐색이 가능하다.
단점: h(n)을 설계하는 것이 까다롭고, 정확도에 따라 성능이 달라지게 된다. 경우에 따라 시간 복잡도가 높아질 수 있다.

A* 알고리즘 #2: 의사코드 (Pseudo-code of A Algorithm)

초기화 (Initialization)
- 시작 노드의 비용 g(start_node)+h(start_node)를 계산하여 우선순위 큐 (priority queue)에 삽입한다.
  - 우선순위 큐는 비용이 가장 작은 노드부터 처리하도록 정렬된 데이터 구조이다. 시작점에서 출발하여 탐색을 시작한다.
탐색 반복 (While loop)
- 우선순위 큐가 비어 있지 않은 동안 반복한다.
  1. 큐에서 가장 작은 비용의 노드를 꺼낸다. (node=pq.dequeue)
  2. 현재 노드가 목표 노드인지 확인한다.
    - 목표 노드라면 탐색을 종료한다.
  3. 목표 노드가 아니라면 현재 노드에서 이동 가능한 다음 노드들을 확인한다.
다음 노드 탐색 (Exploring Next Nodes)
- for 루프를 사용해 현재 노드에서 이동 가능한 모든 이웃 노드를 확인한다.
  1. 각 이웃 노드의 비용을 계산합니다:
    - f(next_node)=g(node)+cost+h(next_node)
    - cost: 현재 노드에서 이웃 노드로 이동하는 비용.
    - h(next_node): 이웃 노드에서 목표 노드까지의 예상 비용 (휴리스틱).
  2. 계산된 비용을 우선순위 큐에 삽입한다 pq.enqueue(...)
- 이 과정을 반복하며 비용이 낮은 경로를 따라 탐색한다.
목표 도달 후 종료 (Termination)
- 목표 노드에 도달하면, 해당 경로의 총 비용을 출력하고 알고리즘을 종료하게 된다.

💡 주요 코드 설명 (Key Steps)

우선순위 큐 (Priority Queue): 비용이 가장 낮은 노드를 우선적으로 처리해 탐색 효율성을 높인다.
평가 함수: f(n)=g(n)+h(n)을 결합해 최적의 노드를 선택한다.
휴리스틱 h(n): 도착점까지의 예상 비용으로, 정확도가 높을수록 알고리즘 성능이 개선된다.

Join Algorithms for Database Optimization

Heesu Noh — Thu, 05 Dec 2024 02:39:07 GMT

Contents

1️⃣조인 (Overview Of Join)
2️⃣조인의 동작 방식(How Join Works)
3️⃣Comparison of Join Methods (조인 방식 비교)

Database Joins and Performance Optimization (데이터베이스 조인과 성능 최적화)

지금까지 데이터 베이스의 물리적설계와 구현에 대해서 배웟다. DB의 물리적 설계 부분은 dbms와 굉장히 밀접한 연관이 있다. 저장 구조를 어떻게 설정할 것인지,인덱스를 어떻게 만들 것인지, 파티션 여부 등 저장 구조와 관련된 부분, 동시성 제어, 락 관리, 트랜잭션은 구현, 구축 부분이다. 이러한 데이터베이스 설계 구축에서 가장 중요시 하는 첫번째 목표는 “반응속도”이다. 사용자가 쿼리를 주었는데 늦게 나온다면 문제가 된다. 잘 가져오는것도 중요하지만 속도가 중요한 부분을 차지한다. 그렇게 하기위해 dbms는 많은 노력을 하고있다. 관계형 데이터 베이스를 테이블의 형태로 저장하는데 이때 가장 안좋은 점은 "조인"을 할 때이다. 하나의 테이블만 보면 데이터가 아무리 많아도 인덱스로 평균적인 퍼포먼스 활용이 가능하다. 혹은 파티션 생성으로도 접근속도를 빠르게 할 수 있다. 그런데 두 개 이상의 테이블을 접근해서 조인으로 가져오는 경우는 사실 많은 노력이 필요하다. 속도 또한 갑자기 느려질 수 있다. 그래서 쿼리 설계시 특히 조인 쿼리 설계시 많은 주의가 요구된다. 조인 과정이 어떻게 동작하는지 머릿속에 그릴 수 있어야 성능 좋은 쿼리를 생성할 수 있게 된다. 이번 시간은 조인을 좀 더 정리하는 시간을 갖을 것이다.

1️⃣조인 (Overview Of Join)

💡요약: 관계형 데이터베이스는 테이블의 형태로 데이터베이스를 분리해서 저장한다. 이 과정을 정규화(normalization) 라고 하는데 이를 통해 중복을 최소화하는 relation으로 분리하게 된다. 사용자가 원하는 것은 분리되서 저장하는 것도 그렇고 데이터를 모아서 보는 것이다. 이를 위해 데이터를 결합하는 조인 과정을 통해 데이터를 가져와야 하기 때문에 조인 과정이 필요하게 된다. 조인의 종류는 크로스 조인, 내부조인, 외부조인이 있는데 주로 크로스조인, 내부조인을 사용한다.

✅ 조인의 정의 (Definition of Joins)

조인은 두 개 이상의 테이블을 묶어 하나의 결과 집합으로 만드는 것(Combining Two or More Tables into a Single Result Set)을 의미한다. 예를 들어)

departments 테이블에는 부서 정보가 있다.
locations 테이블에는 부서의 위치 정보가 저장되어 있다. 보통 부서의 정보를 가져올땐 location id를 가져오진 않는다. 우리가 원하는 것은 주로 어느 시티에 있는지, 국가에 있는지를 찾아보게 된다. 이런 데이터를 따로 보고 싶을 때 조인이 필요하다.

✅ 조인의 필요성(Need for Joins): 부서 정보와 함께 위치 정보(도시 및 국가)를 확인하려면 두 테이블의 데이터를 결합해야 한다.

✅조인 성능 최적화 (Join Performance Optimization):

조인의 성능을 높이기 위해서는,

인덱스(Indexes)를 활용한다.
파티션(Partitions)을 설정하여 데이터 접근 속도를 높인다.
조인 쿼리를 설계할 때 실행 계획을 이해한 뒤 최적화해야 한다.

✅조인 종류 (Types of Joins):

크로스 조인(Cross Join): 두 테이블의 모든 행을 조합하여 반환한다.
내부 조인(Inner Join): 두 테이블에서 매칭되는 데이터만 반환한다.
외부 조인(Outer Join): 한쪽 테이블에서 매칭되지 않은 데이터도 포함한다.

조인 #1: Cross Join and Cartesian Product (크로스 조인과 카티션 프로덕트)

💡요약: 두 테이블의 데이터의 모든 행을 곱하는 연산을 릴레이션에서는 카티션 프로덕트라고 한다. 이 카티션 프로덕트를 구현한 것이 크로스 조인이다.

✅카티션 프로덕트의 특징 (Characteristics of Cartesian Product):

모든 조합 반환(Returns All Combinations): 테이블 간 모든 행 조합을 생성한다.
조건이 없는 조인(Unconditional Join): ON 또는 WHERE 절이 없다.
큰 결과 집합 생성(Large Result Set): 테이블의 크기가 클수록 행의 수가 기하급수적으로 증가한다.

✅크로스 조인의 정의 (Definition of Cross Join)

크로스 조인(Cross Join)은 데이터베이스에서 두 개의 테이블을 곱하는 조인 방식이다. 이 연산은 테이블 간의 모든 가능한 조합을 반환하며, 카티션 프로덕트(Cartesian Product)를 생성한다.

크로스 조인은 두 테이블의 모든 행을 곱한다.
각 행은 두 테이블의 모든 가능한 조합을 나타낸다.

✅예제 (Example):

departments의 각 행이 locations의 모든 행과 매칭되었다. departments 테이블의 첫 번째 행과 locations 테이블의 23개 행이 조합된다.
같은 방식으로 departments 테이블의 나머지 26개 행도 반복된다.

departments 테이블: 27개의 행 (Rows)
locations 테이블: 23개의 행 (Rows)
크로스 조인 결과: 27 × 23 = 621개의 행이 생성되었다.

크로스 조인 쿼리:

SELECT * FROM departments CROSS JOIN locations;

또는 크로스 조인 생략(Without Explicit Cross Join):

SELECT * FROM departments, locations;

✅주의점 (Cautions)

큰 테이블 사용 시 성능 문제(Performance Issues with Large Tables): 크로스 조인은 결과 집합이 매우 커질 수 있으므로 메모리와 처리 속도에 영향을 미친다.
사용 목적(Purpose of Use): 보통 특정 테스트 또는 모든 조합이 필요한 경우에만 사용한다.
성능 저하를 방지하려면 필터 조건을 추가하는 것이 중요하다. (Add Filter Conditions to Avoid Performance Degradation)

조인 #2: Inner Join and Efficient Execution (내부 조인과 효율적인 실행)

inner join은 조건이 있다. 크로스 조인이 무조건 2개를 곱하는 것 이라면 내부 조인은 조건을 달아 연관된 데이터만 반환하는 방식이다. 이때, 데이터를 결합하는 과정은 크게 세 가지 단계로 나뉜다.

✅ 내부 조인의 정의 (Definition of Inner Join):

두 테이블의 모든 행을 카티션 프로덕트로 결합한다.
이후 ON 조건이나 WHERE 절을 사용하여 특정 조건을 충족하는 데이터만 필터링한다.

✅ 쿼리 예제와 결과 (Query Examples and Results):

기본 내부 조인 (Basic Inner Join):
```
 SELECT * FROM departments INNER JOIN locations 
 ON departments.location_id = locations.location_id;
```
- 결과: 카티션 프로덕트한 전체 집합에서 조건에 맞는 레코드만 남게된다. departments와 locations의 location_id가 같은 27개의 레코드 반환.
간단히 표현한 내부 조인 (Simplified Inner Join):
```
 SELECT * FROM departments, locations 
 WHERE departments.location_id = locations.location_id;
```
- 좀 더 간단하게 표현도 가능하다. INNER JOIN 대신 WHERE 만으로도 동일한 결과를 얻을 수 있다.
조건 추가 (Adding Conditions):
```
 SELECT * FROM departments, locations 
 WHERE departments.location_id = locations.location_id 
 AND department_id <= 100;
```
- AND를 붙여 조인 조건에 조건을 줄 수 있다. where 로 가져 왔는데 department_id가 100보다 작거나 같은 것만 꺼내라는 조건을 AND뒤에 붙인것이다.
- 결과: 부서 ID가 100 이하인 경우만 반환 (10개의 레코드).

✅내부조인의 실행 절차

카티션 프로덕트 생성 (Cartesian Product):
- SELECT * FROM departments INNER JOIN locations

두 테이블의 모든 행 조합을 생성 (621개의 행).

조건 필터링 (Filtering by Condition):
- 그 중에서 department.location_id와 locations.location_id가 같은 것을 뽑아낸다.
- ON departments.location_id = locations.location_id

location_id가 같은 데이터만 필터링되어 27개의 행이 출력된다.

추가 조건 적용 (Applying Additional Conditions):
- WHERE department_id <= 100;

department_id가 100보다 작은 것을 뽑아 낸다. 10개가 생성되었다.

💡DBMS의 입장에서 621-> 27->10 를 추출하는 과정은 비효율적인 면이 있다. 이러한 이유로 거꾸로 하면 빠르지 않을까? 라는 해결법이 제시 되었다.

✅ DBMS 최적화 방법 (DBMS Optimization):

필터 조건을 먼저 적용(Apply Filters Early):
- department_id 가 100보다 작은 결과를 원한다면, 100보다 큰 것은 처음부터 조인을 하지 않으면 된다. department_id <= 100 조건을 먼저 처리하여 10개의 행만 선택한다.
- department 조인에 참여하는 숫자를 줄인다. 불필요한 조인을 줄여 성능 향상하는 것이다.
인덱스 활용(Using Indexes):
- dept_id_pk(부서 ID)와 loc_id_pk(위치 ID) 인덱스를 사용하여 효율적으로 데이터를 조회.
- 인덱스 range scan을 통해 100보다 작은 10개를 뽑아내게 되었다. 인덱스 덕분에 필요한 데이터만 검색이 가능해졌다.
Sort Merge Join:
- 그 뒤 필터링된 데이터(10개)를 정렬한 뒤 병합하여 결합한다.

✅ 실행 계획 분석 (Execution Plan Analysis):

첨부된 실행 계획 이미지에서 주요 과정:

Index Range Scan:
- department_id <= 100 조건에 따라 필터링.
Table Access:
- 필터링된 10개의 데이터를 가져옵니다.
Sort and Merge:
- departments.location_id = locations.location_id 조건에 따라 정렬 및 병합.

💡이것이 내부조인을 dbms에서 효율적으로 구현하는 방법의 예가 된다. 쓸데없는 조인을 줄이게 된다. 조인을 설계할 때 이런식의 효과적인 접근방법이 나오게 설계하는 것이 굉장히 중요하다. 그러기 위해선 조인이 어떻게 동작 하는지를 이해하고 있어야한다.

2️⃣조인의 동작 방식(How Join Works)

💡요약: 조인의 동작방식에 대해 더 자세히 알아본다. 조인이 수행될 때 테이블 간에 접근하는 방식이 중요하다고 배웠다. 조인 방식에 따라 쿼리의 비용과 성능이 달라지기 때문이다. DBMS의 쿼리 옵티마이저가 접근 방식을 결정한다. 오라클에서 주로 사용하는 조인 방식은 크게 3가지가있다. 중첩 루프 조인(Nested Loop Join), 소트 머지 조인(Sort Merge Join), 해시 조인(Hash Join)은 오라클 DBMS에서 주로 사용되는 조인 방식이다. 각각의 조인 방식은 데이터 크기, 정렬 여부, 조건에 따라 성능이 달라진다.

조인의 동작 방식 #1: 중첩 루프 조인 (How Join Works - Nested Loop Join )

✅ 중첩 루프 조인 (Nested Loop Join):

중첩 루프 조인은 간단하지만 성능은 데이터 크기와 인덱스 여부에 따라 달라집니다.
(Nested Loop Join is Simple but Performance Depends on Data Size and Index Presence)
인덱스 설정은 중첩 루프 조인의 핵심 최적화 전략입니다. (Indexing is Key to Optimizing Nested Loop Join)

작동 방식:
- 첫번째 테이블(outer relation)의 각각의 로우에 대해서 두번째 테이블(inner relation)의 모든 로우를 비교한다. 첫번째 테이블에 46은 두번째 테이블에 없으므로 넘어간다. 다음 로우 0은 두번째 테이블에 로우 3에 있다. 그 뒤엔 어떻게 될까? 중복 여부에 따라 달라진다. 중복이 될 수 없다면 이 0 결합 후 끝나게 된다. 중복이 허용 된다면 테이블 끝까지 full scan을 진행하게 된다. 이 경우 인덱스가 있었다면 중복여부와 상관없이 정렬이 되어있기 때문에 훨식 효율적이다.
- 결론적으로 inner relation에 인덱스 여부가 중요한 지표가 된다. 인덱스가 없다면 전부 돌아야하기 때문이다. 이 예제에 중복허용이 안된다면 0에서 끝나지만 중복 허용이 되었다면 full scan을 진행하므로 첫 번째 테이블의 10은 두번째 테이블의 10과 결합 된다. 모든 레코드에 대해 모든 로우를 비교하는 것을 중접 루프 조인의 동작 방식이라고 한다.
- 우리가 흔히 생각하는 조인이다.
- 첫번째 테이블(outer relation)의 각각의 로우에 대해서 두번째 테이블(inner relation)의 모든 로우를 비교하여 조건에 맞는 로우(데이터)를 결합한다.
- 카티션 프로덕트를 생성한 뒤 조건을 적용하여 데이터를 필터링한다.
특징:
- 인덱스 구성이 되어있어야 한다. 인덱스가 없다면 엄청나게 느려질수있다.
- 조인을 한 레코드씩 순차적으로 진행한다. 먼저 액세스 되는 테이블의 처리 범위에 의해 전체 조인 성능이 결정된다. outer relation 에 있는 로우를 줄여야 한다. 그래서 이것으로 inner relation으로 스캔하게 된다. 이렇듯 조인에 참여하는 로우의 갯수를 줄이는 게 중요함을 확인 할 수있다.

작은 데이터셋에 적합하다.
조인 컬럼의 인덱스 여부와 인덱스 컬럼의 구성 방식에 따라 조인 효율이 크게 달라진다. index가 있는것이 가장좋고 적어도 secondary index라도 있어야 좋다.
두 테이블 간 정렬이 필요없다.
팁을 언급 하자면 조인 쿼리를 많이 작성해야 할 때, 많이 등장하는 컬럼에 인덱스를 설치하게 되면 접근성이 증가할 수 있게 된다.

단점:
- 인덱스 구성이 되어 있어도 대량의 데이터를 조인할 때 매우 비효율적이다.

큰 테이블에 사용하면 비효율적(연산량 증가).
테이블의 로우가 많으면 많을수록 m2이므로 선형적으로 증가하게 된다.

✅ 중첩루프조인의 PL/SQL 구현예제 1 - 조인 연산 없이

이 예제는 조인 연산 없이 PL/SQL을 구현한 예제이다.

SET serveroutput ON;

BEGIN
    FOR outer IN (SELECT * FROM departments) LOOP
        FOR inner IN (SELECT * FROM locations WHERE location_id = outer.location_id) LOOP
            DBMS_OUTPUT.PUT_LINE(
                outer.department_id || ' ' || 
                outer.department_name || ' ' || 
                inner.location_id || ' ' || 
                inner.city
            );
        END LOOP;
    END LOOP;
END;

외부 루프(Outer relation)
- FOR outer IN (SELECT * FROM departments) LOOP

departments 테이블의 모든 row(행)을 가져온다. (27개)

내부 루프(Inner relation)
- 각각의 로우에 대해 Inner relation에 있는 값을 비교한다.

FOR inner IN (SELECT * FROM locations WHERE location_id = outer.location_id) LOOP

locations 테이블에 23개의 row가 있는데 WHERE location_id = outer.location_id서 첫번째 레코드 23번 비교 두번째 레코드 23번 비교 .. 이렇게 된다.
만약 인덱스가 있다면 인덱스 1번만 비교하게 될 것이다. 621번 했어야 하는 비교가 27로 바뀌게 된다.

출력: department_id, department_name, location_id, city.

✅ 중첩루프조인의 PL/SQL 구현예제 2- 조인 연산 사용 WHERE

조인 테이블 확인

SELECT * FROM locations; 에선 country_id가 조인키가 된다. SELECT * FROM countries; 또한 country_id가 조인키가 된다.

조인 완료

SELECT l.location_id, l.city, l.state_province, c.country_id,까지는 location테이블에 있다. c.country_name은 countries에 있다.
예제의 결과처럼 조인된 결과를 보여주기 위해선 아래의 명령어를 사용한다.

SELECT l.location_id, l.city, l.state_province, c.country_id, c.country_name
FROM locations l, countries c WHERE l.country_id = c.country_id;

WHERE은 조인 조건으로써 l.country_id와 c.country_id가 동일한 조건이다.

조인 실행 계획

Nested Loops: 중첩 루프 조인을 사용.
거기에 locations, full 테이블 조인이 되었다. 23개의 레코드가 있었음을 볼 수 있다.
Index Scan: country_id에 인덱스(COUNTRY_C_ID_PK)가 설정.
- 내부 테이블을 풀 스캔하지 않고, 필요한 데이터만 검색.

✅인덱스의 중요성 (Importance of Index):

인덱스가 없는 경우: 내부 테이블의 모든 행을 풀 스캔(Full Scan), 성능 저하를 일으킨다.
인덱스가 있는 경우: 필요한 데이터만 검색, 예: location_id에 인덱스가 있으면 비교 횟수가 27번으로 줄어듦.
조인 컬럼의 인덱스 설정: 조인 조건에 자주 사용되는 컬럼에 인덱스 추가할 수 있다. 예) location_id, country_id.

조인의 동작 방식 #2: 소트 머지 조인 (How Join Works - Sort Merge Join)

💡요약: 소트 머지 조인(Sort Merge Join)에 대해 알아보자. 중첩 루프 조인은 좀 무식한 방법이기 때문에 dbms도 이를 최후의 방법으로 여기고 소트머지조인(Sort Merge Join)을 우선으로 처리하는 편이다. 소트 머지 조인(Sort Merge Join)은 두 테이블의 데이터를 정렬(Sorting)한 뒤 병합(Merge)하여 조인을 수행하는 방식이다.

✅ 소트 머지 조인의 작동 방식 (How Sort Merge Join Works):

정렬(Sort):
- PGA영역의 Sort 영역에서 정렬한 뒤 Nested Loop 조인 방식으로 진행된다.

쉽게 설명하면 두 테이블을 각각 정렬한 다음에 두 집합을 합치면서 조인을 수행하는 방식이다.
정렬 후에 합치기 때문에 inner relation은 이미 1, 1, 1 이런식으로 정렬된 상태가 된다.
즉 인덱스가 없는데도 이와 비슷한 역할을 하는 셈이다.
정렬 효율이 빨라지고 full table scan을 하지 않게 된다. 버퍼캐시를 사용하는 NL보다 빠르다.
예: employees와 departments 테이블의 department_id 기준으로 정렬.

병합(Merge):
- 정렬된 두 테이블을 순차적으로 비교하여 조인 조건에 맞는 데이터를 결합.

✅ 소트 머지 방식의 단점 (Disadvantage of Sort Merge Join)

정렬 비용 추가: 정렬 작업이 필요하므로 추가 비용 발생.
이미 정렬된 경우에는 불필요: 정렬된 데이터에선 성능 이점이 적음.

단점은 정렬을 하는데 이 정렬 비용이 추가적으로 들게 된다는 점이다. 만약 정렬하는 비용이 더 들 것 같다고 DBMS가 판단 하면 중첩 루프 조인을 선택하게 될 것 이다. 일반적으로 소트 머지 조인은 버퍼 캐시를 사용하는 Nested Loop보다 빠르게 수행된다. 인덱스가 없는 경우 인덱스를 실시간으로 생성하는 효과를 볼 수 있다. 미리 정렬이 되어 있기 때문이다. 만약 인덱스가 있는 경우엔 조인 속도가 바로 증가하게 된다. 데이터가 정렬되어 있기 때문에 비교값이 없거나 다 찾은 후에는 종료하게 된다.

✅ 예제 쿼리와 결과 (Query and Results):

이 예제를 통해 인덱스가 없어도 정렬을 통해 빠르게 접근이 가능한 것을 알수있다. Nested Loop Join과 달리, =, <, >, <=, >= 조건에 모두 적용 가능하다. 또한 대용량 데이터 처리도 가능해서 대규모 테이블에 적합하다.

SELECT e.employee_id, e.first_name, e.last_name, e.department_id, d.department_name
FROM employees e, departments d
WHERE e.department_id = d.department_id;

Employees 테이블 (Outer Relation): 모든 데이터를 읽기 때문에 Full Table Scan (107개 레코드).
Departments 테이블 (Inner Relation): department_id에 인덱스가 존재하여 인덱스를 사용하였다.
정렬 및 병합: 조건(WHERE): e.department_id = d.department_id , 두 테이블의 department_id 기준으로 정렬 후 병합되었다.

✅소트 머지 조인의 실행 계획

employee테이블에서 outer relation은 모든 레코드를 비교해야하므로 full, 107개가 확인된다.
inner relation인 department테이블이 중요한데, 인덱스가 없다면 full table scan을 하게된다. 이 경우에는 인덱스가 있으므로 첫번째 인덱스만 찾아 가져왔다.
자, 여기까지보면 nested loop와 차이가 없는것처럼 보인다. 하지만 sort merge join에서는 department id에 대해서 sort를 한다. 그래서 sort merge join 이 일반적으론 빠른편에 속한다.

✅소트 머지 조인의 실행 계획에서 알수있는 특징

1) 첫번째 테이블에 소트 연산을 대체할 인덱스가 있을 때 유용하다.

두 테이블을 정렬하기 때문이다. 부분 범위 처리가 가능하다는 뜻이다.

2) 첫번째 테이블이 이미 정렬되어 있을 때 유용하다.

group by, order by 등을 먼저 수행한 경우이다.

3) 조인 조건식이 = 조건이 아닐 때에도 적용 가능하다.

이 부분이 제일 중요하다. 참고로 해시조인은 조인 조건식이 =일 경우에만 사용할 수 있다.

조인의 동작 방식 #3: 해시 조인 (How Join Works - Hash Join)

💡 요약: 해시 조인은 기존의 중첩 루프 조인(Nested Loop Join)과 소트 머지 조인(Sort Merge Join)이 비효율적인 경우에 성능을 개선하기 위해 개발된 방식이다.

✅ 해시 조인의 특징 (Characteristics of Hash Join):

중첩루프조인과 소트머지조인이 효과적이지 않은 상황에 대한 대안으로 개발
- "효과적이지 않은 상황”을 정의하긴 좀 어렵다. 보통은 인덱스가 없거나, 대규모의 데이터 처리에 적합하지 않을 때리 할 수 있겠다.
대규모의 데이터 처리에 적합하다.
- 원리는 무엇일까? 해시라는 것은 해시함수를 적용하여 범위를 줄여준다. 해시를 이용해서 주소가 1~1000까지 있을때 이 1000까지 주소 되어 있는 부분을 10개씩 묶을 수 있게 된다. 그럼 1000개를 비교하는대신 100개를 10번 비교하는 형태가 된다. 이런식으로 비교 범위를 확 줄여주는 것이 해시조인의 기본 원리라고 할 수 있다.
일반적인 경우 중첩루프조인이나 소트머지조인보다 나은 성능을 보임
- 또 "언제" 더 나은 성능을 보이느냐? 대부분은 outer relation에 비해 inner relation의 로우의 숫자 더 많을 때 이다.
두 테이블 중 작은 사이즈의 테이블을 읽어 해시 영역에 해시 테이블 생성한다.
- Build Input(작은 테이블) 이라고 한다.
나머지 큰 테이블의 레코드를 하나씩 읽어 해시 테이블에 연결하는 방식이다.
- Probe Input(큰 테이블)이라고 한다.

✅ 해시 조인의 동작 과정 (Hash Join Execution Process):

해시 조인은 큰 두 개의 테이블에서 조인 조건에 맞는 데이터를 효율적으로 찾아내는 방법이다. 데이터를 메모리에 저장한 후, 해시 알고리즘을 통해 데이터를 비교하고 매칭한다. 이 과정은 특히 대규모 데이터에서 유용하다.

1️⃣ Table Scan (테이블 스캔) - Vehicles Table

첫 번째로, "Vehicles Table"이라는 테이블을 읽는다.
이 과정에서 데이터를 스캔하고, 조인에 필요한 컬럼 값을 뽑아낸다.
이 값들을 활용해 2️⃣ 해시 테이블(Hash Table)이라는 것을 만든다. 해시 테이블은 데이터를 빠르고 효율적으로 저장하고 검색하기 위해 사용하는 자료구조이다.
이럴때 vehicles table은 값이 작은 것을 확인할 수 있다. sales는 판매될수록 계속 늘어나게 될 것이다.
(비유): "차량"이라는 박스를 열어 안에 들어 있는 필요한 자료만 꺼내오는 과정이다.

2️⃣ Hash Table 생성

"Vehicles Table"에서 꺼낸 값들을 사용해 해시 테이블이라는 특별한 데이터 구조를 만든다.
이 해시 테이블은 메모리(PGA)에 저장되며, 검색을 빠르게 도와준다.
(비유): 필요한 자료들을 바구니(PGA)에 넣고 정리해둔 상태라고 볼 수 있다.

3️⃣ Table Scan (테이블 스캔) - Sales Table

(설명): 두 번째 테이블인 "Sales Table"을 읽는다.
이 과정에서 조인 조건에 맞는 데이터만 추려낸다. (해시 테이블 생성)
테이블 스캔이 한번 진행된다. 중요한 부분이다. 위에서 부터 아래로 한번만 한다. 이로써 해시 테이블을 구성한다.
(비유): "판매 기록"이라는 박스를 열어 차량 정보와 관련된 것만 가져오는 작업이다.

4️⃣ Row Sent to Hash Join (행을 해시 조인으로 전달)

"Sales Table"에서 가져온 데이터 중, 조인에 필요한 데이터만 선택한다.

이 데이터는 해시 조인 알고리즘에 따라 다시 해시 테이블로 보내진다.
(비유): 판매 데이터에서 차량과 관련된 부분만 선별해서 바구니에 추가한다.

5️⃣ Hash Join 수행

마지막으로 "Sales Table"에서 가져온 데이터와 "Vehicles Table"로 만든 해시 테이블을 비교하게 된다.
두 테이블의 데이터가 매칭되는지를 확인하여 결과를 출력한다.
(비유): 두 박스에서 꺼낸 데이터를 서로 비교하여 연결 가능한 것들만 짝짓는 작업이다.

✅ Hash Join with Temporary Tables (임시 테이블을 이용한 해시 조인)

해시 조인은 기존의 중첩 루프 조인(Nested Loop Join)이나 소트 머지 조인(Sort Merge Join)보다 효율적이지 않은 경우, 특히 인덱스가 없거나 대규모 데이터를 처리할 때 성능을 개선하기 위해 사용되는 조인방법이다.

1️⃣ 임시 테이블 생성 (Creating Temporary Tables)

DROP TABLE emp_temp;
DROP TABLE dept_temp;
DROP TABLE loc_temp;
DROP TABLE coun_temp;

CREATE TABLE emp_temp AS SELECT * FROM employees;
CREATE TABLE dept_temp AS SELECT * FROM departments;
CREATE TABLE loc_temp AS SELECT * FROM locations;
CREATE TABLE coun_temp AS SELECT * FROM countries;

실습을 위해 인덱스나 제약 조건 없이 테이블 생성을 하였다.
인덱스가 없으므로 Full Table Scan이 필요하며, 해시 함수를 적용하여 데이터 비교한다.
데이터만 가지고와서 똑같은 테이블을 만드는 것이고 인덱스나 다른 제약조건이 없는 상태이다.

2️⃣해시 조인 예제 1 (Hash Join Example 1)

Location ID 기준 조인 (Example 1: Join on Location ID)

SELECT * FROM dept_temp d, loc_temp l
WHERE d.location_id = l.location_id;

dept_temp와 loc_temp를 Full Scan한다.
location_id 기준으로 해시 테이블 생성한다.
해시 테이블을 사용해 조건에 맞는 데이터를 결합한다.

Full Table Scan: dept_temp (27개), loc_temp (23개).
Hash Join: 해시 테이블을 생성하고 매칭하였다.

3️⃣해시 조인 예제 2 (Hash Join Example 2 with Filter)

Location ID와 Country ID 조합 (Example 2: Join on Location ID and Filter)

SELECT l.location_id, l.city, l.state_province, c.country_id, c.country_name
FROM loc_temp l, coun_temp c
WHERE l.country_id = c.country_id
AND l.location_id <= 2000;

loc_temp에서 location_id <= 2000 조건으로 필터링한다.
coun_temp와 country_id를 기준으로 해시 테이블 생성한다.
필터링된 loc_temp와 해시 테이블을 매칭한다.

해시 조인을 통해 조건에 맞는 데이터만 결합하였다.

4️⃣해시 조인 예제 3 (Hash Join Example 3)

Department ID 기준 조인 (Example 3: Join on Department ID)

SELECT e.last_name, d.department_id, d.department_name
FROM dept_temp d, emp_temp e
WHERE d.department_id = e.department_id;

dept_temp(작은 테이블)를 Build Input으로 사용해 해시 테이블 생성한다.
emp_temp(큰 테이블)를 Probe Input으로 사용해 매칭한다.

Full Table Scan: dept_temp (27개), emp_temp (110개).
Hash Join: department_id 기준으로 해시 테이블 생성하였다.

✅해시 조인이 유용한 상황 (Scenarios Where Hash Join is Useful)

조인 컬럼에 적당한 인덱스가 없을 때
- 인덱스가 없으면 Nested Loop Join은 Inner 테이블의 모든 행을 반복적으로 스캔해야 하기 때문에 비효율 적이게 된다.
- 하지만 해시 조인을 사용하면 해시 테이블을 생성하고, 비교 범위를 줄여 효율적으로 데이터 검색이 가능해지게 된다.
인덱스가 있어도 Inner 테이블 액세스량이 많을 때:
- Inner 테이블에 대규모 데이터가 포함된 경우, 인덱스 접근만으로도 많은 비용이 발생하게 된다.
- 한번 스캔하는 것이 비용이 많이 드는 경우 해시 조인을 통해 한 번의 Full Table Scan 후 데이터를 그룹화(나누기)하여 해당하는 부분을 Nested Loop로 조인할 수 있다. 이로써 비용 절감이 가능해진다.
대용량 테이블 조인 시:
- 수행 빈도가 낮은 대용량 테이블에서 쿼리 시간이 오래 걸릴 때 해시 조인을 활용한다. 한 번의 테이블 스캔만으로 데이터를 해시 테이블에 저장하고 비교할 수 있게 된다.
스캔 비용이 높은 경우:
- 큰 테이블을 Full Scan해야 할 때, 데이터를 해시 함수로 그룹화하여 비교.
- 그룹화된 데이터는 Nested Loop Join을 통해 효율적으로 조합 가능.

✅ 해시 조인 사용 조건 (Conditions for Using Hash Join):

한쪽 테이블이 충분히 작아야 함:
- Build Input(해시 테이블 생성에 사용되는 테이블)이 해시 영역(PGA)에 들어갈 정도로 작아야 함.
- 작은 테이블에서 해시 테이블을 생성하여 비교 작업을 단순화.
Build Input의 해시 키에 중복 값이 적어야 함:
- 해시 키 컬럼에 중복 값이 많으면 해시 테이블의 효율성이 떨어질 수 있음.
- 중복 값이 많을 경우 Nested Loop Join이나 Sort Merge Join이 더 적합할 수 있음.

3️⃣Comparison of Join Methods (조인 방식 비교)

💡요약: 조인 방식은 데이터 크기, 인덱스 유무, 작업 범위에 따라 효율성이 달라진다. 중첩 루프 조인, 소트 머지 조인, 해시 조인의 특징과 적합한 상황을 비교한다.

✅ 중첩 루프 조인 (Nested Loop Join)

특징:

기본 조인 방법: 테이블의 모든 행을 다른 테이블의 모든 행과 비교한다. (카티션 프로덕트)
소량 데이터에 적합: 데이터가 적을수록 빠른 성능을 가진다.
인덱스 필수: 조인 컬럼에 인덱스가 필요하다.
순차적 접근: 테이블 접근 순서에 따라 성능이 달라지므로 테이블의 접근 순서가 중요하다.

장점: 소량 데이터 처리에 효율적이고 부분 범위 처리 가능한 점

단점: 대량 데이터에서 비효율적. 인덱스가 없으면 성능이 크게 저하된다.

✅소트 머지 조인 (Sort Merge Join)

특징:

대량 데이터 처리에 적합: 데이터를 정렬(Sort)한 뒤 병합(Merge)하는 방식이다.
인덱스 불필요: 정렬 작업이 임시 인덱스 역할을 한다.
전체 범위 처리: 모든 데이터를 처리해야 하는 작업에 적합하다.

장점: 대량 데이터 처리가 효율적이다. 정렬 작업 후 빠른 병합이 가능하다.

단점: 소트 부하가 발생한다. 즉 정렬 작업에 따른 추가 비용이 든다는 뜻이다. 데이터가 이미 정렬되어 있으면 불필요한 소트 작업 발생할 수 있다.

✅ 해시 조인 (Hash Join)

특징:

대량 데이터와 전체 범위 처리에 적합하다. 특히 작은 테이블과 큰 테이블 조인 시 유용하다.
해시 테이블 생성: 작은 테이블을 기반으로 해시 테이블 생성 후 큰 테이블과 매칭하는 방식이다.
메모리 사용: 해시 테이블 생성에 메모리 의존으로 메모리 사용량에 영향을 받는다.

장점: 대량 데이터 처리 효율적이다. 인덱스 없이도 빠른 조인 수행이 가능하다.

단점: 메모리 크기에 의존한다. 즉 작은 테이블(Build Input)이 PGA 메모리에 들어갈 정도로 작아야한다. 해시 함수에 의한 추가 작업 필요하다.

Core Elements of PL/SQL: Cursor, Stored Procedure, and Function

Heesu Noh — Wed, 04 Dec 2024 13:39:15 GMT

Contents

1️⃣커서(Cursor)
2️⃣저장 프로시저 (Stored Procedure)
3️⃣함수 (Function)

오늘은 함수와 프로시저에 대해 공부하는데 그전에 커서에 대해서 먼저 공부해본다. 커서는 데이터베이스에 존재하는 독특한 개념이다. 데이터베이스에서 결과 집합을 한 행씩 처리할 수 있도록 제공하는 특별한 도구라고 할 수 있다. 이를 통해 대량의 데이터를 제어하고 관리할 수 있게 된다.

그 후 함수와 프로시저를 학습할 때는 다음 포인트를 중점적으로 비교하며 접근하면 좋다:

함수는 반환값이 있고, 보통 특정 계산이나 값을 반환하기 위해 사용된다.
프로시저는 반환값이 없거나, OUT 매개변수를 사용하여 값을 반환하며, 주로 비즈니스 로직을 수행하는 데 적합하다.

1️⃣커서(Cursor)

커서(Cursor) - 데이터베이스 결과집합 처리

💡요약: Cursor는 데이터베이스의 SELECT 결과를 순차적으로 탐색하며 데이터를 처리할 수 있는 도구이다. SELECT문의 결과는 항상 Result Set(결과집합) 형태로 반환되며, 이를 다루기 위해 커서를 사용하게 된다. 커서를 통해 데이터를 한 행씩 처리하면서, 원하는 로직을 적용할 수 있게 된다.

✅ 결과집합 (Result Set)

Result Set: SELECT문을 실행하면 관계형 데이터 베이스를 다루고 있기 때문에 SELECT문 관계형 데이터 베이스인 테이블, 일부 부분 집합을 가져오게 된다. 혹은 두 개의 테이블을 합쳐 조인된 테이블을 가져오기도 한다. 그래서 SELECT문을 실행한 결과는 항상 집합의 형태로 나타나게 된다. employee테이블에서 조건을 가지고 데이터를 가져오게 되면 "결과집합"의 형태로 출력 된다.

즉 결과 집합은 SELECT문이 반환하는 데이터의 집합이다. 예를 들어, employee 테이블에서 조건을 이용해 데이터를 조회하면, 결과는 테이블 형태로 반환된다.

✅ 커서(Cursor)

우리가 지금까지 배운 내용은 결과 집합을 화면에 보여주는 것을 배웠다. 나타난 결과 집합을 출력만 하는 것이 아닌 각각의 데이터 결과를 다뤄 볼 수 있을까? 데이터를 다루려고 하면 그 결과를 어딘가에 store해야 한다. 프로그래밍 언어로 치면 변수 같은 곳에 저장을 해야 그 데이터를 다룰 수 있게 된다. select문의 조회 명령문을 통해 가져온 결과집합을 어딘가에 넣어두고 처리한다에서 출발한 개념이 "커서"이다. 즉 레코드 각각에 대한 개별적인 처리가 가능한 결과 집합의 확장이 된다.

결과를 저장한 뒤, 한 행(Row)씩 처리할 수 있다.

각 레코드에 개별적으로 접근하여 원하는 작업을 수행할 수 있도록 도와준다.
예)결과집합에서 SCOTT(7788) 행을 읽은 뒤 다음 행으로 이동.

✅ 커서의 역할 (Role of Cursor)

데이터를 순서대로 가져와 프로그래밍 언어에서 처리 가능하게 한다.
데이터를 화면에 단순히 출력하는 것이 아니라 조작, 계산, 조건 처리를 할 수 있다.
레코드 단위로 데이터를 다룰 수 있어 효율적이다.

커서(Cursor) #1: 데이터베이스에서 명시적/묵시적 커서의 이해

✅ 커서의 종류 (Types of Cursor)

명시적 커서 (Explicit Cursor): 우리가 일반적으로 커서라고 하면 명시적 커서를 일컫는다. "이러한 커서 A를 선언하였습니다. 그리고 여기에 SELECT문의 결과를 가지고 오세요" 이렇게 사용자가 변수를 정리하듯이 커서를 선언한다. 그리고 가져온 결과에 대해 각각의 레코드에 대해 하나씩 데이터를 가져와서 처리한다. 명시적으로 선언하고 사용하는 것이다.
- 사용자가 직접 선언하고 관리하는 커서로, SELECT문의 결과를 다루기 위해 선언부터 처리까지 모든 과정을 사용자가 제어한다.

예시: CURSOR emp_csr IS SELECT * FROM employees;

묵시적 커서 (Implicit Cursor): 반면에 묵지적 커서는 사용자에게는 보이진 않는다. 오라클이 알아서 최근에 select문한 쿼리 결과을 임시적으로 가지고있는 내부적인 커서이다. ‘SQL’ 이라는 이름으로 속성에 접근할 수 있다. 맨 마지막에 실행된 결과값, 즉 항상 최근에 실행된 SQL 문장에 대한 커서를 가지고 있다고 생각하면 되겠다.
- 오라클이 내부적으로 자동 생성하는 커서로, SELECT문 또는 DML(INSERT, UPDATE, DELETE) 실행 시 최근 실행된 결과를 임시적으로 저장한다.

속성 접근: SQL%ROWCOUNT, SQL%NOTFOUND 등을 사용해 상태를 확인할 수 있다.

커서(Cursor) #2: 명시적 커서 (Explicit Cursor)

✅명시적 커서 처리 순서 (Explicit Cursor Workflow)

커서 선언 (DECLARE): 커서는 당연히 사용하기 전에 먼저 선언되어야 한다. 커서 선언은 변수 선언과 마찬가지로 선언된 커서는 한 개의 이름이 할당되고 SELECT 문과 연결된다.
- 예시: CURSOR emp_csr IS SELECT employee_id FROM employees;

커서 열기 (OPEN): 커서를 "연다"라는 뜻은 앞에서 커서로 정의된 쿼리문을 실행시키는 것을 뜻한다. 테이블에 있는 데이터를 커서로 가져오는 명령어이다. 실행 시킨후 해당 커서로 결과 집합을 가져온다. 라고 이해하면 되겠다.

예시: OPEN emp_csr;

패치 (FETCH): 결과 집합이 커서라는 변수에 들어있는데 쿼리의 결과에 접근하여 그 데이터를 하나씩 가져오는 것이다. "하나씩"가져오는 것이 중요하다.
한번 패치 후 그 다음 패치 가져오고, 다시 패치 후 그 다음 패치 가져온다. 결과 집합의 5개의 레코드가 있 으면 5번 패치가 가능하다. 결과집합에서 한 행씩 데이터를 가져와 변수에 저장한다.

예시: FETCH emp_csr INTO emp_id;

커서 닫기 (CLOSE): 패치 후 결과 집합이 empty 가 된다면 커서를 닫고 자원을 반환하게 된다. 결과 집합을 갖고서 첫번째 레코드부터 마지막 레코드까지 하나씩 패치하여 데이터를 하나씩 가져오게된다. 이것이 명시적 커서의 처리방법이다. 커서를 닫고 자원을 해제한다.

예시: CLOSE emp_csr;

✅ 다시 한번 정리 하자면 selelct문의 결과는 결과를 그냥 보여줄 뿐이다. 그런데 이 결과집합의 각각의 레코드에 대해 무언가 해보고싶다면 커서를 이용하게 된다.

✅ 명시적 커서 예제 (Explicit Cursor Example)

DECLARE -- 커서 선언(SELECT문과 연결)
   CURSOR emp_csr IS
      SELECT employee_id FROM employees
      WHERE department_id = 100;

   emp_id employees.employee_id%TYPE;

BEGIN
   OPEN emp_csr; -- 커서 열기 (커서로 정의된 쿼리 실행)

   LOOP
      FETCH emp_csr INTO emp_id; --패치, 현재 데이터 행을 한 행씩 OUTPUT변수에반환
      EXIT WHEN emp_csr%NOTFOUND;
      DBMS_OUTPUT.PUT_LINE(emp_id);
   END LOOP;

   CLOSE emp_csr; -- 커서닫기, 커서 사용을 마치고 자원을 반납
END;

커서 선언: CURSOR emp_csr IS SELECT employee_id FROM employees WHERE department_id = 100;
- department_id가 100인 직원들의 employee_id를 가져오는 커서를 선언한다.
커서 열기: OPEN emp_csr;
- 커서가 정의되었으니 이제 오픈을 해야한다. 이 오픈 명령어는 커서의 질의문을 수행하고 그 결과집합을 커서에 저장하게 된다. 그 다음 LOOP를 작동한다.
데이터 처리: FETCH emp_csr INTO emp_id;
- 결과집합에서 한 행씩 데이터를 emp_id 변수에 저장한다. 또한 항상 맨 위에 있는 레코드 부터 접근하게 된다.
- 더이상 데이터가 없을때까지(%NOTFOUND) emp_id에 store하게 된다.
커서 닫기: CLOSE emp_csr;
- 사용이 끝난 커서를 닫고 자원 반환하게 된다.
실행 결과: 110, 109, 108... 등의 숫자가 출력되며, 이는 department_id가 100인 직원들의 employee_id이다.

✅명시적 커서의 FOR .. LOOP문 사용 (FOR Loop with Cursor)

FOR 루프를 사용하면 OPEN, FETCH, CLOSE 과정을 자동으로 처리하게 된다.

DECLARE
   CURSOR emp_csr IS
      SELECT employee_id FROM employees
      WHERE department_id = 100;

BEGIN
   FOR item IN emp_csr LOOP
      DBMS_OUTPUT.PUT_LINE(item.employee_id);
   END LOOP;
END;

FOR 루프는 명시적 커서를 간결하게 작성할 수 있는 방법이다.
item 변수를 통해 레코드를 자동으로 처리하며, 별도의 OPEN, FETCH, CLOSE가 필요 없게된다.
커서는 for문/ loop문 하고 같이 사용하는 경우가 굉장히 많다. 어떤 조건을 줄 때에는 loop가 많이 사용되고 조건 없이 커서의 모든 레코드를 탐색 할때는 for문을 사용하는 것이 유리하다.

커서(Cursor) #3: 묵시적 커서 (Implicit Cursor) - 오라클 내부의 자동 커서 처리

💡요약: Implicit Cursor (묵시적 커서)는 오라클 데이터베이스에서 자동으로 생성되며, 최근 실행된 SQL 문장(특히 DML 문장: INSERT, UPDATE, DELETE)에 대한 정보를 저장한다. 이는 내부적으로 사용되며, 명시적으로 선언하지 않아도 자동으로 생성된다.

✅ 묵시적 커서의 특징 (Key Features)

묵시적 커서와 사용자의 접점
- 묵시적 커서는 오라클 내부에서 자동으로 처리되며, 사용자가 직접 접하는 경우는 많지 않다.
- 주로 간단한 상태 확인(예: SQL 실행 성공 여부)이나 영향을 받은 행의 수를 확인하는 데 사용된.

자동 생성 (Automatically Generated)
- 묵시적 커서는 사용자가 선언하지 않아도 SQL 실행 시 자동으로 생성됩니다.
- 명시적 커서와 달리 프로그래밍적으로 선언, 열기, 닫기 과정이 필요하지 않습니다.
DML 문과 주로 사용 (Used with DML Statements)
- INSERT, UPDATE, DELETE 문 실행 시 생성되며, 쿼리 결과를 임시 저장합니다.
최근 SQL 문 정보 저장 (Tracks Last SQL Statement)
- 항상 최근 실행된 SQL 문에 대한 정보를 속성(SQL%ROWCOUNT, SQL%FOUND, SQL%NOTFOUND 등)을 통해 접근할 수 있습니다.
간단한 작업 처리에 유용 (Simple Operations)
- 묵시적 커서는 데이터를 조회하거나 변경한 결과를 빠르게 확인하기에 적합합니다.

✅ 묵시적 커서 속성 (Implicit Cursor Attributes)

SQL%ROWCOUNT: 영향받은 행의 수 (Number of Rows Affected)
- 최근 실행된 SQL 문(INSERT, UPDATE, DELETE 등)에 의해 영향을 받은 행의 개수를 나타냅니다.
- 다른 SQL 문을 실행하면 값이 업데이트됩니다.

DBMS_OUTPUT.PUT_LINE('Rows affected: ' || SQL%ROWCOUNT);

SQL%FOUND: SQL 실행 성공 여부 (SQL Execution Success)
- 최근 SQL 문에 영향을 받은 행이 1개 이상일 경우 TRUE를 반환한다.

IF SQL%FOUND THEN
   DBMS_OUTPUT.PUT_LINE('Rows affected.');
END IF;

SQL%NOTFOUND: SQL 실행 실패 여부 (SQL Execution Failure)
- SQL%FOUND와 반대의 개념이다. 최근 SQL 문에 영향을 받은 행이 없을 경우 TRUE를 반환한다.

IF SQL%NOTFOUND THEN
   DBMS_OUTPUT.PUT_LINE('No rows affected.');
END IF;

SQL%ISOPEN: 커서 열림 상태 (Cursor Open Status)
- 묵시적 커서는 자동으로 닫히기 때문에 항상 FALSE를 반환하게 된다.

IF SQL%ISOPEN THEN
   DBMS_OUTPUT.PUT_LINE('Cursor is open.');
END IF;

✅ 묵시적 커서 예제 (Implicit Cursor Example): 묵시적 커서를 사용자가 접하게 되는 경우는 많이 없기 때문에 간단하게 살펴보고 넘어가도록 한다.

1. 테이블 초기화 및 생성

DROP TABLE employees_temp;
CREATE TABLE employees_temp AS
SELECT * FROM employees;

employees_temp라는 임시 테이블이 이미 존재할 수 있으므로 삭제(DROP TABLE) 후 새로 정의하게 된다.
CREATE TABLE 문을 사용해 employees 테이블의 데이터를 복사한다.

2. DELETE문과 SQL%ROWCOUNT 사용

DECLARE
   mgr_no NUMBER(6) := 122;
BEGIN
   DELETE FROM employees_temp WHERE manager_id = mgr_no;

   DBMS_OUTPUT.PUT_LINE(
      'Number of employees deleted: ' || TO_CHAR(SQL%ROWCOUNT)
   );
END;

DECLARE후 mgr_no 를 변수로 선언한뒤 이 변수에 122를 할당한다.
DELETE 문 수행한다. 이를 통해 employees_temp 테이블에서 manager_id가 122인 레코드를 삭제하게 된다. 6명이면 6명, 9명이면 9명이 삭제가 될 것이다.
이럴때 방금 수행한 명령문인 DELECT에 SQL%ROWCOUNT 커서가 잠시 열려 값을 갖고 있는데 SQL%ROWCOUNT의 역할은 방금 수행한 질의문의 영향을 받은 로우의 수를 출력해주는 것이다. 이를 사용해 영향을 받은 행의 개수를 출력하게 된다.

3. 실행 결과: 첨부된 세 번째 이미지를 참고하면,

  Number of employees deleted: 8
  PL/SQL 프로시저가 성공적으로 완료되었습니다.

manager_id = 122인 레코드 8개가 삭제되었다.
이때 이 (SQL%ROWCOUNT) sql 묵시적 커서를 지칭하는 역할을 한다.

💡PL/SQL 서브프로그램(Subprogram)

서브프로그램(Subprogram)은 메인 프로그램의 작업을 분리하고 반복 작업을 모듈화한 프로그램 블록이다.

PL/SQL에서는 저장 프로시저(Stored Procedure), 함수(Function), 패키지(Package), 트리거(Trigger)를 서브프로그램(Subprogram)이라고 부른다. 이들은 각각 특정 목적에 맞게 사용된다.
서브프로그램은 메인 프로그램에서 자주 사용되는 기능이나 반복적인 작업을 모듈화하여 정의한 후, 필요할 때 호출하여 사용할 수 있는 프로그램 블록을 뜻한다.
메인 프로그램의 작업을 분리하여 효율성을 높이고 유지보수를 용이하게 하는 장점이 있다.
데이터베이스 내부 구조를 숨기고, 서브프로그램만 노출하여 데이터 보호할 수도 있다.

💡저장 프로시저와 함수의 사용 차이

프로그래밍 언어에서는 주로 계산이나 값 반환을 위한 함수(Function)를 많이 사용한다.
데이터베이스 프로그래밍에서는 데이터 조작이나 상태 변경 작업이 많아 저장 프로시저(Stored Procedure)가 더 자주 사용된다.

2️⃣저장 프로시저 (Stored Procedure)

💡요약: Stored Procedure는 데이터베이스 내에서 반복적으로 수행되는 작업을 효율적으로 처리하기 위해 사용된다. 재사용성과 성능 향상이 주요 장점으로, 잘 튜닝된 SQL을 미리 컴파일해 저장하고 반복적으로 호출할 수 있는 장점이 있다. DML 작업(INSERT, UPDATE, DELETE)뿐만 아니라, 조건부 로직과 파라미터를 활용한 복잡한 작업도 처리 가능하다.

✅ 저장 프로시저의 정의 (Stored Procedure)

데이터베이스 내에서 사전에 작성된 PL/SQL 문장의 집합으로, 특정 작업을 수행하기 위해 데이터베이스 서버에 저장된 프로그램 블록이다.
주로 데이터베이스의 상태 변경 작업(INSERT, UPDATE, DELETE)에 사용된다.
"일련의 PL/SQL 문장을 사전에 작성하고 컴파일한 다음 실행할 수 있는 상태로 만들어 데이터베이스 서버에 저장해 놓은 프로시저."

✅저장 프로시저의 활용

DML 문장 작업의 효율화
- 테이블의 레코드를 삽입(INSERT), 수정(UPDATE), 삭제(DELETE)할 때 사용.
- 예를 들어, 특정 조건에 따라 급여를 수정하거나 특정 데이터 그룹을 삭제하는 작업을 자동화할 수 있다.
파라미터를 이용한 동적 데이터 처리
- 저장 프로시저는 입력 파라미터를 통해 데이터를 동적으로 처리하고, 필요 시 결과값을 반환할 수 있다.
- 호출 시 파라미터만 전달하면, 프로시저 내부에서 해당 데이터를 사용해 작업을 수행할 수 있다.

✅ 저장 프로시저를 사용하는 이유와 장점

1. 재사용성

SQL 문장을 매번 작성할 필요 없이, 자주 사용하는 DML 문장을 저장 프로시저로 정의하여 재사용 가능하기 때문이다.
잘 작성된 프로시저는 실수를 줄이고 다음에 또 만들지 않아도 되기 때문에 유지보수를 용이하게 한다.

2. 성능 최적화

미리 컴파일된 상태로 저장: 저장 프로시저는 작성 후 컴파일되어 데이터베이스에 저장되므로, 실행 시 실행 계획을 다시 계산하지 않아도 된다. 실행 계획이 변경되지 않으므로, 매번 새로 컴파일할 필요 없이 바로 실행 가능하다는 뜻이다.
성능 향상: 질의문 실행 시 매번 새로 작성하는 것보다, 미리 튜닝된 저장 프로시저를 사용하면 성능이 높아지게 된다.

3. 유지보수성

중앙 관리 가능: 저장 프로시저를 데이터베이스에 저장해 놓으면, 로직이 한 곳에 통합되므로 유지보수가 용이해진다. 변경이 필요한 경우, 저장 프로시저를 수정하면 된다.

4. 보안 강화

사용자에게 저장 프로시저만 노출하여 데이터베이스 구조와 로직을 숨길 수 있다.
개발자들이 테이블에 직접 접근하지 않고 저장 프로시저를 통해 작업을 수행하도록 제한하는 것이다.

5. 파라미터 활용

저장 프로시저는 입력(IN), 출력(OUT), 입출력(IN OUT) 파라미터를 지원하여, 유연한 데이터 처리가 가능하다.

✅ 저장 프로시저와 함수의 차이

저장 프로시저(Stored Procedure) #1: 구조와 실행

💡요약: 저장 프로시저는 데이터베이스 상태를 변경하거나 반복 작업을 효율적으로 처리하기 위해 사용되는 서브프로그램이다. 첨부된 이미지들을 참고하여 저장 프로시저의 구조와 실행에 대해 자세히 살펴보자

✅ Database Applications와 Stored Procedure의 관계

왼쪽의 Database Applications:
- 데이터베이스 프로그램 코드들이 전시되어 있다. 이 중에서 stored procedure를 호출하게 된다.
- hire_employees(...): 이 함수는 직원 고용 정보를 employee 테이블에 한명의 개인을 삽입하는 저장 프로시저로 보인다
- 예를 들어, 이것을 매번쓰는 것이 아닌 새로운 직원 정보를 추가할 때마다 이 저장 프로시저를 호출하여 동일한 코드를 반복 작성하지 않아도 된다. stored procedure begin 과 end사이에 들어가게 된다.

Stored Procedure의 주요 특징:
- BEGIN과 END 사이에 데이터베이스 상태를 변경하는 명령문들이 포함된다
- 프로그램 코드가 저장 프로시저를 호출하여 작업을 위임한다.
- 함수하고 비슷하긴 하지만 함수는 주로 데이터베이스의 상태를 변경하는 것에 사용하진 않는다. 함수는 주로 데이터 베이스에서 값을 가지고와서 총 합 계산, 평균 계산, 가장 큰 값 가져오기 등 데이터베이스에서 어떤 값을 가져와서 필요한 값을 return 할때 함수를 많이 쓰는 반면, 저장프로시져는 데이터베이스의 상태를 변경, 위임하는데 사용한다.
- 위에서 언급된 저장 프로시저의 장점 중 “보안성” 설명을 추가하자면 stored procedure의 권한은 dba에게 있다면 나머지 개발자들은 database applications만 불러서 사용하게 된다. 그렇게 되면 실제 데이터구조가 어떤지, 어떤 데이터를 가지고있는지 개발자들은 접근할 필요가 없게 된다.

✅저장 프로시저 정의 및 실행 (Defining and Executing a Stored Procedure)

문법 같은 경우는 좋은 예제를 가지고 조금씩 수정하면서 사용하는 것이 제일 좋다. 오라클의 가장 좋은 예제는 오라클 사이트에서 찾아볼 수 있다. 그런 코드를 참고해서 필요한 코드를 작성하면 되겠다.

1. 저장 프로시저 정의

CREATE PROCEDURE emp_register
IS
BEGIN
   INSERT INTO employees (employee_id, first_name, last_name, email, hire_date, job_id)
   VALUES (EMPLOYEES_SEQ.NEXTVAL, 'HONGSEOK', 'NA', 'hsna99@cuk.edu', SYSDATE, 'IT_PROG');
END;

CREATE PROCEDURE: emp_register라는 이름의 저장 프로시저를 정의한다.
BEGIN ... END: 데이터베이스 상태를 변경하는 명령문을 포함한다.
- INSERT INTO employees:
  - 새로운 직원 정보를 employees 테이블에 삽입한다.
  - EMPLOYEES_SEQ.NEXTVAL: 시퀀스의 가장 최근값을 가져오란 뜻이다. employee_id의 다음 값을 생성.
  - first_name, last_name, email, hire_date, job_id이 값들을 INSERT INTO 에 넣는다.
- 이 저장 프로시저는 단순한 예제로, 실제로는 동적 데이터를 처리하도록 파라미터를 추가해 활용할 수 있다.

2. 저장 프로시저 실행

저장 프로시저를 호출하여 실행한다.
```
  EXEC emp_register;
```

3. 결과 확인

이미지를 보면, emp_register를 실행한 후 데이터가 employees 테이블에 삽입되었음을 확인할 수 있다.

💡저장 프로시저 정리:

데이터베이스 상태를 변경하는 작업(INSERT, UPDATE, DELETE)을 효율적으로 처리하기 위해 사용된다.

프로그램 코드에서 저장 프로시저를 호출하여 반복적인 작업을 간소화할 수 있다.
보안 및 성능 이점이 있으며, 개발자는 저장 프로시저를 호출하여 데이터베이스 작업을 위임받아 수행한다.
예제에서 emp_register는 직원 정보를 employees 테이블에 삽입하는 간단한 저장 프로시저를 보여준다.

저장 프로시저(Stored Procedure) #2: IN 파라미터 활용

💡요약: 저장 프로시저를 사용할 때 파라미터(IN, OUT, IN OUT)를 활용하여 데이터베이스 작업을 동적으로 수행할 수 있다. 이번 예제에서는 IN 파라미터를 통해 직원 정보를 삽입하는 방법을 살펴본다.

1. 저장 프로시저 정의

CREATE PROCEDURE emp_register1
   (f_name VARCHAR2, l_name VARCHAR2, e_mail VARCHAR2, j_id VARCHAR2)
IS
BEGIN
   INSERT INTO employees (employee_id, first_name, last_name, email, hire_date, job_id)
   VALUES (EMPLOYEES_SEQ.NEXTVAL, f_name, l_name, e_mail, SYSDATE, j_id);
   COMMIT;
END;

파라미터 정의:
- 저장 프로시저를 사용할 때는 주로 파라미터를 사용하여 데이터를 준다.
- employee 한명을 삽입 하려고 할땐 employee 정보를 줘야 하는데 이것을 IN 파라미터로 주게된다. 한국어로는 매개변수이다.

f_name, l_name, e_mail, j_id와 같은 입력값을 파라미터로 받아 테이블에 데이터를 삽입한다. 즉 emp_register1을 호출할 때 이 파라미터 값들을 주겠다는 뜻이다.
각 파라미터는 VARCHAR2 타입으로 정의되었다.

INSERT INTO:
- employee_id는 시퀀스 EMPLOYEES_SEQ.NEXTVAL을 사용해 자동으로 생성된다.
- 나머지 필드는 파라미터로 전달받은 값을 삽입한다.
COMMIT:
- 데이터 삽입 후 변경 사항을 저장한다.

2. 저장 프로시저 컴파일 및 실행

emp_register1 저장 프로시저를 생성하고 컴파일하였다.
컴파일 성공 메시지가 표시되며, 예제에서 빨간색 점선 박스처럼 프로시저가 완성되었음을 확인할 수 있다. 데이터베이스에서 해당 프로시저를 사용할 준비가 완료되었다는 뜻이다.

3.프로시저 호출 및 실행

EXEC를 사용해 저장 프로시저를 호출하였다.
파라미터 값으로 first_name = 'gildong', last_name = 'hong', email = 'hdg@gmail.com', job_id = 'IT_PROG'을 전달한다.
실행 결과: 입력한 데이터를 기반으로 새로운 레코드가 employees 테이블에 삽입 되었다. (빨간 점선 박스참조)

저장 프로시저(Stored Procedure) #3: OUT 파라미터 활용

💡요약: OUT 파라미터도 존재하는데 주로 IN 파라미터를 많이 쓰게 된다. OUT 파라미터는 저장 프로시저가 수행된 후 호출한 프로그램이 결과 값을 받을 수 있도록 값을 반환하는 역할을 한다. 이는 함수의 반환값과 비슷하지만, 저장 프로시저는 리턴값이 없으므로 OUT 파라미터를 사용하여 결과를 전달하게 된다.

✅ OUT 파라미터의 특징

결과값 반환: OUT 파라미터를 통해 저장 프로시저가 완료된 후 호출한 프로그램이 결과값을 받을 수 있다.
리턴값 대신 사용: 저장 프로시저 자체는 리턴값이 없으므로, OUT 파라미터를 사용하여 값을 반환할 수 있다.
함수와의 차이: 함수는 단일 값을 반환하지만, 저장 프로시저는 여러 OUT 파라미터를 사용해 여러 값을 반환할 수 있다.

✅OUT 파라미터를 사용하는 저장 프로시저의 구조

OUT 파라미터
- emp_id OUT NUMBER: 호출한 프로그램에 employee_id 값을 반환한다.
- OUT 키워드는 프로시저가 값을 반환할 수 있도록 지정한다.
INSERT INTO
- 새로운 직원 정보를 employees 테이블에 삽입.
- emp_id는 EMPLOYEES_SEQ.NEXTVAL을 사용하여 자동 생성된 값이다.

✅실행화면

저장 프로시저를 컴파일하면 성공 메시지가 표시된다.
호출 시 OUT 파라미터로 emp_id 값을 확인할 수 있다.
- 예: 209 값이 반환되었음을 확인.

저장 프로시저(Stored Procedure) #4: DROP

불필요해진 저장 프로시저는 DROP PROCEDURE를 사용해 삭제할 수 있다.

3️⃣함수 (Function)

PL/SQL에서 함수(Function)의 특징과 저장 프로시저와의 차이점

함수와 저장 프로시저는 서로 다른 목적에 맞게 사용되며, 이들의 차이점을 이해하면 더 적합한 데이터베이스 작업을 설계할 수 있다.

✅ 함수(Function)의 정의와 특징

함수(Function)는 데이터베이스에서 계산 작업을 수행하고 결과를 반환(Return)하는 목적으로 사용된다.

PL/SQL뿐만 아니라 거의 모든 DBMS에서 함수를 지원한다.
주요 목적:
- 함수는 return값이 제일 중요하다. 항상 존재하기 때문이다. 어떤 계산을 하고 계산된 결과를 return하게 된다.
- 함수(function)는 계산을 수행하여 호출한 애플리케이션에 반환하거나 결과집합에 통합해 넣을 목적으로 사용한다.
- DBMS는 문자열 함수, 수학 함수, 집계 함수 등 많은 편리한 함수를 제공하고 사용자가 직접 함수를 정의할 수 있다.
- 데이터베이스 상태를 변경하지 않고 계산 중심의 작업 수행 호출한 애플리케이션에 반환하거나 결과집합에 통합해 넣을 목적으로 사용한다.

✅저장 프로시저(Stored Procedure)와의 차이점

함수는 항상 값을 반환한다. (RETURN).
- SELECT 문에 포함되어 호출된다.
- 계산 용도로 사용하며 데이터베이스 상태를 변경하지 못한다.
- 함수는 복잡한 계산이 필요한 경우나 급여의 합계 계산, 평균 또는 최대값과 같은 데이터를 조회하여 특정 결과를 도출해야할때 사용하기 적합하다.
저장 프로시저는 데이터베이스 상태를 변경하는 작업(INSERT, UPDATE, DELETE 등)을 수행다.
- EXEC 또는 CALL로 호출되며, RETURN 값이 없다.
- 저장 프로시저는 데이터베이스 상태를 변경해야 할때나 새로운 직원 정보 삽입 혹은 오래된 데이터 삭제와 같은 INSERT, UPDATE, DELECTE 와 같은 DML 작업을 수행할 때 사용하기 적합하다.

함수(Function) #1: PL/SQL 함수의 구조와 실행

✅ 함수의 구문 형식 (Function Syntax)

CREATE OR REPLACE FUNCTION 함수명(파라미터1 데이터타입, ...)  
RETURN 데이터타입  
IS [AS]  
   변수 선언부 ...;  
BEGIN  
   프로시저 본문 ...;  
   RETURN 변수;  
EXCEPTION  
   예외처리 ...;  
END;

CREATE OR REPLACE:
- CREATE만 써도 상관없지만
- 보통은 CREATE OR REPLACE로 혹시 함수가 이미 존재할 경우 새 함수로 대체(Replace)하게 된다.
- 기존 함수와 중복될 우려가 있는 경우 보통 이 구문을 사용한다.
RETURN 데이터타입: 중요💡
- 함수는 반드시 하나의 값을 반환해야 하며, 반환값의 데이터타입을 지정한다.
- 예: RETURN NUMBER는 숫자 값을 반환.
BEGIN ... END:
- 함수는 BEGIN 과 END사이에서 정의 된다.
- RETURN 문을 통해 반환값을 명시적으로 지정해야 한다가 함수의 가장 큰 특징이다.
EXCEPTION:
- 함수 실행 중 발생할 수 있는 예외를 처리하는 블록이다.

✅ 함수 예제: 직원 급여 반환

CREATE FUNCTION emp_salaries (emp_id NUMBER)
RETURN NUMBER IS
   nSalaries NUMBER(9);
BEGIN
   nSalaries := 0;
   SELECT salary INTO nSalaries FROM employees
   WHERE employee_id = emp_id;
   RETURN nSalaries;
END;

파라미터:
- emp_id NUMBER: 함수가 입력받는 매개변수.
- 직원 ID를 기준으로 급여(salary)를 조회.
반환값:
- RETURN NUMBER: 숫자 데이터타입을 반환.
로직:
1. nSalaries 변수를 초기화(nSalaries := 0;).
2. SELECT salary INTO nSalaries로 해당 직원의 급여를 조회하여 변수에 저장.
3. RETURN nSalaries를 통해 조회된 급여를 반환.

보안성 강화:

SQL문 으로도 할수 있는데 굳이 함수로 하는 이유이다.
테이블 구조를 노출하지 않고 함수만 노출함으로써 필요한 데이터만 반환하므로 보안성이 강화된다.
권한이 제한된 개발자는 함수만 호출할 수 있어 테이블에 직접 접근하지 못한다.

함수 정의 및 호출

✅ 함수 정의 후 객체 생성

함수 정의가 완료되면 데이터베이스에 함수 객체로 저장된다.
예: emp_salaries 함수가 객체로 생성되었음을 확인 가능하다.

✅ 함수 호출

SELECT 문을 통해 호출:
- 함수는 반환값이 있으므로 SELECT 문에 포함되어 호출된다.
- 예제에서는 직원 ID가 100인 직원의 급여를 조회.
결과: EMP_SALARIES(100): 24000(직원의 급여)이 반환되었다.

💡정리:

PL/SQL 함수는 특정 계산 작업을 수행하고 하나의 값을 반환한다.
함수는 SELECT 문에서 호출되며, 반환값을 통해 다른 SQL 작업에 활용될 수 있다.
이번 예제의 emp_salaries 함수는 직원 ID를 기반으로 급여를 반환하며, 데이터베이스 보안성을 강화할 수 있다.

함수(Function) #3: 함수의 또다른 예제: 부서 이름 변환

✅부서 이름을 반환하는 함수

CREATE FUNCTION get_dep_name (dept_id NUMBER)
RETURN VARCHAR2 IS
   sDeptname VARCHAR2(30);
BEGIN
   SELECT department_name INTO sDeptname FROM departments
   WHERE department_id = dept_id;
   RETURN sDeptname;
END;

입력 파라미터:
- dept_id NUMBER: 부서 번호를 입력받는 매개변수.
- 이 값을 기준으로 부서 이름(department_name)을 조회한다.
반환 타입:
- RETURN VARCHAR2: 함수는 문자열 타입을 반환.
로직:
- sDeptname VARCHAR2(30) 변수 선언: 부서 이름을 저장할 문자열 변수.
- SELECT department_name INTO sDeptname:
  - departments 테이블에서 department_id와 일치하는 행의 department_name을 가져온다
- RETURN sDeptname: 조회된 부서 이름을 반환한다.

✅함수 호출

SELECT get_dep_name(100) FROM dual;

호출 방법: SELECT 문을 사용해 get_dep_name 함수를 호출한다. 입력값 100을 전달하여 부서 이름을 조회한다.

✅실행 결과

입력값: dept_id = 100
출력값: Finance (부서 이름)
함수는 부서 번호 100에 해당하는 부서 이름을 반환하였다.

함수(Function) #4: 함수와 프로시저를 사용하는 장점

✅ 보안성 (Security)

데이터에 대한 직접 접근 제한:
- 데이터베이스에 직접 접근하는 대신, 프로시저와 함수를 통해 접근하도록 제한할 수 있다. 이를 통해 데이터베이스의 내부 구조를 숨기고 보안을 강화한다.
- 예: 권한이 제한된 사용자/개발자는 테이블에 직접 접근하지 않고, 제공된 함수나 프로시저만 호출 가능하다.

✅ 성능 개선 (Performance Improvement)

미리 컴파일된 상태로 저장:
- 함수와 프로시저는 미리 컴파일되어 SGA(Shared Global Area)의 공유 풀에 저장된다.
- 이로 인해 반복적인 실행 시 바로 호출 가능하며, 실행 계획을 다시 계산하지 않아도 된다.
공유 자원의 활용:
- 여러 사용자가 동일한 함수나 프로시저를 공유해 사용함으로써 성능을 최적화한다.

✅ 재사용성 (Reusability)

코드 모듈화:
- 자주 사용되거나 오류가 발생해서는 안 되는 코드를 미리 함수나 프로시저로 정의하여 모듈화한다.
- 필요할 때 호출만 하면 되므로 코드의 재사용성이 높아지고 유지보수가 용이하다.
가독성 증가:
- 복잡한 로직을 간단하게 함수나 프로시저로 캡슐화하여 프로그램의 가독성을 향상시킨다.

✅ 데이터 무결성 보장 (Integrity)

오류 발생 가능성 감소:
- 직접 SQL 문을 작성하여 사용하는 대신, 프로시저와 함수에 로직을 구현하면 통합된 로직으로 오류 발생 가능성을 줄일 수 있다.
- 개발자마다 SQL 문을 개별적으로 작성하다 보면 발생할 수 있는 불일치나 오류가 발생할 수 있다. 혹은 감지를 처음엔 못하지만 나중에 발생 할 수도 있다.
업무 로직에 따른 일관성 유지:
- 프로시저와 함수는 업무 로직에 맞추어 프로그래밍되므로 데이터 무결성을 보장할 수 있다.

Managing ORACLE database - PL/SQL

Heesu Noh — Sun, 01 Dec 2024 16:10:38 GMT

Contents

1️⃣PL/SQL
2️⃣PL/SQL 구성요소 (PL/SQL Components)
3️⃣PL/SQL 제어문(Control Structures)

PL/SQL Summary

PL/SQL (Procedural Language/Structured Query Language)은 SQL에 절차적 프로그래밍 언어의 기능을 확장한 언어로, SQL의 데이터 처리 능력과 일반 프로그래밍 언어의 제어 구조를 결합한 것이다.

✅ 주요 특징 (Key Features)

SQL과 프로그래밍 언어의 결합:
PL/SQL은 SQL의 데이터 처리 기능과 일반 프로그래밍 언어의 제어 흐름을 결합하여 데이터베이스 작업을 더 효율적으로 처리할 수 있도록 한다.
변수 및 상수 선언:
변수와 상수를 선언하여 프로그램 내에서 값을 저장하고 변경할 수 있다.
조건문 (Control Statements):
- IF문: 조건에 따라 다른 동작을 수행할 수 있다.
- CASE문: 여러 조건을 한 번에 처리할 수 있으며, WHEN을 사용하여 조건을 지정한다.
반복문 (Loops):
- FOR문: 반복 횟수가 정해져 있을 때 사용하며, 범위 내에서 반복된다.
- LOOP문: 반복 조건을 명시적으로 설정할 수 있으며, 조건을 만족할 때까지 반복된다.
- WHILE문: 주어진 조건이 true일 때만 반복하며, 조건이 false로 변하면 종료된다.

✅ PL/SQL 구성 요소 (PL/SQL Components)

변수와 상수 (Variables & Constants):
PL/SQL에서는 변수와 상수를 선언하여 데이터를 저장하고 처리할 수 있다. 상수는 값이 변경되지 않는 값을 저장하는 데 사용된다.
콜렉션 (Collections):
여러 값을 한 번에 저장할 수 있는 배열, 리스트 형태의 자료 구조이다.
레코드 (Records):
여러 필드로 구성된 구조체와 같은 자료형으로, 관련된 데이터를 함께 저장할 수 있다. 레코드는 중첩되어 사용할 수 있게 된다.

✅ PL/SQL 제어문 (Control Structures in PL/SQL)

조건문 (Conditional Statements):
- IF문: 조건에 따라 다른 처리를 진행하는 구문이다.
- CASE문: 여러 조건을 처리할 수 있는 구문으로, 여러 가지 경우의 수를 쉽게 다룰 수 있게 된다.
반복문 (Loops):
- FOR문: 반복 횟수가 정해져 있을 때 사용하며, 특정 범위 내에서 순차적으로 실행된다.
- LOOP문: 종료 조건을 명시적으로 설정하여 조건이 맞을 때까지 반복을 실행한다.
- WHILE문: 주어진 조건을 만족할 때만 반복하며, 조건이 false가 되면 종료된다.

정리 (Summary)

PL/SQL은 SQL의 데이터 처리 능력과 프로그래밍 언어의 흐름 제어 기능을 결합하여 강력한 데이터베이스 처리 시스템을 구축할 수 있게 해준다. 조건문과 반복문을 활용하여 복잡한 논리 처리를 가능하게 하고, 변수, 상수, 콜렉션, 레코드를 통해 데이터를 효율적으로 관리할 수 있게된다.

1️⃣PL/SQL

💡요약: PL/SQL (Procedural Language/Structured Query Language)은 SQL의 한계를 보완하여, 데이터베이스 내에서 논리적 흐름과 절차적 제어를 가능하게 하는 확장 언어이다. PL/SQL은 Oracle 데이터베이스에서 주로 사용되며, 다음과 같은 특징과 기능을 제공한다.

✅ PL/SQL의 특징

절차적 언어
- SQL은 내가 원하는 것만 요구하면 되는 "비절차적" 언어라면, PL/SQL은 "절차적" 언어로 조건문, 반복문, 예외 처리 등을 지원한다.
- 데이터베이스를 이용하다보면 "흐름"의 제어가 필요할 때가 있다. 조건에 따라서 A명령어 사용 후 B명령어 사용한다던지 일때다. PL/SQL은 논리적인 흐름 제어와 데이터 조작을 함께 처리할 수 있다.
SQL과의 통합
- PL/SQL은 SQL과 완벽하게 통합되어 있다. 데이터를 처리하는 SQL 명령어와 절차적 흐름을 조합하여 강력한 데이터베이스 작업이 가능하다.
트랜잭션 제어
- 트랜잭션 단위로 작업이 수행되며, 명령어의 성공 또는 실패에 따라 데이터의 일관성을 보장한다.
보안 및 성능
- 데이터베이스 내에서 직접 실행되므로 네트워크 지연이 없고, 효율적이다.
- 데이터베이스에 저장된 PL/SQL 블록은 재사용이 가능하며, 보안성이 높다.

PL/SQL #1: 정의 및 특징 (Definition and Features of PL/SQL)

💡요약: PL/SQL은 SQL에 프로그래밍 언어(Programming Language)의 기능을 결합하여, 데이터 조작뿐 아니라 조건문과 반복문 등을 통해 절차적 흐름을 제어할 수 있음을 강조한다.

PL/SQL은 SQL(Structured Query Language)의 기능을 확장하여 만든 프로그래밍 언어이다. 데이터 조작과 제어 흐름을 결합하여 더욱 강력한 데이터베이스 작업 가능하게 한다.

SQL만으로는 부족한 경우가 있다.
- 일반 SQL은 데이터를 저장하거나 수정하거나 조회하는 데는 강력하지만, 특정 조건에 따라 명령을 실행하거나 여러 단계를 거쳐야 하는 작업을 하기에는 한계가 있다.
PL/SQL의 목적
- 이러한 한계를 해결하기 위해 절차적 프로그래밍(Programming)을 SQL에 추가한 것이 PL/SQL이다.
- 예를 들어 "만약 어떤 조건이 만족되면, 데이터를 이렇게 처리하라"와 같은 명령을 내릴 수 있다.
PL/SQL의 특징
- 변수와 상수 선언 가능 (Can declare variables and constants): 데이터를 저장하거나 계산할 때 필요하다.
- 조건문 사용 가능 (Can use conditional statements): 상황에 따라 다른 명령 실행한다.
- 반복문 사용 가능 (Can use loops): 동일한 작업을 여러 번 반복한다.

PL/SQL #2: 기본 구조 (Basic Structure of PL/SQL)

💡요약:기본 구조를 세 부분으로 나누어 설명한다. 각 부분은 선언부(Declarative Part), 실행부(Executable Part), 예외처리부(Exception Handlers)로 구성되며, 특히 실행부가 PL/SQL에서 가장 중요한 핵심 역할을 담당한다고 강조하고 있다.

선언부 (Declarative Part):
- DECLARE로 시작 (Starts with DECLARE).
- 이 부분은 변수(variables)와 상수(constants)를 선언한다.
- 선택 사항이며 필요하지 않을 경우 생략할 수 있다.
실행부 (Executable Part):
- BEGIN으로 시작 (Starts with BEGIN).
- 실제 작업(로직)을 실행하는 부분다.
- 데이터 처리, 반복문, 조건문 등 모든 프로그램 흐름 제어가 여기서 이루어진다.
- 필수(required)로 포함되어야 한다. 이 부분에서 모든 핵심 작업이 수행된다.
예외처리부 (Exception Handlers):
- EXCEPTION으로 시작 (Starts with EXCEPTION).
- 실행 중에 발생할 수 있는 오류를 처리한다.
- 선택 사항이다.

PL/SQL #3: 기본 구조와 예제 이해 (Basic Structure of PL/SQL with Example)

💡요약: PL/SQL 블록의 기본 구조(선언부, 실행부, 예외처리부)를 이해하고, 각 부분에서 수행되는 역할을 강조한다. PL/SQL은 SQL뿐만 아니라 프로그래밍 언어의 제어 흐름(반복문, 조건문)을 추가로 사용할 수 있어 더 복잡한 작업을 처리할 수 있음을 설명하고 있다.

PL/SQL 블록은 하나의 완전한 프로그램 단위를 형성하며, 아래와 같은 구조를 가진다.

DECLARE (선언부):
- 변수(variables)나 상수(constants)를 선언하는 부분이다.
- 예제에서는 v_lname VARCHAR(25);로 문자열 변수를 선언했다.
- SQL문이 들어가지는 않으며, 주로 데이터 저장을 위해 변수만 선언한다.
BEGIN (실행부, Executable Part):
- 코드 실행의 핵심 부분이다.
- SQL문과 PL/SQL 로직이 결합되어 데이터 처리와 흐름 제어를 수행한다.
- 예제에서는:
  - SELECT last_name INTO v_lname FROM employees WHERE employee_id = 101;로 SQL을 실행하여 결과를 변수에 저장한다.
  - DBMS_OUTPUT.PUT_LINE로 결과를 출력한다.
EXCEPTION (예외처리부, Exception Handlers):
- 오류가 발생했을 때 실행되는 코드이다.
- 예제에서는 WHEN OTHERS THEN을 사용하여 모든 종류의 오류를 포착하고 DBMS_OUTPUT.PUT_LINE('ERRORS');로 메시지를 출력한다.
- 선택 사항이며 필요하지 않다면 생략 가능하다.

PL/SQL #4: 블록의 실행과 출력 설정 (Executing a PL/SQL Block and Output Settings)

위에서 설명한 코드를 출력한 결과이다. PL/SQL 블록 실행 시 결과를 화면에 출력하려면 SET serveroutput ON; 명령어를 먼저 실행해야 한다. 이 명령은 DBMS_OUTPUT.PUT_LINE로 출력된 결과를 SQL Developer에서 볼 수 있도록 해준다.

이 예제에서는 employee_id = 101에 해당하는 직원의 last_name이 조회되어 Kochhar라는 이름이 출력되었다.

PL/SQL #5: 예외처리 (Exception Handling in PL/SQL)

💡요약: PL/SQL의 예외처리(EXCEPTION)는 프로그램 실행 중 발생하는 오류를 처리하기 위해 사용된다. 이것은 TRY...CATCH 문을 사용하는 Java와 비슷하다. 미리 정의된 예외와 사용자 정의 예외를 처리할 수 있으며, 마지막에 WHEN OTHERS를 사용하면 예상하지 못한 오류도 처리할 수 있게된다.

✅ 기본 구조

EXCEPTION
    WHEN 예외1 THEN 예외처리1 -- 특정 오류 1에 대한 처리
    WHEN 예외2 THEN 예외처리2 -- 특정 오류 2에 대한 처리
    …
    WHEN OTHERS THEN 나머지 예외처리 -- 다른 모든 예외에 대한 처리

WHEN 예외1 THEN: 특정 예외에 대한 처리 코드를 작성한다.
WHEN OTHERS THEN: 정의되지 않은 모든 예외를 처리한다. (옵션)

✅ 예제 코드

BEGIN
    -- 실행부: 오류가 발생할 가능성이 있는 코드
    SELECT salary INTO v_salary
    FROM employees
    WHERE employee_id = 999; -- 없는 ID를 조회해 의도적으로 오류 발생
EXCEPTION
    WHEN NO_DATA_FOUND THEN
        DBMS_OUTPUT.PUT_LINE('No data found for the given employee ID.'); -- 데이터가 없을 경우 처리
    WHEN OTHERS THEN
        DBMS_OUTPUT.PUT_LINE('An unexpected error occurred.'); -- 기타 오류 처리
END;

설명:

WHEN NO_DATA_FOUND THEN:
- employee_id = 999인 데이터가 없을 경우 이 블록이 실행된다.
- 오류 메시지: "No data found for the given employee ID."
WHEN OTHERS THEN:
- 다른 모든 예기치 못한 오류를 처리한다.
- 예를 들어, 데이터베이스 연결 문제나 문법 오류 등이 포함된다.

✅ 실행 흐름

BEGIN에서 지정된 SQL을 실행한다.
실행 중 오류가 발생하면 EXCEPTION 블록으로 이동한다.
오류 유형에 따라 WHEN ... THEN 조건문이 실행된다.
WHEN OTHERS는 정의되지 않은 모든 오류를 처리한다.

PL/SQL #6: 미리 정의된 예외와 처리 방법 (Predefined Exceptions in PL/SQL)

💡요약: PL/SQL은 실행 중 발생할 수 있는 미리 정의된 예외(predefined exceptions)를 제공한다. 특정 오류를 자동으로 인식하고 처리하는 도구이다. 너무 자세히 알 필요는 없고, "이런 오류가 있을 수 있구나" 정도만 이해하면 된다.
이 예외들은 특정 상황에서 프로그램 실행을 중단하지 않고 오류를 처리하도록 도와준다.

✅ 처리 방법 (예제 코드)

DECLARE
    v_number NUMBER;
BEGIN
    -- 오류를 의도적으로 발생시키는 코드
    v_number := 10 / 0; -- 0으로 나눔
EXCEPTION
    WHEN ZERO_DIVIDE THEN
        DBMS_OUTPUT.PUT_LINE('Cannot divide by zero!'); -- 0으로 나눌 경우 처리
    WHEN INVALID_NUMBER THEN
        DBMS_OUTPUT.PUT_LINE('Invalid number encountered.'); -- 잘못된 숫자 처리
    WHEN OTHERS THEN
        DBMS_OUTPUT.PUT_LINE('An unexpected error occurred.'); -- 기타 오류 처리
END;

✅ 실행 흐름

오류가 발생하면 PL/SQL은 실행을 중단하지 않고 EXCEPTION 블록으로 이동한다.
발생한 오류가 WHEN ZERO_DIVIDE 또는 WHEN INVALID_NUMBER와 일치하면 해당 처리를 실행한다.
WHEN OTHERS는 위에서 처리되지 않은 나머지 모든 오류를 처리한다.

✅ 중요한 포인트

미리 정의된 예외는 특정 오류를 자동으로 인식하고 처리하는 도구이다.
자주 사용되는 예외:
- NO_DATA_FOUND: SELECT 결과 없음.
- ZERO_DIVIDE: 0으로 나눔.
- INVALID_NUMBER: 숫자 변환 오류.
EXCEPTION 블록을 사용하면 이러한 오류를 안전하게 처리할 수 있다.
옵션이 없다면 실행이 중단되므로 예외 처리를 적절히 설정하는 것이 중요하다.

PL/SQL #7: 엔진의 역할과 구조 (Role and Structure of the PL/SQL Engine)

💡요약: PL/SQL 엔진(PL/SQL Engine)은 PL/SQL 코드를 컴파일하고 실행하며, SQL 문장을 처리하고 그 결과를 종합하는 것이다. 오라클 데이터베이스 내부에서 PL/SQL 블록을 실행하는 중요한 구성 요소이다.

💡 중요한 포인트

1. PL/SQL 엔진은 PL/SQL 코드를 처리하고 SQL 문을 실행하는 역할을 한다.
  1. 실행 단계:
    - 컴파일 → PL/SQL 엔진 실행 → SQL 문 처리 → 결과 종합
  2. Database Application은 외부 프로그램이며, 결과 종합 단계에서 PL/SQL 엔진을 통해 데이터베이스와 연결된다.
  3. PL/SQL 엔진은 SQL문과 PL/SQL 로직을 효율적으로 처리하는 독립적인 구조를 가지고 있다.

PL/SQL 엔진을 별도로 두는 이유는 효율적인 실행과 데이터베이스와의 통합을 위해서입니다. 이 정도만 이해하면 되겠다.

컴파일 단계 (Compilation):
- PL/SQL 블록은 먼저 System Global Area(SGA)에서 컴파일된다.
- 컴파일후, 이 컴파일된 코드는 PL/SQL 엔진으로 전달된다.
실행 단계 (Execution):
- PL/SQL 엔진은 컴파일된 코드를 실행한다.
- 이 과정에서 SQL문(statement)은 데이터베이스에 전달되어 실행된다.
- SQL Statement Executor는 SQL 문장을 처리한 결과를 PL/SQL 엔진으로 반환한다.
종합 및 실행 (Processing Results):
- PL/SQL 엔진은 SQL 실행 결과를 종합해 데이터베이스 응용 프로그램(database application)에 전달한다.
- 여기서 Database Application은 PL/SQL 엔진 외부에서 실행되는 프로그램을 의미한다.

2️⃣PL/SQL 구성요소 (PL/SQL Components)

💡요약: PL/SQL도 다른 프로그래밍 언어처럼 다양한 구성 요소를 제공한다. 대표적인 요소로는 변수(Variables)와 상수(Constants)가 있다. 이 두 가지는 데이터를 저장하고 처리하는 데 핵심적인 역할을 하며, 정보 시스템이나 대부분의 프로그래밍에서 없어서는 안 되는 중요한 개념이다.

✅ 변수(Variables)란? 변수(Variable)는 데이터를 저장하는 메모리 공간이다. 프로그램이 데이터를 읽고 처리한 결과를 저장하거나, 최종 결과를 출력하기 위해 사용된다.

변수의 역할

데이터 저장: 예를 들어, 숫자, 문자열, 이미지 등을 저장합니다.
중간 결과 저장: 계산 도중 중간 값을 저장하여 후속 처리를 가능하게 합니다.
최종 출력 준비: 결과 값을 변수에 저장한 후, 이를 출력하거나 활용합니다.

변수 선언 예제

emp_num1 NUMBER(9); 
-- 직원 번호를 저장하기 위한 숫자형 변수 선언

✅ 상수(Constants)란? 상수(Constant)는 한 번 정의하면 값이 변하지 않는 데이터이다.
이는 프로그램에 제약을 추가하여 코드의 일관성을 유지하고, 의도치 않은 오류를 방지한다.

상수를 사용하는 이유

코드 안정성: 여러 개발자가 작업 중, 중요한 데이터가 실수로 수정되는 것을 방지한다.
가독성 향상: 코드에서 특정 숫자나 값의 의미를 명확히 나타낼 수 있다.
예: nYear CONSTANT INTEGER := 30;는 "30"이 연도를 의미함을 명확히 한다.

상수 선언 예제

Year CONSTANT INTEGER := 30;  
-- 30이라는 값은 "변경 불가"하며 연도와 관련된 고정된 값임을 나타낸다.

💡 중요한 개념 요약

변수는 데이터를 저장하고 가공하는 데 사용되는 기본적인 메모리 공간이다.
- 예: emp_num1 NUMBER(9); (9자리 숫자형 데이터를 저장)
상수는 값을 변경할 수 없으며, 프로그램의 안정성과 가독성을 높이는 데 도움을 준다.
- 예: nYear CONSTANT INTEGER := 30; (30은 변하지 않는 값)
상수의 목적은 제약을 통해 오류를 방지하고, 협업 중 코드의 일관성을 유지하는 데 있다.

PL/SQL 구성요소 #1: 변수와 데이터 타입 (PL/SQL Variables and Data Types)

💡요약: PL/SQL에서 변수(Variables)는 데이터 값을 저장하는 메모리 공간이다. 하지만, 변수는 타입(Type)과 항상 함께 정의된다. 타입이란 저장할 데이터의 종류를 나타낸다. 예를 들어, 정수는 정수 타입, 실수는 실수 타입, 문자열은 문자열 타입이어야 한다. 이렇게 타입을 명시하는 이유는 데이터가 올바른 형태로 저장되도록 하기 위해서이다. 또한, 데이터베이스 컬럼의 데이터 타입과 변수 타입은 일관성을 유지해야 한다.

✅ %TYPE - 테이블 컬럼의 데이터 타입 사용

PL/SQL에서 변수를 정의할 때, 기존 테이블의 컬럼에서 데이터 타입을 그대로 가져올 수 있다. 이때 사용하는 것이 %TYPE이다.

Salaries EMPLOYEES.SALARY%TYPE;
-- EMPLOYEES 테이블의 SALARY 컬럼의 데이터 타입을 사용하여 변수를 정의

EMPLOYEES.SALARY%TYPE은 EMPLOYEES 테이블의 SALARY 컬럼의 데이터 타입을 그대로 사용하는 방법이다.
이점: 테이블 컬럼의 데이터 타입을 그대로 사용하므로, 데이터베이스 구조 변경 시 변수의 타입도 자동으로 일관되게 유지된다.

✅%ROWTYPE - 테이블의 한 레코드를 변수로 선언

%TYPE과 유사하나 %ROWTYPE은 하나 이상의 값을 묶어서 사용할 때 유용한 방법이다. 이 방식은 테이블의 전체 레코드를 한꺼번에 변수로 선언할 수 있게 해준다.

emp_record EMPLOYEES%ROWTYPE;
-- EMPLOYEES 테이블의 한 행(Row)을 저장할 수 있는 변수 선언

EMPLOYEES%ROWTYPE은 EMPLOYEES 테이블의 한 레코드 전체를 저장할 수 있는 변수를 정의하는 방법이다.
이 변수에는 테이블의 모든 컬럼의 데이터가 포함된다.

💡 중요한 개념 요약

%TYPE은 테이블의 컬럼 데이터 타입을 그대로 사용하는 방법이다.
- 예: EMPLOYEES.SALARY%TYPE (테이블의 SALARY 컬럼 타입을 그대로 사용)
%ROWTYPE은 테이블의 전체 행(레코드)을 변수로 선언할 때 사용한다.
- 예: EMPLOYEES%ROWTYPE (테이블의 한 행을 저장하는 변수)
타입 일관성은 데이터베이스의 컬럼과 변수가 같은 타입을 유지하도록 보장한다.

PL/SQL 구성요소 #2: 콜렉션 (Collection)

💡요약: PL/SQL에서 변수는 기본적으로 하나의 값만을 저장할 수 있습니다. 하지만, 배열 형태의 데이터값을 PL/SQL에서도 지원하는데 이를 콜렉션(Collection)이라고 한다. PL/SQL에서는 세 가지 종류의 콜렉션을 지원한다. VARRAY, Nested Table, Associative Array. 각각의 특성을 이해하는 것이 중요하다.

✅ VARRAY (변수 크기 배열)

- 고정된 크기: VARRAY는 선언할 때 배열의 크기를 정해 놓아야 한다. 크기는 고정되지만, 중간에 배열의 크기를 변경할 수 있다.
  - 숫자형 인덱스: VARRAY는 숫자형 인덱스를 사용하여 배열의 요소를 접근한다.
  - 순서와 밀집된 데이터: VARRAY는 순서가 중요한 데이터를 처리하는 데 유리하며, 밀집된 데이터 집합을 처리할 때 사용된다. 예를 들어, 학생들의 점수, 주문 목록 등 순서대로 처리할 필요가 있는 데이터에 적합하다.
  - 일부 원소 삭제 불가: VARRAY에서 원소는 삭제할 수 없으며, 전체 배열을 삭제해야 한다. 즉, 중간에 원소를 제거할 수는 없고, 배열을 전체적으로 초기화해야 한다.
  - 배열의 크기 변경 가능: 배열의 크기를 선언할 때 크기를 지정하지만, 이후 배열 크기를 동적으로 변경할 수 있다.
  - 테이블 내 저장 가능: VARRAY는 데이터베이스 테이블의 컬럼 타입으로 사용할 수 있다. 예를 들어, 한 테이블의 컬럼에 여러 값을 배열로 저장할 수 있다.

DECLARE
-- VARRAY 타입 정의
    TYPE num_array IS VARRAY(5) OF NUMBER;  -- 5개의 숫자를 저장할 수 있는 배열 타입 선언
    nums num_array := num_array(1, 2, 3, 4, 5);  -- 배열 초기화
BEGIN
    -- 배열 값 출력
    DBMS_OUTPUT.PUT_LINE(nums(1));  -- 첫 번째 요소 출력
END;

✅ 제한 사항

배열의 크기 고정: VARRAY를 선언할 때 크기를 고정해야 하므로, 동적으로 크기를 변경하는 데는 제한이 있다.
부분 삭제 불가: 배열 내 원소는 삭제할 수 없으며, 전체 배열 삭제만 가능하다.
단일 타입의 데이터만 사용 가능: VARRAY는 배열 내 동일한 데이터 타입을 갖는 원소들만 포함할 수 있다.

✅VARRAY (변수 크기 배열) 사용 예시: 복잡해보이지만 전혀 어렵지 않다. 예제코드가 이미 잘 정리되어있기 때문에 필요시마다 찾아서 하면 된다. 배열 선언과 배열 초기화, 그리고 배열의 값 출력을 다루고 있습니다. 이 코드를 이해하고 필요할 때 참조하면 복잡한 PL/SQL 배열 사용에 대한 이해가 쉬워질 것이다.

DECLARE
    TYPE Foursome IS VARRAY(4) OF VARCHAR2(15);  -- VARRAY type 선언
    -- 'Foursome'은 크기 4의 문자열 배열을 정의
    team Foursome := Foursome('John', 'Mary', 'Alberto', 'Juanita');  -- 배열 초기화
BEGIN
    DBMS_OUTPUT.PUT_LINE('---');
    FOR i IN 1..4 LOOP  -- 배열의 각 요소에 접근
        DBMS_OUTPUT.PUT_LINE(i || '.' || team(i));  -- 배열의 원소 출력
    END LOOP;
    DBMS_OUTPUT.PUT_LINE('---');
END;

VARRAY 타입 정의
```
 TYPE Foursome IS VARRAY(4) OF VARCHAR2(15);
```
- Foursome이라는 이름의 VARRAY 타입을 정의한다. 이 배열은 4개의 요소를 가질 수 있으며, 각 요소는 최대 15자의 문자열이다.
VARRAY 변수 초기화
```
 team Foursome := Foursome('John', 'Mary', 'Alberto', 'Juanita');
```
- team이라는 VARRAY 변수를 선언하고, 'John', 'Mary', 'Alberto', 'Juanita'라는 문자열 값을 배열로 초기화한다.
배열 값 출력
```
 FOR i IN 1..4 LOOP
     DBMS_OUTPUT.PUT_LINE(i || '.' || team(i));
 END LOOP;
```
- FOR 루프를 사용하여 배열의 각 원소를 출력한다. 배열의 인덱스는 1부터 4까지이며, 각 인덱스에 해당하는 team(i)의 값을 출력한다.
- DBMS_OUTPUT.PUT_LINE은 출력문을 담당한다. i || '.' || team(i)는 배열의 인덱스와 해당 인덱스의 값을 문자열로 연결하여 출력하는 부분이다.
구분선 출력
```
  DBMS_OUTPUT.PUT_LINE('---');
 END;
```
이 부분은 배열의 출력 전에 구분선을 출력하여, 출력되는 값들을 더 명확하게 구분하기 위한 구문이다. 구분선으로 ---를 출력하여 배열 항목들을 보기 좋게 구분할 수 있다.

✅ Nested Table (중첩 테이블)

동적 크기: NESTED TABLE은 처음 선언할 때 크기를 명시할 필요가 없으며, 사용에 따라 크기가 동적으로 변경된다. 즉, 배열의 크기를 제한하지 않고 유연하게 데이터를 추가할 수 있다.
숫자형 인덱스: 이 배열은 숫자형 인덱스를 사용하며, 데이터가 추가될 때 순차적으로 인덱스가 증가한다. 처음에 a,b,c가 있고 d,f가 추가가 된다면 순서대로 1,2,3,4,5로 구성된다.
삭제가 가능: NESTED TABLE은 중간에 원소를 삭제할 수 있는 특성이 있다. 삭제된 위치는 NULL로 처리되고, 테이블이 흩어진 상태(sparse)로 듬성 듬성한 배열이 될수있다.
밀집(dense)과 흩어진(sparse) 데이터: 처음에는 데이터를 밀집된 형태로 처리하지만, 일부 원소가 삭제되면 배열이 흩어지게 된다. 이로 인해 NESTED TABLE은 매우 유연하게 데이터를 처리할 수 있게 된다.
테이블의 컬럼으로 사용 가능: NESTED TABLE은 테이블 내 컬럼 타입으로도 사용할 수 있지만, 성능상의 이유로 자주 사용되지 않는 것이 권장된다.
문법: TYPE 타입명 IS TABLE OF 요소데이터 타입 [NOT NULL]

DECLARE
    TYPE num_table IS TABLE OF NUMBER;  -- 숫자들을 저장하는 중첩 테이블 타입 선언
    nums num_table := num_table(1, 2, 3, 4, 5);  -- 테이블 초기화
BEGIN
    DBMS_OUTPUT.PUT_LINE(nums(1));  -- 첫 번째 요소 출력
END;

중첩 테이블은 배열처럼 여러 값을 저장하지만, 동적 크기를 갖는다.

✅ Nested Table (중첩 테이블) 사용 예시

DECLARE
    TYPE Roster IS TABLE OF VARCHAR2(15); -- NESTED TABLE 타입 정의
    -- NESTED TABLE 변수 초기화
    names Roster := Roster('D Caruso', 'J Hamil', 'D Piro', 'R Singh');
BEGIN
    DBMS_OUTPUT.PUT_LINE('---');
    -- FIRST와 LAST 메소드를 사용하여 배열의 처음부터 끝까지 순회
    FOR i IN names.FIRST .. names.LAST LOOP
        DBMS_OUTPUT.PUT_LINE(names(i)); -- 각 이름 출력
    END LOOP;
    DBMS_OUTPUT.PUT_LINE('---');
END;

NESTED TABLE 타입 선언
```
 TYPE Roster IS TABLE OF VARCHAR2(15);
```
Roster라는 이름의 NESTED TABLE 타입을 정의하였다. 이 배열은 VARCHAR2(15) 타입의 요소들을 담을 수 있으며, 각 요소는 최대 15자까지 가능하다는 뜻이다.
NESTED TABLE 변수 초기화
```
 names Roster := Roster('D Caruso', 'J Hamil', 'D Piro', 'R Singh');
```
names라는 변수를 선언하고 Roster 타입을 이용해 초기값을 설정하였다. 여기서는 네 명의 이름을 담고 있는 배열을 생성한다.
FIRST와 LAST 메소드 사용:
```
 FOR i IN names.FIRST .. names.LAST LOOP
     DBMS_OUTPUT.PUT_LINE(names(i));
 END LOOP;
```
- FIRST: 배열에서 첫 번째 요소의 인덱스를 반환한다. 예를 들어, names.FIRST는 1을 반환한다.
- LAST: 배열에서 마지막 요소의 인덱스를 반환합니다. 예를 들어, names.LAST는 4를 반환한다.
- names(i): 배열의 인덱스 i에 해당하는 값을 가져온다. 루프 내에서 인덱스를 1부터 4까지 순차적으로 증가시키며 값을 출력하게 된다.

FIRST와 LAST 메소드는 배열의 시작과 끝을 나타내는 메소드로, 이를 사용하여 배열의 처음부터 끝까지 순차적으로 접근한다.

결과 출력: FOR 루프를 통해 names 배열에 있는 각 요소를 출력한다. 이 배열의 크기와 요소는 FIRST와 LAST 메소드를 통해 동적으로 결정되며, 이 범위 내에서 배열의 모든 값을 출력한다.

👀VARAY와 NESTED TABLE의 차이점 간단 소개

VARRAY: 고정 크기, 밀집(dense)된 데이터, 순차적 처리
NESTED TABLE: 동적 크기, 희소(sparse) 데이터, 유연한 데이터 처리, 빈번한 삽입 및 삭제가 이루어지면 성능이 저하될 수 있다.

✅ Associative Array (연관 배열 또는 맵)

키와 값의 쌍: Associative Array는 키-값 쌍으로 데이터를 저장하는 맵(Map)과 유사한 데이터 구조이다. 예를 들어, (A, 30), (B, 50), (C, 20)와 같은 형태이다. 같은 데이터 타입을 가진 요소들로 구성된다.
- A, B, C는 키(인덱스)이며, 30, 50, 20은 값(Value)이 된다. 각각의 요소가 고유한 키에 의해 식별되며, 키를 통해 값을 빠르게 검색할 수 있다.
Index와 데이터 타입:
- Index는 PLS_INTEGER, BINARY_INTEGER, VARCHAR2 등의 데이터 타입을 사용할 수 있다.
- PLS_INTEGER는 PL/SQL에서 제공하는 특수한 정수 타입으로, 계산 속도가 빠르기 때문에 자주 사용된다.
- 키를 Index라고 부르기 때문에 Index-by 테이블 이라고도 한다.
동적 크기: Associative Array는 크기가 동적이어서, 필요한 만큼 요소를 추가하거나 제거할 수 있다.
배열 접근: 값을 참조할 때 인덱스를 사용하여 빠르게 접근할 수 있게 된다.

문법:

TYPE 타입명 IS TABLE OF 요소 데이터타입 [NOT NULL]
INDEX BY [PLS_INTEGER | BINARY_INTEGER | VARCHAR2(크기)];

예를 들어, 인덱스가 VARCHAR2인 경우는 문자열을 키로 사용하여 배열을 구성할 수 있다.

DECLARE
    TYPE AssociativeArray IS TABLE OF NUMBER INDEX BY VARCHAR2(20);  -- 키는 VARCHAR2, 값은 NUMBER
    scores AssociativeArray;
BEGIN
    scores('A') := 30;  -- 키 'A'에 30을 저장
    scores('B') := 50;  -- 키 'B'에 50을 저장
    scores('C') := 20;  -- 키 'C'에 20을 저장

    DBMS_OUTPUT.PUT_LINE('A: ' || scores('A'));  -- A의 값을 출력 -> 30
    DBMS_OUTPUT.PUT_LINE('B: ' || scores('B'));  -- B의 값을 출력 -> 50
    DBMS_OUTPUT.PUT_LINE('C: ' || scores('C'));  -- C의 값을 출력 -> 20
END;

인덱스 타입: PLS_INTEGER, BINARY_INTEGER, VARCHAR2 등을 키로 사용하며, 문자열을 인덱스로 사용하는 것도 가능하다.
유사한 구조: 자바의 Map과 유사하며, 인덱스를 통해 값을 빠르게 찾을 수 있다.
데이터베이스의 키-값 구조를 구현할 때 유용하며, 특정 키를 기준으로 빠르게 데이터를 처리해야 할 경우 적합하다.
예를 들어, 학생 이름을 점수와 연결하는 구조나 상품명을 가격과 연결하는 구조 등에서 사용할 수 있다.

✅ Associative Array (연관 배열 또는 맵) 사용예제

도시 이름을 인구 수와 연결하는 구조를 표현한 코드이다.

Associative Array 타입 정의
```
  TYPE population IS TABLE OF NUMBER INDEX BY VARCHAR2(64);
```
- population이라는 타입을 정의하고, NUMBER 타입 값을 저장하는 배열을 생성한다.
- 인덱스는 VARCHAR2(64) 타입으로 설정하여, 각 원소를 문자열(도시 이름)으로 식별한다.
변수 선언:
```
  city_population population;
```
- city_population이라는 변수를 선언하여 population 타입의 associative array를 만든다.
값 추가:
```
  city_population('Smallville') := 2000;
  city_population('Midland') := 750000;
  city_population('Megalopolis') := 1000000;
```
- city_population 배열에 도시 이름을 키로, 인구 수를 값으로 추가한다. 각 도시의 인구를 인덱스를 사용하여 저장한다.

✅ 콜렉션 메소드

콜렉션을 사용하면 배열의 크기를 동적으로 관리하거나, 여러 값을 효율적으로 저장하고 관리할 수 있게 된다. PL/SQL은 이러한 콜렉션을 위한 메소드도 제공하여 데이터를 쉽게 다룰 수 있게 해준다. 예를 들어, 배열에서 첫 번째 요소를 가져오거나, 크기를 세거나, 다음 요소로 이동하는 작업을 메소드로 처리할 수 있다.

주요 메소드 예시

FIRST: 첫 번째 요소를 반환
LAST: 마지막 요소를 반환
NEXT: 다음 요소로 이동
COUNT: 배열의 요소 수를 반환

PL/SQL 구성요소 #1: 레코드 (Record)

💡요약: 레코드 (Record)는 PL/SQL에서 여러 다른 타입의 데이터를 하나의 구조체로 묶을 수 있는 데이터 타입이다. 이를 통해 테이블의 한 행(row)을 그대로 PL/SQL에서 다룰 수 있으며, 테이블의 각 열(column)을 개별적으로 처리할 수 있게 된다. 구조체 (Structure)와 유사한 개념으로, 데이터베이스 테이블의 각 컬럼을 변수로 묶어 관리할 수 있다.

✅ 레코드의 주요 특징

복합 데이터 구조:

레코드는 여러 필드(field)로 구성된다. 각 필드는 다른 데이터 타입을 가질 수 있다. 예를 들어, 이름은 문자열로, 나이는 정수로, 날짜는 날짜 형식으로 지정할 수 있다.

테이블 또는 커서 행을 참조:

테이블에서 한 행을 읽어와서 저장할 때, ROWTYPE을 사용하여 해당 테이블의 행 구조와 동일한 레코드를 정의할 수 있다.
커서에서 한 행을 가져올 때도 ROWTYPE을 사용하여 커서의 구조와 일치하는 레코드를 정의할 수 있다.

✅레코드 정의 방법

사용자 정의 레코드: 이 방법은 특정 구조를 가진 레코드를 정의할 때 사용한다.

  TYPE 레코드이름 IS RECORD (
      필드1 데이터타입1,
      필드2 데이터타입2,
      ...
  );

테이블의 행을 레코드로 정의:
```
  레코드이름 테이블명%ROWTYPE;
```
테이블의 한 행과 동일한 구조를 가지는 레코드를 정의할 때 사용한다. 이 경우, 테이블의 컬럼명이 자동으로 레코드의 필드명이 된다.
커서의 행을 레코드로 정의: 커서에서 반환된 한 행의 구조를 레코드로 정의할 때 사용한다.
```
  레코드이름 커서명%ROWTYPE;
```

PL/SQL 구성요소 #2: 레코드 예제 1 (Example of Record)

💡요약: 이 예제에서는 PL/SQL 레코드 (PL/SQL Record) 의 사용법을 보여준다. 레코드는 여러 다른 데이터 타입을 하나로 묶어서 처리할 수 있게 해주는 구조체 (Structure) 이다. 주어진 예제에서는 DeptRecType이라는 이름의 레코드를 정의하고, 이를 dept_rec라는 변수에 할당하였다.

DECLARE
  TYPE DeptRecType IS RECORD (  -- 레코드 타입을 정의
    dept_id NUMBER(4) NOT NULL := 10,  -- 부서 ID (기본값: 10)
    dept_name VARCHAR2(30) NOT NULL := 'Administration',  -- 부서 이름 (기본값: 'Administration')
    mgr_id NUMBER(6) := 200,  -- 부서 관리자 ID (기본값: 200)
    loc_id NUMBER(4)  -- 위치 ID (기본값 없음)
  );

  dept_rec DeptRecType;  -- DeptRecType 타입의 변수 선언

BEGIN
DBMS_OUTPUT.PUT_LINE('dept_rec:');
DBMS_OUTPUT.PUT_LINE('---------');
DBMS_OUTPUT.PUT_LINE('dept_id: ' || dept_rec.dept_id);
DBMS_OUTPUT.PUT_LINE('dept_name: ' || dept_rec.dept_name);
DBMS_OUTPUT.PUT_LINE('mgr_id: ' || dept_rec.mgr_id);
DBMS_OUTPUT.PUT_LINE('loc_id: ' || dept_rec.loc_id);
END;

레코드 타입 정의 (TYPE DeptRecType IS RECORD):
- 이 부분에서 DeptRecType이라는 레코드 타입을 정의하고 있다. 레코드 타입은 하나의 변수로 여러 데이터를 묶을 수 있도록 해준다.
필드와 기본값 (dept_id, dept_name, mgr_id, loc_id):
- 각 레코드는 여러 필드(field)로 구성됩니다. 필드는 서로 다른 데이터 타입을 가질 수 있다. 예를 들어, dept_id는 숫자 타입이고, dept_name은 문자열 타입이다. 각 필드에는 기본값 (default value)이 설정되어 있다.
  - dept_id: 기본값 10 (부서 ID)
  - dept_name: 기본값 'Administration' (부서 이름)
  - mgr_id: 기본값 200 (부서 관리자 ID)
  - loc_id: 기본값 없음 (위치 ID) **기본값이 지정되지 않았기 때문에, 해당 값은 NULL이 된다. 출력 결과는 빈 문자열처럼 보이게 된다. 즉, 공백이 출력된다.
변수 선언 (dept_rec DeptRecType):
- dept_rec라는 변수는 DeptRecType 타입의 레코드 변수를 선언하는 부분이다. 이 변수는 나중에 데이터를 저장하는데 사용된다.
DBMS_OUTPUT.PUT_LINE:
- DBMS_OUTPUT.PUT_LINE은 PL/SQL에서 결과를 화면에 출력하는 명령이다. 이 명령을 사용해 변수의 값을 출력할 수 있.
점 연산자 (Dot operator):
- 레코드의 필드에 접근할 때 dept_rec.dept_id와 같이 점 연산자 (dot operator)를 사용한다. dept_rec는 레코드 변수이고, 그 뒤에 점(.)을 찍고 필드명을 입력하여 해당 필드의 값을 참조할 수 있다.
  - dept_rec.dept_id: 부서 ID
  - dept_rec.dept_name: 부서 이름
  - dept_rec.mgr_id: 부서 관리자 ID
  - dept_rec.loc_id: 위치 ID

PL/SQL 구성요소 #3: 레코드 예제 2 (Example of Record)

💡요약: 이 예제에서는 레코드 (Record)와 그 안에 또 다른 레코드가 포함된 구조를 설명하고 있다. 레코드는 데이터베이스 테이블의 한 행(row)과 비슷한 개념으로, 여러 개의 필드를 가지는 데이터 타입이다. 중첩된 레코드 (Nested Record) 는 하나의 레코드 안에 다른 레코드가 포함되는 구조를 의미한다.

TYPE name_rec IS RECORD (
    first employees.first_name%TYPE,  -- 직원의 이름
    last employees.last_name%TYPE    -- 직원의 성
);
TYPE contact IS RECORD (
    name name_rec, -- 중첩된 레코드
    phone employees.phone_number%TYPE  -- 전화번호
);
friend contact;

BEGIN
friend.name.first := 'John';
friend.name.last := 'Smith';
friend.phone := '1-650-555-1234';
DBMS_OUTPUT.PUT_LINE ( friend.name.first || ', ' ||
friend.name.last || ', ' || friend.phone );
END;

TYPE name_rec IS RECORD (
    first employees.first_name%TYPE,  -- 직원의 이름
    last employees.last_name%TYPE    -- 직원의 성
);

name_rec 레코드 타입: first와 last라는 두 필드를 가지며, 각각 employees 테이블의 first_name과 last_name 컬럼의 데이터 타입을 따른다.

TYPE contact IS RECORD (
    name name_rec, -- 중첩된 레코드
    phone employees.phone_number%TYPE  -- 전화번호
);

contact 레코드 타입:
- name이라는 필드를 가진다. 이 필드는 name_rec 레코드 타입을 사용하므로, **first**와 **last**를 가지게 된다.
- 또 하나의 필드인 phone은 직원의 전화번호를 나타내는 employees.phone_number 컬럼의 데이터 타입을 사용한다.

friend contact;  -- friend 변수 선언

friend 변수: friend는 contact 타입의 레코드 변수로, 이 변수는 name 필드 안에 **first**와 **last**를 포함하고, phone 필드에는 전화번호를 저장할 수 있다.

BEGIN
friend.name.first := 'John';
friend.name.last := 'Smith';
friend.phone := '1-650-555-1234';

레코드 값 할당:
- friend.name.first := 'John'; friend 레코드의 name 중 first 필드에 'John'이라는 값을 할당한다.
- friend.name.last := 'Smith'; friend 레코드의 name 중 last 필드에 'Smith'라는 값을 할당한다.
- friend.phone := '1-650-555-1234'; friend 레코드의 phone 필드에 전화번호 '1-650-555-1234'를 할당한다.

이렇게 세 가지 필드에 값을 할당하여 친구의 이름과 전화번호를 설정한다.

DBMS_OUTPUT.PUT_LINE ( friend.name.first || ', ' ||
friend.name.last || ', ' || friend.phone );
END;

값 출력: 이 부분은 **DBMS_OUTPUT.PUT_LINE**을 사용하여 레코드의 값을 출력한다.
- || 기호는 문자열을 이어 붙이는 (concatenation) 연산자입니다.
- friend.name.first, friend.name.last, friend.phone 값들이 이어져서 하나의 긴 문자열을 만든다.
- 예를 들어, 위 코드에서는 friend.name.first가 'John', friend.name.last가 'Smith', friend.phone이 '1-650-555-1234'이므로 출력되는 결과는 John, Smith, 1-650-555-1234가 된다.

3️⃣PL/SQL 제어문(Control Structures)

💡요약: PL/SQL에서 제어문은 프로그램 흐름을 제어하는 데 사용된다. 제어문은 조건문과 반복문으로 나뉜다. IF문은 조건에 맞춰 여러 가지 처리를 할 수 있는 제어문이고, CASE문은 여러 조건값을 비교하여 해당하는 처리문을 실행하는 제어문이다. 두 문법 모두 PL/SQL에서 자주 사용된다.

IF문은 주어진 조건이 참일 때 실행할 문을 결정하고, 추가적인 조건을 ELSIF로 확인할 수 있다. 마지막에는 ELSE로 모든 조건이 거짓일 때 실행될 문을 설정할 수 있다.
CASE문은 여러 조건을 한 번에 확인하고, 조건값에 맞는 처리문을 실행하는 방식이다. WHEN으로 조건을 비교하고, ELSE는 모든 조건이 일치하지 않을 때 실행된다.

✅IF문 (IF Statement)

IF문은 주어진 조건이 참일 때 특정 코드를 실행하는 제어문이다. 주로 조건에 맞는 처리를 할 때 사용된다.

IF 조건 THEN
    처리문1;
ELSIF 조건2 THEN
    처리문2;
...
ELSE
    처리문N;
END IF;

IF 뒤에 조건을 적고, 그 조건이 참일 때 처리할 내용을 THEN 아래에 작성한다.
ELSIF는 첫 번째 조건이 거짓일 때 추가적으로 또 다른 조건을 검사할 수 있게 해준다.
ELSE는 모든 조건이 거짓일 때 실행되는 코드를 작성한다.

예시: 이 예제는 판매 금액(sales)과 목표 금액(quota)을 비교하여 보너스(bonus)를 계산하는 로직을 보여준다. 사용된 제어문은 IF로, 주어진 조건에 따라 보너스 금액을 다르게 계산한다.

DECLARE --변수 선언 (Variable Declaration)
    sales NUMBER := 10100;  -- 판매 금액 (Sales amount)
    quota NUMBER := 10500;  -- 목표 금액 (Quota)
    bonus NUMBER := 0;      -- 보너스 금액 (Bonus)

BEGIN --  IF문 사용 (Using IF statement)
    IF sales > (quota + 200) THEN  -- 판매 금액이 목표 금액보다 200 이상 클 때
        bonus := (sales - quota) / 4;  -- 보너스는 목표 금액을 초과한 판매 금액의 1/4
    ELSE
        IF sales > quota THEN  -- 판매 금액이 목표 금액을 초과할 때
            bonus := 50;  -- 보너스는 50
        ELSE  -- 판매 금액이 목표 금액에 미치지 못할 때
            bonus := 0;  -- 보너스는 0
        END IF;
    END IF;
    DBMS_OUTPUT.PUT_LINE('bonus = ' || bonus);  -- 계산된 보너스 출력
END;

변수 선언 (Variable Declaration): sales: 실제 판매 금액 , quota: 목표 금액 , bonus: 보너스 금액으로, 기본값은 0으로 설정된다.
IF문 사용 (Using IF statement)
- 첫 번째 IF문은 판매 금액이 목표 금액을 200 이상 초과했을 때 보너스를 1/4로 계산한다. (sales - quota) / 4로 계산된 보너스를 bonus에 할당한다.
- 만약 첫 번째 조건이 거짓이라면, 두 번째 IF문이 실행된다. 이 경우 판매 금액이 목표 금액을 초과하면 보너스를 50으로 설정한다.
- 만약 두 번째 조건도 거짓이라면, ELSE 부분에서 보너스를 0으로 설정한다.
DBMS_OUTPUT.PUT_LINE을 사용하여 최종 보너스를 출력한다.

✅CASE문 (CASE Statement)

CASE문은 여러 조건을 동시에 처리하고, 조건값에 맞는 처리문을 실행하는 제어문이다. 각 조건을 WHEN과 함께 설정하여 사용한다.

plsqlCopy codeCASE 조건
    WHEN 조건값1 THEN 처리문1;
    WHEN 조건값2 THEN 처리문2;
    ...
    ELSE 처리문N;
END CASE;

WHEN은 주어진 조건값을 검사하고, 일치하는 조건값에 해당하는 코드를 실행합니다.
ELSE는 모든 조건에 맞지 않을 때 실행할 코드를 작성합니다.

예시: 이 예제는 GRADE 값에 따라 적절한 평가 메시지를 출력하는 CASE문을 보여준다. CASE문은 조건에 맞는 값을 선택하여 처리할 수 있는 구조이다.

DECLARE -- 변수 선언 (Variable Declaration)
grade CHAR(1);

BEGIN
grade := 'B';

CASE grade -- CASE문 사용 (Using CASE statement)
WHEN 'A' THEN DBMS_OUTPUT.PUT_LINE('Excellent');
WHEN 'B' THEN DBMS_OUTPUT.PUT_LINE('Very Good');
WHEN 'C' THEN DBMS_OUTPUT.PUT_LINE('Good');
WHEN 'D' THEN DBMS_OUTPUT.PUT_LINE('Fair');
WHEN 'F' THEN DBMS_OUTPUT.PUT_LINE('Poor');

ELSE DBMS_OUTPUT.PUT_LINE('No such grade');

END CASE;
END;

변수 선언 (Variable Declaration): 먼저, grade라는 학생의 성적을 나타내 변수를 선언하고 초기값으로 'B'를 설정한다.
CASE문 사용 (Using CASE statement): CASE문은 grade 값에 따라 각각 다른 출력을 한다. 성적이 'A'일 때는 Excellent, 'B'일 때는 Very Good 등의 메시지를 출력한다.
만약 grade 값이 'A', 'B', 'C', 'D', 'F' 중 어느 것도 아니면 ELSE를 사용하여 'No such grade' 메시지를 출력하게 된다.

CASE문은 grade 값을 체크하여 각 조건에 맞는 문장을 실행한다.
WHEN 뒤에 조건값을 적고, 조건이 맞으면 그 뒤의 THEN 절을 실행하게 된다.
조건에 맞는 것이 없으면 ELSE 절을 실행한다.

PL/SQL 구성요소 #4: 반복문 (Loops)

💡요약: PL/SQL에서 반복문을 사용하면 특정 조건을 만족할 때까지 동일한 작업을 반복할 수 있다. 가장 많이 사용되는 반복문은 FOR문이며, 그 외에 LOOP문과 WHILE문도 자주 사용된다.

💡LOOP

✅ LOOP문 (LOOP Statement) : LOOP문은 무한 반복문이다. 조건 없이 반복이 시작되고, 반복을 멈추기 위해서는 EXIT 조건을 명시해야 한다.

✅ LOOP문 예제 (LOOP Example)

DECLARE
    x NUMBER := 0;  -- 변수 x를 0으로 초기화
BEGIN
    LOOP
        DBMS_OUTPUT.PUT_LINE('Inside loop: x = ' || TO_CHAR(x));  -- x 값을 출력
        x := x + 1;  -- x 값 1 증가
        IF x > 3 THEN  -- x가 3보다 크면 반복문을 종료
            EXIT;  -- EXIT 조건이 맞으면 반복문을 종료
        END IF;
    END LOOP;
    DBMS_OUTPUT.PUT_LINE('After loop: x = ' || TO_CHAR(x));  -- 반복문 종료 후 x 값을 출력
END;

LOOP: 반복문을 시작한다. 이 안의 모든 코드가 반복 실행된다.
DBMS_OUTPUT.PUT_LINE: x 값을 출력하는 명령어이다.
x := x + 1: x 값을 1 증가시키는 연산이다.
IF x > 3 THEN EXIT;: x가 3보다 커지면 EXIT가 실행되어 반복문을 종료한다.
END LOOP;: LOOP문이 끝나는 부분이다.
DBMS_OUTPUT.PUT_LINE: 반복문 종료 후 x 값을 출력한다.

✅중요한 부분 요약 (Summary of Key Points)

LOOP문은 조건 없이 무한 반복되므로, 반드시 반복을 종료할 조건을 설정해야 한다.
EXIT 명령어를 사용하여 반복문을 강제로 종료할 수 있다.
LOOP문은 간단한 반복을 위해 사용된다.

💡EXIT WHEN 문 사용 (Using EXIT WHEN) LOOP문을 더 간단하게 표현할 수 있는 방법은 EXIT WHEN 조건을 사용하는 것이다. EXIT WHEN은 반복문을 특정 조건에서 자동으로 종료하도록 도와준다. 이를 통해 IF문을 간소화할 수 있게 되고 코드가 더 깔끔하고 읽기 쉽다.

DECLARE
    x NUMBER := 0;  -- 변수 x를 0으로 초기화
BEGIN
    LOOP
        DBMS_OUTPUT.PUT_LINE('Inside loop: x = ' || TO_CHAR(x));  -- x 값을 출력
        x := x + 1;  -- x 값 1 증가
        EXIT WHEN x > 3;  -- x가 3보다 크면 반복문 종료
    END LOOP;
    DBMS_OUTPUT.PUT_LINE('After loop: x = ' || TO_CHAR(x));  -- 반복문 종료 후 x 값을 출력
END;

EXIT WHEN x > 3;: 이 구문은 x가 3보다 크면 반복문을 종료한다. 이전 예제에서 IF문과 EXIT를 사용한 방식과 동일한 기능을 한다.
나머지 부분은 이전 예제와 동일하며, 반복문 안에서 x 값을 1씩 증가시키고, x > 3이면 반복문을 종료하게 된다
EXIT WHEN을 사용하면 조건을 간단하게 표현할 수 있으며, IF문 없이 바로 종료 조건을 지정할 수 있게 된다.
**EXIT WHEN**은 조건을 만족하면 즉시 반복문을 종료한다.

💡FOR Loop

✅FOR Loop: FOR문은 반복문 중 하나로, 인덱스 변수를 사용하여 정해진 범위 내에서 값을 증가시키거나 감소시키면서 반복된다. FOR문은 범위 내에서 인덱스가 자동으로 증가하거나 감소하므로 코드가 간결해지고 이해하기 쉬워진다. 가장 많이 쓰이는 loop 이다.

카운터: 반복문을 실행할 때 값을 가지고 있는 인덱스 변수이다.
최소값..최대값: 반복문이 시작될 범위와 종료될 범위를 나타낸다.
FOR문은 시작값부터 끝값까지 반복을 수행하며, 끝값을 포함하지 않으며, 자동으로 증가한다.

✅FOR문 예제 (FOR Loop Example)

BEGIN
    DBMS_OUTPUT.PUT_LINE('lower_bound < upper_bound');
    FOR i IN 1..3 LOOP  -- i는 1부터 3까지 반복
        DBMS_OUTPUT.PUT_LINE(i);  -- i 값을 출력
    END LOOP;

    DBMS_OUTPUT.PUT_LINE('lower_bound = upper_bound');
    FOR i IN 2..2 LOOP  -- i는 2부터 2까지 반복
        DBMS_OUTPUT.PUT_LINE(i);  -- i 값을 출력
    END LOOP;

    DBMS_OUTPUT.PUT_LINE('lower_bound > upper_bound');
    FOR i IN 3..1 LOOP  -- 실행되지 않음, 시작값이 끝값보다 큼
        DBMS_OUTPUT.PUT_LINE(i);  -- 실행되지 않음
    END LOOP;
END;

첫 번째 FOR문 (First FOR Loop):
- 범위가 1..3이므로 i는 1, 2, 3으로 반복된다.
- 출력 결과는 1, 2, 3이 차례로 출력된다.
두 번째 FOR문 (Second FOR Loop):
- 범위가 2..2로, i는 2로만 반복된다.
- 출력은 2만 출력된다.
세 번째 FOR문 (Third FOR Loop):
- 범위가 3..1로, 시작값이 끝값보다 크기 때문에 반복문이 실행되지 않게 된다.
- 결과적으로 출력되지 않는다.

✅ 중요한 부분 요약 (Summary of Key Points)

FOR문은 범위 내에서 자동으로 값을 증가시키며 반복한다.
시작값이 끝값보다 크면 반복문이 실행되지 않게 된다.
범위 내에서 지정된 값들만 반복되며, 끝값은 포함되지 않는다.

💡WHILE 문

✅ WHILE 문: WHILE문은 조건이 만족될 때까지 반복을 실행하는 구조이다. 조건이 true일 때만 반복이 실행되며, 조건이 false이면 반복을 종료한다. 이 예제는 두 개의 WHILE문을 사용하여 조건을 다르게 설정하고 있다.

✅ WHILE문 예제 (WHILE Loop Example)

DECLARE
    done BOOLEAN := FALSE;
BEGIN
    WHILE done LOOP
        DBMS_OUTPUT.PUT_LINE('This line does not print.');
        done := TRUE;  -- 이 할당은 실행되지 않습니다.
    END LOOP;

    WHILE NOT done LOOP
        DBMS_OUTPUT.PUT_LINE('Hello, world!');
        done := TRUE;
    END LOOP;
END;

첫 번째 WHILE문 (First WHILE Loop):
- done은 FALSE로 초기화되었기 때문에, 첫 번째 WHILE문은 조건이 false로 시작하여 실행되지 않게된다.
- done := TRUE;는 실행되지 않으며 WHILE문이 실행되지 않기 때문에, 해당 출력문은 출력되지 않게된다.
두 번째 WHILE문 (Second WHILE Loop):
- done은 첫 번째 WHILE문에서 변경되지 않아서 여전히 FALSE이다.
- NOT done은 TRUE가 되어 두 번째 WHILE문이 실행된다.
- DBMS_OUTPUT.PUT_LINE('Hello, world!');가 출력되며, done := TRUE;로 done이 TRUE로 변경된다.
- 그 후 조건이 false가 되어 두 번째 WHILE문은 종료된다.

✅ 중요한 부분 요약 (Summary of Key Points)

WHILE문은 조건이 true일 때만 실행된다.
done이 FALSE이면 첫 번째 WHILE문은 실행되지 않고, 두 번째 WHILE문에서만 실행된다.
done := TRUE;는 두 번째 WHILE문에서 실행되어 조건이 false로 변경되며 반복이 종료된다.

LOOP, FOR, WHILE 비교 (Comparison of LOOP, FOR, WHILE)

LOOP:
- 무한 루프로 시작하지만, 내부에서 EXIT 조건을 사용해 종료할 수 있다.
- 조건을 동적으로 설정할 수 있어 유연성이 높다.
FOR:
- 반복 횟수가 명확하게 주어질 때 사용된다.
- 코드가 간결하고 반복 횟수가 정해져 있어 코드 작성이 용이하다.
WHILE:
- 조건이 true일 때만 반복되며, 조건을 만족할 때까지 반복한다.
- 반복 조건이 false가 되면 반복이 종료되며, 조건 만족 여부를 기준으로 반복이 진행된다.

💡 정리 (Summary)

LOOP: 반복을 명시적으로 제어하고 싶을 때 사용.
FOR: 반복 횟수가 정해져 있을 때 사용.
WHILE: 조건에 맞을 때만 반복하고 싶을 때 사용.