Abstractions in LangChain-Core

2025-07-30

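So far we have covered almost all of the core objects in langchain-core. This post summarizes their abstraction hierarchies. That makes it easier to follow how objects are passed around and which call chains run inside an Agent, and to locate each component's core code quickly when reading the source.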

1. Prompts

1.1 Abstraction hierarchy

The Prompts hierarchy is as follows:

BasePromptTemplate
    # Single prompt
    StringPromptTemplate
        PromptTemplate
    ImagePromptTemplate
    PipelinePromptTemplate

    # Chat prompt
    BaseChatPromptTemplate
        ChatPromptTemplate
            StructuredPrompt


BaseMessagePromptTemplate
    _StringImageMessagePromptTemplate
        HumanMessagePromptTemplate
        AIMessagePromptTemplate
        SystemMessagePromptTemplate

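Here the PromptTemplate classes fall into two groups:

Single prompt: the basic unit of a prompt; handles the different prompt types
Chat prompt: combines multiple single prompts of various kinds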

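1.2 Top-level abstraction

BasePromptTemplate is the top-level abstract class. It inherits from RunnableSerializable and provides a default implementation of invoke. BasePromptTemplate has two main interface methods: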

format_prompt
format

class BasePromptTemplate(
    RunnableSerializable[dict, PromptValue], ABC, Generic[FormatOutputType]
):
    @abstractmethod
    def format_prompt(self, **kwargs: Any) -> PromptValue:
        """Create Prompt Value.

        Args:
            kwargs: Any arguments to be passed to the prompt template.

        Returns:
            PromptValue: The output of the prompt.
        """

    @abstractmethod
    def format(self, **kwargs: Any) -> FormatOutputType:
        """Format the prompt with the inputs.

        Args:
            kwargs: Any arguments to be passed to the prompt template.

        Returns:
            A formatted string.

        Example:

        .. code-block:: python

            prompt.format(variable1="foo")
        """

    @override
    def invoke(
        self, input: dict, config: Optional[RunnableConfig] = None, **kwargs: Any
    ) -> PromptValue:
        """Invoke the prompt.

        Args:
            input: Dict, input to the prompt.
            config: RunnableConfig, configuration for the prompt.

        Returns:
            PromptValue: The output of the prompt.
        """
        config = ensure_config(config)
        if self.metadata:
            config["metadata"] = {**config["metadata"], **self.metadata}
        if self.tags:
            config["tags"] = config["tags"] + self.tags
        return self._call_with_config(
            self._format_prompt_with_error_handling,
            input,
            config,
            run_type="prompt",
            serialized=self._serialized,
        )

    def _format_prompt_with_error_handling(self, inner_input: dict) -> PromptValue:
        inner_input_ = self._validate_input(inner_input)
        return self.format_prompt(**inner_input_)

The call chain is as follows:

input -> PromptValue
invoke
    _format_prompt_with_error_handling
        format_prompt

1.3 Implementation-level abstractions

Single prompts implement format_prompt on top of format, so their call chain is:

invoke
    _format_prompt_with_error_handling
        format_prompt
            format
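
The chain above can be sketched in plain Python. This is a minimal sketch, not the library's code: the Sketch* classes are hypothetical stand-ins for PromptValue, BasePromptTemplate and StringPromptTemplate, and the input check is a simplified assumption about what _validate_input does.

```python
from abc import ABC, abstractmethod


class SketchPromptValue:
    """Hypothetical stand-in for PromptValue: wraps the final text."""

    def __init__(self, text):
        self.text = text

    def to_string(self):
        return self.text


class SketchBasePrompt(ABC):
    """Hypothetical stand-in for BasePromptTemplate."""

    input_variables = []

    @abstractmethod
    def format_prompt(self, **kwargs):
        """Build a prompt value from the inputs."""

    @abstractmethod
    def format(self, **kwargs):
        """Build the formatted output from the inputs."""

    def invoke(self, input):
        # The real invoke also merges tags/metadata and goes through
        # _call_with_config; both are left out of this sketch.
        return self._format_prompt_with_error_handling(input)

    def _format_prompt_with_error_handling(self, inner_input):
        # Simplified assumption: only check that every variable is present.
        missing = set(self.input_variables) - set(inner_input)
        if missing:
            raise KeyError(f"missing input variables: {sorted(missing)}")
        return self.format_prompt(**inner_input)


class SketchStringPrompt(SketchBasePrompt):
    """Single prompt: format_prompt is built on top of format."""

    def __init__(self, template, input_variables):
        self.template = template
        self.input_variables = input_variables

    def format(self, **kwargs):
        return self.template.format(**kwargs)

    def format_prompt(self, **kwargs):
        return SketchPromptValue(self.format(**kwargs))


prompt = SketchStringPrompt("Tell me a joke about {topic}", ["topic"])
value = prompt.invoke({"topic": "cats"})
```

A subclass only has to supply format; invoke and format_prompt come along for free, which is why the string-based templates can stay small.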

Chat prompts do the reverse: format is implemented on top of format_prompt, and an additional abstract method, format_messages, is what format_prompt is built on. The call chains are therefore:

invoke
    _format_prompt_with_error_handling
        format_prompt
            format_messages
format
    format_prompt
        format_messages
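
The inverted dependency can be sketched as follows. The Sketch* classes are hypothetical stand-ins, messages are reduced to (role, content) tuples, and invoke skips the config and validation steps of the real implementation.

```python
from abc import ABC, abstractmethod


class SketchChatPromptValue:
    """Hypothetical stand-in for a chat prompt value: (role, content) pairs."""

    def __init__(self, messages):
        self.messages = messages

    def to_string(self):
        return "\n".join(f"{role}: {content}" for role, content in self.messages)


class SketchBaseChatPrompt(ABC):
    """Hypothetical stand-in for BaseChatPromptTemplate."""

    @abstractmethod
    def format_messages(self, **kwargs):
        """Return the list of formatted messages."""

    def format_prompt(self, **kwargs):
        # format_prompt is built on format_messages ...
        return SketchChatPromptValue(self.format_messages(**kwargs))

    def format(self, **kwargs):
        # ... and format is built on format_prompt.
        return self.format_prompt(**kwargs).to_string()

    def invoke(self, input):
        return self.format_prompt(**input)


class SketchChatPrompt(SketchBaseChatPrompt):
    def __init__(self, templates):
        self.templates = templates  # list of (role, template string)

    def format_messages(self, **kwargs):
        return [(role, text.format(**kwargs)) for role, text in self.templates]


chat = SketchChatPrompt([("system", "You are a {persona}."), ("human", "{question}")])
chat_value = chat.invoke({"persona": "poet", "question": "Hi"})
```

Here the only method a concrete chat prompt must write is format_messages; both invoke and format end up calling it.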

1.4 BaseMessagePromptTemplate

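Why does BaseMessagePromptTemplate exist?

arguments -> BasePromptTemplate -> PromptValue -> message/text
BaseMessagePromptTemplate accepts more general input, resolves it into different PromptTemplate types depending on that input, and produces a Message of a specific type (_msg_class determines which message class is created). In effect it is an entry point for building and aggregating prompts.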

class _StringImageMessagePromptTemplate(BaseMessagePromptTemplate):
    """Human message prompt template. This is a message sent from the user."""

    prompt: Union[
        StringPromptTemplate,
        list[Union[StringPromptTemplate, ImagePromptTemplate, DictPromptTemplate]],
    ]
    """Prompt template."""
    additional_kwargs: dict = Field(default_factory=dict)
    """Additional keyword arguments to pass to the prompt template."""

    _msg_class: type[BaseMessage]

    @classmethod
    def from_template(
        cls: type[Self],
        template: Union[
            str,
            list[Union[str, _TextTemplateParam, _ImageTemplateParam, dict[str, Any]]],
        ],
        template_format: PromptTemplateFormat = "f-string",
        *,
        partial_variables: Optional[dict[str, Any]] = None,
        **kwargs: Any,
    ) -> Self:
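
The role of _msg_class can be illustrated with a small sketch. All Sketch* names are hypothetical, only plain string templates are handled (no image or dict parts), and the real from_template takes more parameters than shown here.

```python
from dataclasses import dataclass


@dataclass
class SketchMessage:
    content: str


class SketchHumanMessage(SketchMessage):
    pass


class SketchSystemMessage(SketchMessage):
    pass


class SketchMessageTemplate:
    """Hypothetical stand-in for _StringImageMessagePromptTemplate (text only)."""

    # Subclasses override this to choose which message class gets created.
    _msg_class = SketchMessage

    def __init__(self, template):
        self.template = template

    @classmethod
    def from_template(cls, template):
        # The single entry point: turn general input into a template object.
        return cls(template)

    def format(self, **kwargs):
        return self._msg_class(content=self.template.format(**kwargs))


class SketchHumanTemplate(SketchMessageTemplate):
    _msg_class = SketchHumanMessage


class SketchSystemTemplate(SketchMessageTemplate):
    _msg_class = SketchSystemMessage


human_msg = SketchHumanTemplate.from_template("Hello {name}").format(name="Ada")
system_msg = SketchSystemTemplate.from_template("Be brief.").format()
```

The formatting logic lives once in the base class; each subclass differs only in the class attribute that names its message type.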

2. Language Model

2.1 Abstraction hierarchy

The Language Model hierarchy is as follows:

BaseLanguageModel
    BaseLLM
    BaseChatModel

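2.2 Top-level abstraction

BaseLanguageModel is the top-level abstraction. It inherits from RunnableSerializable but does not provide a default implementation of invoke. BaseLanguageModel has three main interface methods: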

generate_prompt
agenerate_prompt
invoke

class BaseLanguageModel(
    RunnableSerializable[LanguageModelInput, LanguageModelOutputVar], ABC
):
    @abstractmethod
    def generate_prompt(
        self,
        prompts: list[PromptValue],
        stop: Optional[list[str]] = None,
        callbacks: Callbacks = None,
        **kwargs: Any,
    ) -> LLMResult:
        ...  # docstring omitted in this excerpt

    @abstractmethod
    async def agenerate_prompt(
        self,
        prompts: list[PromptValue],
        stop: Optional[list[str]] = None,
        callbacks: Callbacks = None,
        **kwargs: Any,
    ) -> LLMResult:
        ...  # docstring omitted in this excerpt

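2.3 Implementation-level abstractions

BaseLLM and BaseChatModel are the classes that concrete models inherit from. The methods a subclass supplies are listed below (_generate and _llm_type are abstract; _stream is an optional override for streaming support):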

_generate
_llm_type
_stream

The call chains are as follows:

# BaseLLM
# LanguageModelInput = Union[PromptValue, str, Sequence[MessageLikeRepresentation]]
LanguageModelInput -> str
invoke
    generate_prompt            
        generate
            _generate_helper
                _generate
            dict
                _llm_type

stream:
    _stream

# BaseChatModel
LanguageModelInput -> Message
invoke:
    generate_prompt
        generate
            _get_invocation_params
                dict
                    _llm_type
            _generate_with_cache
                _generate
stream:
    _stream

with_structured_output:
    bind_tools
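
The BaseLLM chain can be sketched like this. SketchBaseLLM and EchoLLM are hypothetical stand-ins; results are plain lists of strings instead of LLMResult, and callbacks, caching and _generate_helper are left out.

```python
from abc import ABC, abstractmethod


class SketchBaseLLM(ABC):
    """Hypothetical stand-in for BaseLLM."""

    @property
    @abstractmethod
    def _llm_type(self):
        """Short identifier for the model implementation."""

    @abstractmethod
    def _generate(self, prompts, stop=None):
        """Return one completion per prompt string."""

    def generate(self, prompts, stop=None):
        # The real generate also handles callbacks and caching.
        return self._generate(prompts, stop=stop)

    def generate_prompt(self, prompts, stop=None):
        # Prompt values are turned into strings before generation.
        return self.generate([str(p) for p in prompts], stop=stop)

    def invoke(self, input, stop=None):
        # One input in, the single completion out.
        return self.generate_prompt([input], stop=stop)[0]


class EchoLLM(SketchBaseLLM):
    """Toy model that echoes its prompt; no network access is needed."""

    @property
    def _llm_type(self):
        return "echo"

    def _generate(self, prompts, stop=None):
        return [f"echo: {p}" for p in prompts]


answer = EchoLLM().invoke("hi")
```

A provider integration follows the same shape: it fills in _generate and _llm_type, and everything above that level is inherited.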

3. Output

3.1 Abstraction hierarchy

Generation
    GenerationChunk
    ChatGeneration
        ChatGenerationChunk
ChatResult
LLMResult

Output represents what a model returns:

Generation: the simplest text output
ChatGeneration: a single output from a Chat Model; it holds one Message carrying the content
ChatResult: the result of a Chat Model call
LLMResult: the result of an LLM call
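
As a rough picture of how these objects nest, here is a sketch using dataclasses. The Sketch* classes are hypothetical and carry far fewer fields than the real ones; the one detail taken from the library is that an LLM result holds one list of candidate generations per input prompt.

```python
from dataclasses import dataclass, field


@dataclass
class SketchGeneration:
    """Hypothetical stand-in for Generation: just the generated text."""

    text: str


@dataclass
class SketchChatGeneration(SketchGeneration):
    """Hypothetical stand-in for ChatGeneration: text plus one message."""

    message: tuple = ("ai", "")  # (role, content), a simplification


@dataclass
class SketchLLMResult:
    """Hypothetical stand-in for LLMResult: one candidate list per prompt."""

    generations: list = field(default_factory=list)


result = SketchLLMResult(
    generations=[
        [SketchGeneration("Paris")],  # candidates for the first prompt
        [SketchGeneration("Rome")],   # candidates for the second prompt
    ]
)
# Take the top candidate for each prompt.
first_texts = [candidates[0].text for candidates in result.generations]
chat_gen = SketchChatGeneration(text="Hi", message=("ai", "Hi"))
```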

4. Output Parser

4.1 Abstraction hierarchy

BaseLLMOutputParser
    BaseGenerationOutputParser
    BaseOutputParser
        BaseTransformOutputParser
            StrOutputParser
            BaseCumulativeTransformOutputParser
                JsonOutputParser

4.2 Top-level abstraction

BaseLLMOutputParser is the top-level abstraction; it defines the abstract method parse_result.

class BaseLLMOutputParser(ABC, Generic[T]):
    """Abstract base class for parsing the outputs of a model."""

    @abstractmethod
    def parse_result(self, result: list[Generation], *, partial: bool = False) -> T:
        ...  # docstring omitted in this excerpt

4.3 Implementation-level abstractions

BaseGenerationOutputParser

Union[str, BaseMessage] -> T
invoke
    parse_result

BaseOutputParser

Union[str, BaseMessage] -> T
invoke
    parse_result
        parse
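
The BaseOutputParser chain can be sketched as below. SketchBaseOutputParser and CommaListParser are hypothetical names, and generations are reduced to plain strings rather than Generation objects.

```python
from abc import ABC, abstractmethod


class SketchBaseOutputParser(ABC):
    """Hypothetical stand-in for BaseOutputParser."""

    @abstractmethod
    def parse(self, text):
        """Turn a single output string into the target type."""

    def parse_result(self, result):
        # Only the first candidate generation is parsed.
        return self.parse(result[0])

    def invoke(self, input):
        # Wrap the raw model output as a one-element candidate list.
        return self.parse_result([input])


class CommaListParser(SketchBaseOutputParser):
    """Toy parser: splits a comma-separated string into a list."""

    def parse(self, text):
        return [item.strip() for item in text.split(",")]


items = CommaListParser().invoke("a, b ,c")
```

A parser that needs every candidate would override parse_result directly, which is the level BaseGenerationOutputParser works at.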

5. Chain

For chains there is only the Chain class itself, which defines the following interface:

    @property
    @abstractmethod
    def input_keys(self) -> list[str]:
        """Keys expected to be in the chain input."""

    @property
    @abstractmethod
    def output_keys(self) -> list[str]:
        """Keys expected to be in the chain output."""

    @abstractmethod
    def _call(
        self,
        inputs: dict[str, Any],
        run_manager: Optional[CallbackManagerForChainRun] = None,
    ) -> dict[str, Any]:
        pass

Chain implements the RunnableSerializable interface, and its invoke call chain is as follows:

dict[str, Any] -> dict[str, Any]
invoke
    _call
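
A minimal sketch of this interface follows. SketchChain and UppercaseChain are hypothetical; the input check and the merging of inputs into the returned dict are simplifications of what the real invoke does, and callbacks and memory are left out.

```python
from abc import ABC, abstractmethod


class SketchChain(ABC):
    """Hypothetical stand-in for Chain."""

    @property
    @abstractmethod
    def input_keys(self):
        """Keys expected to be in the chain input."""

    @property
    @abstractmethod
    def output_keys(self):
        """Keys expected to be in the chain output."""

    @abstractmethod
    def _call(self, inputs):
        """Run the chain logic and return the output dict."""

    def invoke(self, input):
        missing = set(self.input_keys) - set(input)
        if missing:
            raise ValueError(f"missing input keys: {sorted(missing)}")
        outputs = self._call(input)
        # Simplification: return the inputs merged with the outputs.
        return {**input, **outputs}


class UppercaseChain(SketchChain):
    @property
    def input_keys(self):
        return ["text"]

    @property
    def output_keys(self):
        return ["upper"]

    def _call(self, inputs):
        return {"upper": inputs["text"].upper()}


chain_result = UppercaseChain().invoke({"text": "hi"})
```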

6. Tool

6.1 Abstraction hierarchy

The Tool hierarchy is as follows:

BaseTool
    Tool
    StructuredTool

6.2 Top-level abstraction

BaseTool is the top-level abstraction. It inherits from RunnableSerializable and provides a default implementation of invoke. BaseTool has one main interface method:

_run

The call chain is as follows:

invoke
    run
        _run

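6.3 Implementation-level abstractions

Tool and StructuredTool both wrap a plain function as a tool. The difference is that Tool only handles a single argument. The call chain is as follows: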

invoke
    run
        _run
            func call
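
The difference between the two wrappers can be sketched as follows. The Sketch* classes are hypothetical, and the input handling in run (a dict becomes keyword arguments, anything else a single positional argument) is a simplified assumption; schema validation and callbacks are left out.

```python
from abc import ABC, abstractmethod


class SketchBaseTool(ABC):
    """Hypothetical stand-in for BaseTool."""

    @abstractmethod
    def _run(self, *args, **kwargs):
        """Do the actual work of the tool."""

    def run(self, tool_input):
        # Simplified assumption: dict -> keyword arguments,
        # anything else -> one positional argument.
        if isinstance(tool_input, dict):
            return self._run(**tool_input)
        return self._run(tool_input)

    def invoke(self, input):
        return self.run(input)


class SketchTool(SketchBaseTool):
    """Wraps a function that takes exactly one argument."""

    def __init__(self, func):
        self.func = func

    def _run(self, arg):
        return self.func(arg)


class SketchStructuredTool(SketchBaseTool):
    """Wraps a function that takes several named arguments."""

    def __init__(self, func):
        self.func = func

    def _run(self, **kwargs):
        return self.func(**kwargs)


def add(a, b):
    return a + b


shout = SketchTool(str.upper).invoke("hi")
total = SketchStructuredTool(add).invoke({"a": 1, "b": 2})
```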

7. Agent

Top-level abstraction

The Agent abstraction is fairly simple: the only top-level abstractions are BaseMultiActionAgent and BaseSingleActionAgent, and the key abstract method is plan.

BaseMultiActionAgent
    RunnableMultiActionAgent
BaseSingleActionAgent
    RunnableAgent
    LLMSingleActionAgent
    Agent
        ChatAgent

Agent execution is implemented in AgentExecutor, which inherits from Chain, so its invoke call chain is as follows:

dict[str, Any] -> dict[str, Any]
invoke
    _call
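
To show where plan fits into _call, here is a sketch of a plan/act loop. Everything in it is hypothetical: the tuple-based decisions stand in for the library's action and finish objects, tools are plain callables, and the loop is a simplified assumption about how an executor drives an agent.

```python
from abc import ABC, abstractmethod


class SketchSingleActionAgent(ABC):
    """Hypothetical stand-in for BaseSingleActionAgent."""

    @abstractmethod
    def plan(self, intermediate_steps, **kwargs):
        """Return ("action", tool_name, tool_input) or ("finish", output)."""


class SketchAgentExecutor:
    """Hypothetical stand-in for AgentExecutor; tools are plain callables."""

    def __init__(self, agent, tools, max_iterations=5):
        self.agent = agent
        self.tools = tools
        self.max_iterations = max_iterations

    def _call(self, inputs):
        steps = []
        for _ in range(self.max_iterations):
            decision = self.agent.plan(steps, **inputs)
            if decision[0] == "finish":
                return {"output": decision[1]}
            _, tool_name, tool_input = decision
            observation = self.tools[tool_name](tool_input)
            # Record what was done and what came back for the next plan call.
            steps.append(((tool_name, tool_input), observation))
        return {"output": "stopped: iteration limit reached"}

    def invoke(self, input):
        return self._call(input)


class AddOnceAgent(SketchSingleActionAgent):
    """Toy agent: call the add_one tool once, then finish with its result."""

    def plan(self, intermediate_steps, **kwargs):
        if not intermediate_steps:
            return ("action", "add_one", kwargs["number"])
        _, observation = intermediate_steps[-1]
        return ("finish", observation)


executor = SketchAgentExecutor(AddOnceAgent(), {"add_one": lambda n: n + 1})
agent_result = executor.invoke({"number": 41})
```

The agent only decides the next step; running tools and feeding observations back is the executor's job, which is why plan is the method an agent has to supply.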
