Lucene search

GitHub Advisory DatabaseGHSA-H9J7-5XVC-QHG5

HistoryFeb 26, 2024 - 6:30 p.m.

langchain Server-Side Request Forgery vulnerability

2024-02-2618:30:29

CWE-918

GitHub Advisory Database

github.com

server-side request forgery

vulnerability

crawler

configuration

attacker

control

malicious html

download

prevent_outside

patch

CVSS3

3.7

Attack Vector

LOCAL

Attack Complexity

HIGH

Privileges Required

HIGH

User Interaction

REQUIRED

Scope

CHANGED

Confidentiality Impact

LOW

Integrity Impact

LOW

Availability Impact

NONE

CVSS:3.0/AV:L/AC:H/PR:H/UI:R/S:C/C:L/I:L/A:N

AI Score

Confidence

High

EPSS

0.001

Percentile

26.4%

JSON

With the following crawler configuration:

from bs4 import BeautifulSoup as Soup

url = "https://example.com"
loader = RecursiveUrlLoader(
    url=url, max_depth=2, extractor=lambda x: Soup(x, "html.parser").text 
)
docs = loader.load()

An attacker in control of the contents of https://example.com could place a malicious HTML file in there with links like “https://example.completely.different/my_file.html” and the crawler would proceed to download that file as well even though prevent_outside=True.

https://github.com/langchain-ai/langchain/blob/bf0b3cc0b5ade1fb95a5b1b6fa260e99064c2e22/libs/community/langchain_community/document_loaders/recursive_url_loader.py#L51-L51

Resolved in https://github.com/langchain-ai/langchain/pull/15559

Affected configurations

Vulners

Node

langchainlangchainRange<0.1.0

VendorProductVersionCPE
langchainlangchain*cpe:2.3:a:langchain:langchain:*:*:*:*:*:*:*:*

Vendor	Product	Version	CPE
langchain	langchain	*	cpe:2.3:a:langchain:langchain::::::::

References

github.com/advisories/GHSA-h9j7-5xvc-qhg5

github.com/langchain-ai/langchain/blob/bf0b3cc0b5ade1fb95a5b1b6fa260e99064c2e22/libs/community/langchain_community/document_loaders/recursive_url_loader.py#L51-L51

github.com/langchain-ai/langchain/commit/bf0b3cc0b5ade1fb95a5b1b6fa260e99064c2e22

github.com/langchain-ai/langchain/pull/15559

huntr.com/bounties/370904e7-10ac-40a4-a8d4-e2d16e1ca861

nvd.nist.gov/vuln/detail/CVE-2024-0243