Crawl
Open-source pre-migration intelligence for enterprise data infrastructure.
They catalog your data. Crawl tells you what breaks when you migrate.
Extract business logic from stored procedures, ETL jobs, and warehouse views — the undocumented rules buried in your data stack that block every migration project. Open-source, vendor-neutral, works with any LLM provider.
What Crawl does
Input: a 200-line stored procedure that nobody on the team wrote.
sp_calculate_customer_churn (confidence: HIGH)
├── Rule 1: Customers inactive >90 days flagged as at-risk
├── Rule 2: Churn score weighted by lifetime value (dim_customer)
├── Rule 3: ⚠️ References dim_product_v2 — TABLE DROPPED 2022-06-14
├── Rule 4: Monthly aggregation via vendor-specific DATEADD syntax
└── Triage: CRITICAL (12 downstream deps) | MEDIUM migration risk

Contradictions found:
└── Rule 2 conflicts with sp_calculate_ltv line 47 (different LTV formula)
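The dropped-table flag on Rule 3 comes down to comparing the tables a procedure references against the tables that still exist. A minimal sketch of that check — names and structure are illustrative, not Crawl's actual implementation:

```python
def find_dead_references(referenced_tables, catalog_tables):
    """Return referenced tables that no longer exist in the live catalog."""
    return sorted(set(referenced_tables) - set(catalog_tables))

# Tables the extracted rules mention vs. tables the catalog still has.
referenced = ["dim_customer", "dim_product_v2", "fact_orders"]
live = ["dim_customer", "fact_orders", "dim_product_v3"]

print(find_dead_references(referenced, live))  # ['dim_product_v2']
```

A set difference is all the core check needs; the hard part in practice is producing the `referenced` list reliably, which is what the extraction step is for.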
CLI commands
crawl scan      Discover stored procs, views, functions
crawl extract   Extract business rules via hybrid AST + LLM analysis
crawl triage    Score by criticality, complexity, and migration risk
crawl diff      Compare logic between environments or time periods
crawl export    Output to dbt-docs YAML, JSON, or Markdown
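To give a flavor of the deterministic half of `crawl extract`'s hybrid analysis, here is a deliberately simplified stand-in that pulls table references out of a procedure body with a regex. Crawl's real analysis is AST-based; this sketch and its names are assumptions for illustration only:

```python
import re

def extract_table_refs(sql: str) -> set:
    """Rough stand-in for AST-based reference extraction:
    collect identifiers that follow FROM or JOIN keywords."""
    pattern = re.compile(r"\b(?:FROM|JOIN)\s+([A-Za-z_][\w.]*)", re.IGNORECASE)
    return set(pattern.findall(sql))

proc_body = """
    SELECT c.customer_id, DATEADD(month, -3, GETDATE())
    FROM dim_customer c
    JOIN fact_orders o ON o.customer_id = c.customer_id
"""
print(sorted(extract_table_refs(proc_body)))  # ['dim_customer', 'fact_orders']
```

A regex breaks down on subqueries, CTEs, and dynamic SQL, which is exactly why a real parser plus LLM interpretation is needed for the vendor dialects Crawl targets.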
The problem
Every cloud migration hits the same wall: thousands of stored procedures and ETL jobs encoding business rules in vendor-specific dialects that nobody documented. Migration tools can translate your SQL, but they can't tell you what it means — or whether it's even still relevant.
Crawl is Step 0: the pre-migration intelligence layer that runs before you use Datafold, Lakebridge, dbt, or SnowConvert.
Questions Crawl answers
—What do we have? Inventory with auto-generated business-rule summaries
—What does it do? Human-readable logic, not just column lineage
—Is it still alive? Dead code detection, contradiction flagging
—What should we migrate first? Triage by criticality, complexity, risk
—What breaks if we move? Vendor-specific logic that won't survive a platform change
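A triage score like the one in the example output can be built from a few measurable signals. The weighting below is a hypothetical heuristic to show the shape of the idea — it is not Crawl's actual formula:

```python
from dataclasses import dataclass

@dataclass
class ProcStats:
    downstream_deps: int    # objects that read this proc's output (criticality)
    loc: int                # lines of code, a rough complexity proxy
    vendor_constructs: int  # e.g. DATEADD, CONNECT BY (migration risk)

def triage_score(s: ProcStats) -> float:
    """Illustrative weighting only; each signal is capped at 1.0."""
    criticality = min(s.downstream_deps / 10, 1.0)
    complexity = min(s.loc / 500, 1.0)
    migration_risk = min(s.vendor_constructs / 5, 1.0)
    return round(0.5 * criticality + 0.2 * complexity + 0.3 * migration_risk, 2)

# The 200-line churn proc from the example: 12 downstream deps,
# a handful of vendor-specific constructs.
churn_proc = ProcStats(downstream_deps=12, loc=200, vendor_constructs=3)
print(triage_score(churn_proc))  # 0.76
```

Sorting an inventory by a score like this is what turns "thousands of procedures" into an ordered migration backlog.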
How it works
YOUR LEGACY DATA STACK
(stored procs, ETL, views)
│
▼
┌──────────────┐
│ CRAWL │ ← Step 0: Understand & Triage
│ (open-source) │
└──────┬───────┘
│
│ outputs: business rules, triage scores,
│ migration risk, dbt-compatible docs
│
┌─────┴─────┬──────────────┬─────────────────┐
▼ ▼ ▼ ▼
┌──────┐ ┌────────┐ ┌───────────┐ ┌──────────────┐
│ dbt │ │Datafold│ │Lakebridge │ │ SnowConvert │
└──────┘ └────────┘ └───────────┘ └──────────────┘
 Step 1     Step 1       Step 1         Step 1

Design principles
Enterprise safety
Crawl is designed to connect to enterprise databases safely.
—Read-only, always. No writes, no DDL, no DML. Read-only transaction mode enforced.
—Catalog-only access. Reads stored procedure source code from system catalogs. Never queries user table contents.
—Query allowlisting. Every SQL query is hardcoded and auditable. No dynamic SQL.
—Non-production recommended. Stored procedure source code is identical in staging — there's no reason to connect to prod.
Supported sources
| Source | Status |
|---|---|
| Oracle Data Integrator (ODI) | Supported |
| Informatica PowerCenter / IICS | In Development |
| Snowflake (views, UDFs, procs, tasks) | Planned |
| SQL Server stored procedures | Planned |
| Oracle PL/SQL | Planned |
| PostgreSQL stored procedures | Planned |
| dbt models | Planned |
Built by Digital Rain Technologies. Founded by Augustin Chan, former Development Architect at Informatica (12 years, Fortune 500 data integration across APAC/MENA/Europe).