Native WALA implementation of source code analysis tool for Enterprise Java Applications.
codeanalyzer extracts a comprehensive symbol table and call graph from Java applications
and emits them either as the canonical analysis.json, or as a Neo4j property graph
(--emit neo4j) — a graph.cypher snapshot or a live, incremental push over Bolt. See
§4. Neo4j graph output.
Grab the latest release jar and a codeanalyzer launcher (requires a Java 11+ runtime):
curl --proto '=https' --tlsv1.2 -LsSf https://github.com/codellm-devkit/codeanalyzer-java/releases/latest/download/codeanalyzer-installer.sh | sh
# or with wget:
wget -qO- https://github.com/codellm-devkit/codeanalyzer-java/releases/latest/download/codeanalyzer-installer.sh | shOverrides: CODEANALYZER_INSTALL_DIR (default ~/.local/bin), CODEANALYZER_VERSION (default latest).
Prefer to build from source? See §2. Building codeanalyzer.
Before you begin, ensure you have met the following requirements:
- You have a Linux/MacOS/WSL machine.
- You have installed the latest version of SDKMan!
-
Install SDKMan! Open your terminal and enter the following command:
curl -s "https://get.sdkman.io" | bash
Follow the on-screen instructions to complete the installation.
-
Open a new terminal or source the SDKMan! scripts:
source "$HOME/.sdkman/bin/sdkman-init.sh"
-
You can list all available GraalVM versions with:
sdk list java | grep semYou should see the following:
Semeru | | 21.0.2 | sem | | 21.0.2-sem | | 21.0.1 | sem | | 21.0.1-sem | | 17.0.10 | sem | | 17.0.10-sem | | 17.0.9 | sem | | 17.0.9-sem | | 11.0.22 | sem | installed | 11.0.22-sem | | 11.0.21 | sem | | 11.0.21-sem -
Install Java 11 or above (we'll go with 17.0.10-sem):
sdk install java 17.0.10-sem
-
Set Java 17 as the current Java version:
sdk use java 17.0.10-sem
Clone the repository (if you haven't already) and navigate into the cloned directory.
Run the Gradle wrapper script to build the project. This will compile the project using GraalVM native image.
./gradlew fatJarThe jar will be built at build/libs/codeanalyzer-<version>.jar. It may be used as follows:
Usage: codeanalyzer [-hvV] [--no-build] [--no-clean-dependencies]
[-a=<analysisLevel>] [-b=<build>] [-f=<projectRootPom>]
[-i=<input>] [-o=<output>] [-s=<sourceAnalysis>]
[--emit=<emit>] [--app-name=<appName>]
[--neo4j-uri=<uri>] [--neo4j-user=<user>]
[--neo4j-password=<password>] [--neo4j-database=<db>]
[-t=<targetFiles>]...
Analyze java application.
-i, --input=<input> Path to the project root directory.
-s, --source-analysis=<s> Analyze a single string of java source code instead
of the project.
-o, --output=<output> Destination directory to save the output graphs. By
default, the analysis JSON is printed to the console.
-b, --build-cmd=<build> Custom build command. Defaults to auto build.
--no-build Do not build your application (use if already built).
-a, --analysis-level=<n> Level of analysis: 1 (symbol table) or 2 (call graph).
Default: 1. Level 2 adds J_CALLS edges to the graph.
-t, --target-files=<f>... Restrict analysis to specific files (incremental).
--emit=<emit> Output target: json (analysis.json, default) |
neo4j (graph.cypher or live Bolt push) |
schema (the Neo4j schema.neo4j.json contract).
--app-name=<name> Logical application name for the graph :JApplication
anchor (default: input dir name).
--neo4j-uri=<uri> Push the graph to a live Neo4j over Bolt (incremental);
omit to write graph.cypher. Falls back to the
NEO4J_URI environment variable.
--neo4j-user=<user> Neo4j username (env: NEO4J_USERNAME, default: neo4j).
--neo4j-password=<pw> Neo4j password (env: NEO4J_PASSWORD, default: neo4j).
--neo4j-database=<db> Neo4j database name (env: NEO4J_DATABASE, default:
server default).
-v, --verbose Print logs to console.
-h, --help Show this help message and exit.
-V, --version Print version information and exit.3. Installing codeanalyzer as a native binary (once built, no JVM will be required for running codeanalyzer)
To install codeanalyzer, follow these steps:
-
You can list all available GraalVM versions with:
sdk list java | grep graal -
Install GraalVM 17 or above (we'll go with 21.0.2-graalce):
sdk install java 21.0.2-graalce
-
Set GraalVM 21 as the current Java version:
sdk use java 21.0.2-graalce
Clone the repository (if you haven't already) and navigate into the cloned directory.
Run the Gradle wrapper script to build the project. This will compile the project using GraalVM native image.
./gradlew nativeCompile -PbinDir=$HOME/.local/binNote: -PbinDir is optional. If not provided, this command places the binaries in build/bin.
Assuming the path you provided in -PbinDir (in my case $HOME/.local/bin) is in your $PATH, after installation, you can use codeanalyzer by following the below format:
Usage: codeanalyzer [-hqV] [-d=<appDeps>] [-e=<extraLibs>] -i=<input>
-o=<outDir>
Convert java binary (*.jar, *.ear, *.war) to a neo4j graph.
-d, --app-deps=<appDeps> Path to the application dependencies.
-e, --extra-libs=<extraLibs>
Path to the extra libraries.
-h, --help Show this help message and exit.
-i, --input=<input> Path to the input jar(s).
-o, --output=<outDir> Destination directory to save the output graphs.
-q, --quiet Don't print logs to console.
-V, --version Print version information and exit.There is a sample application in src/test/resources/sample_apps/daytrader8/binaries/. You can use this to test the tool.
codeanalyzer -i src/test/resources/sample_apps/daytrader8/binaries/ This will produce print the SDG on the console. Explore other flags to save the output to a JSON.
codeanalyzer can project the analysis IR into a Neo4j property graph instead
of analysis.json. The graph is a lossless projection of the IR: compilation units, types,
callables, fields, parameters, call sites, variables, enum constants, record components,
initialization blocks, CRUD operations/queries, comments, annotations and packages are all
first-class nodes and relationships, and (at -a 2) it adds J_CALLS edges from the call graph.
Every field of the Lombok entity model is represented (scalars as node properties — maps such as a
field's per-variable initializers are kept as a *_json property since Neo4j has no map type;
comments are :JComment nodes in addition to the convenience docstring property).
The full contract (node labels, their keys and typed properties, relationship types and endpoints,
plus the constraint/index DDL) lives in schema.neo4j.json and is visualized
in neo4j-schema.drawio. All node labels are J-prefixed and relationship
types J_-prefixed (e.g. :JType, :JCallable, J_CALLS) so a Java graph can share a Neo4j
database with the Python (Py*/PY_*) and TypeScript (TS*/TS_*) backends without colliding.
SCHEMA_VERSION is stamped onto the :JApplication node of every emitted graph.
codeanalyzer -i /path/to/project -a 2 --emit neo4j -o ./out
# → writes ./out/graph.cypher (a self-contained, re-runnable script)
cypher-shell -u neo4j -p <password> < ./out/graph.cypherThe snapshot is not incremental: it constraints, scopes-wipes this application's prior subgraph,
then UNWIND … MERGE-loads the full truth.
codeanalyzer -i /path/to/project -a 2 --emit neo4j \
--neo4j-uri bolt://localhost:7687 --neo4j-user neo4j --neo4j-password <password>The Bolt writer reads the database's current state and updates only what changed: it diffs each
compilation unit's content_hash, replaces just the changed units' subgraphs (idempotent
MERGE upserts), and — on a full run — prunes units whose source file vanished. Combine with
--target-files for a targeted, partial re-push (orphan pruning is then skipped).
codeanalyzer --emit schema -o ./out # → ./out/schema.neo4j.json (no project analysis needed)
codeanalyzer --emit schema # → prints the contract to stdoutA no-container conformance test (Neo4jSchemaConformanceTest) asserts the projector never emits a
label/relationship/property the catalog doesn't declare, and that schema.neo4j.json is current. A
Testcontainers-backed integration test (Neo4jBoltWriterTest) spins up a real Neo4j and exercises
the Bolt writer (full push, idempotent re-push, orphan pruning). The container suite is opt-in
(it needs Docker/Podman) and runs only when RUN_CONTAINER_TESTS is set:
RUN_CONTAINER_TESTS=1 ./gradlew test-
After making a few code changes, my native binary gives random exceptions. But, my code works perfectly with
java -jar.The
reflect-config.jsonis most likely out of date. Plese follow the below instructions:a. Build the fatjar using
./gradlew fatJarb. Run the following
java -agentlib:native-image-agent=config-output-dir=src/main/resources/META-INF/native-image-config -jar build/libs/codeanalyzer-1.0.jar -i src/test/resources/sample.applications/daytrader8/source -a 2 -v
c. Then build using the instructions in §3.3.
The problem should be resolved.
Copyright IBM Corporation 2023, 2024
Licensed under the Apache Public License 2.0, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
