At a minimum, development of TinkerPop requires Java 8 but it is
preferable to use Java 11 cross-compiled to Java 8 (the
cross-compilation happens automatically as part of the build). Starting with 3.7.0, you can also build with
Java 17. Note, however, that there are some issues with deep reflection
so there are --add-opens
JVM options included in the pom files to enable this to work. Maven (requiring a minimum of
Maven 3.5.3+) is used as the common build system, which even
controls the builds of non-JVM GLVs
such as gremlin-python
. Java and Maven are described as a "minimum" for a development environment, because they
will only build JVM portions of TinkerPop and many integration tests will not fire with this simple setup. It is
possible to get a clean and successful build with this minimum, but it will not be possible to build non-JVM aspects
of the project and those will go untested.
To gain the ability to execute all aspects of the TinkerPop build system, other environmental configurations must be established. Those prerequisites are defined in the following subsections.
As of TinkerPop 3.5.5, environment configuration for Gremlin Language Variants can be optional, as Docker becomes a system requirement to build and test GLVs inside of Maven. Please make sure Docker is installed and running on your system. You will need to install both Docker Engine and Docker Compose, which are included in Docker Desktop.
Important
|
Use Java 11 for documentation generation with bin/process-docs.sh and for other build features outside
of the basic mvn clean install sort of function.
|
Important
|
For those who intend to offer a contribution, building with a minimal configuration may not be sufficient when submitting a pull request. Consider setting up the full environment. |
Tip
|
Consider using SDKMAN! to manage Java and Maven versions and environments. |
Note
|
For those using Windows, efforts have been made to keep the build OS independent, but, in practice, it is likely
that TinkerPop’s build system will only allow for a minimum build at best. Refer to Building On Windows section for more details. |
Groovy is not used in the standard build, but when generating documentation it does require the loading of a Gremlin Console instance. The Gremlin Console is Groovy-based and the documentation bootstrapping loads TinkerPop plugins which requires proper configuration of Graph/Ivy dependency loaders as described in the Gremlin Applications Section of the Reference Documentation.
The base configuration described in that link may need to be modified if there is a desire to work with the Gremlin
Console (for documentation generation or just general testing) in a way that utilizes SNAPSHOT releases in the
Apache Snapshots Repository. In that case, the grapeConfig.xml
will need to include a resolver for that repository
and the basic Ivy configuration will look as follows:
<ivysettings>
<settings defaultResolver="downloadGrapes"/>
<resolvers>
<chain name="downloadGrapes" returnFirst="true">
<filesystem name="cachedGrapes">
<ivy pattern="${user.home}/.groovy/grapes/[organisation]/[module]/ivy-[revision].xml"/>
<artifact pattern="${user.home}/.groovy/grapes/[organisation]/[module]/[type]s/[artifact]-[revision](-[classifier]).[ext]"/>
</filesystem>
<ibiblio name="localm2" root="${user.home.url}/.m2/repository/" checkmodified="true" changingPattern=".*" changingMatcher="regexp" m2compatible="true"/>
<ibiblio name="jcenter" root="https://jcenter.bintray.com/" m2compatible="true"/>
<ibiblio name="ibiblio" m2compatible="true"/>
<ibiblio name="apache-snapshots" root="http://repository.apache.org/snapshots/" m2compatible="true"/>
</chain>
</resolvers>
</ivysettings>
The above configuration is just a modification of the default. Perhaps the most lean common configuration might just be:
<ivysettings>
<settings defaultResolver="downloadGrapes"/>
<resolvers>
<chain name="downloadGrapes">
<ibiblio name="local" root="file:${user.home}/.m2/repository/" m2compatible="true"/>
<ibiblio name="central" root="https://repo1.maven.org/maven2/" m2compatible="true"/>
</chain>
</resolvers>
</ivysettings>
In the above case, the configuration largely relies on the standard Maven builds to create a well cached .m2
directory. Under typical development circumstances, SNAPSHOT will find themselves deployed there locally and that
is all that will be required for Grape to do its work.
As a final word, it is important to take note of the order used for these references as Grape will check them in the order they are specified and depending on that order, an artifact other than the one expected may be used which is typically an issue when working with SNAPSHOT dependencies.
The documentation generation process is not Maven-based and uses shell scripts to process the project’s asciidoc. The scripts should work on Mac and Linux. Javadocs should be built using Java 11.
Tip
|
We recommend performing documentation generation on Linux. For the scripts to work on Mac, you will need to
install GNU versions of the utility programs via homebrew , e.g.grep , awk , sed , findutils , and diffutils .
|
To generate documentation, it is required that Hadoop 3.3.x is running in
pseudo-distributed
mode. Be sure to set the HADOOP_GREMLIN_LIBS
environment variable as described in the
reference documentation. It is also important
to set the CLASSPATH
to point at the directory containing the Hadoop configuration files, like mapred-site.xml
.
The /etc/hadoop/yarn-site.xml
file prefers this configuration over the one provided in the Hadoop documentation
referenced above:
<configuration>
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.vmem-check-enabled</name>
<value>false</value>
</property>
<property>
<name>yarn.nodemanager.vmem-pmem-ratio</name>
<value>4</value>
</property>
</configuration>
The /etc/hadoop/mapred-site.xml
file prefers the following configuration:
<configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapred.map.tasks</name>
<value>4</value>
</property>
<property>
<name>mapred.reduce.tasks</name>
<value>4</value>
</property>
<property>
<name>mapreduce.job.counters.limit</name>
<value>1000</value>
</property>
<property>
<name>mapreduce.jobtracker.address</name>
<value>localhost:9001</value>
</property>
<property>
<name>mapreduce.map.memory.mb</name>
<value>2048</value>
</property>
<property>
<name>mapreduce.reduce.memory.mb</name>
<value>4096</value>
</property>
<property>
<name>mapreduce.map.java.opts</name>
<value>-Xmx2048m</value>
</property>
<property>
<name>mapreduce.reduce.java.opts</name>
<value>-Xmx4096m</value>
</property>
</configuration>
Also note that awk version 4.0.1
is required for documentation generation.
The YARN recipe also uses the zip
program to
create an archive so that needs to be installed, too, if you don’t have it already.
The Hadoop 3.3.x installation instructions call for installing pdsh
but installing that seems to cause permission
problems when executing sbin/start-dfs.sh
. Skipping that prerequisite seems to solve the problem.
Documentation can be generated locally with:
bin/process-docs.sh
Documentation is generated to the target/docs
directory. It is also possible to generate documentation locally with
Docker. docker/build.sh -d
.
Note
|
The installation of plugins sometimes fails in this step with the error: Error grabbing grapes - download
failed . It often helps in this case to delete the directories for the dependencies that cannot be downloaded
in the .m2 (~/.m2/ ) and in the grapes (~/.groovy/grapes/ ) cache. E.g., if the error is about
asm#asm;3.2!asm.jar , then remove the asm/asm sub directory in both directories.
|
Note
|
Unexpected failures with OLAP often point to a jar conflict that arises in scenarios where Hadoop or Spark
dependencies (or other dependencies for that matter) are modified and conflict. It is not picked up by the enforcer
plugin because the inconsistency arises through plugin installation in Gremlin Console at document generation time.
Making adjustments to the various paths by way of the <manifestEntries> on the jar given the functionality provided
by the DependencyGrabber class which allows you to manipulate (typically deleting conflicting files from /lib and
/plugin ) plugin loading will usually resolve it, though it could also be a more general environmental problem with
Spark or Hadoop. The easiest way to see the error is to simply run the examples in the Gremlin Console which more
plainly displays the error than the failure of the documentation generation process.
|
To generate the web site locally, there is no need for any of the above infrastructure. Site generation is a simple shell script:
bin/generate-home.sh
The site will be generated to the target/site/home
directory.
As of TinkerPop 3.2.2, the build optionally requires Python to build the gremlin-python
module. If Python is not installed, TinkerPop will still build with Maven, but native Python tests and
Java tests that require Python code will be skipped. Developers should also install pip
and virtualenv (version 15.0.2 - older versions may cause build failures).
The build expects Python a python3
installation which should be 3.5.3 or better. Python also tests kerberos and
therefore requires:
sudo apt install libkrb5-dev krb5-user
As of TinkerPop 3.5.5, gremlin-python
uses Docker for all tests inside of Maven, and Python installation will not be required to
run gremlin-python
through Maven. Please make sure Docker is installed and running on your system.
Once the Python environment is established, the full building and testing of gremlin-python
may commence. It
can be done manually from the command line with:
mvn clean install -Pglv-python
which enables the "glv-python" Maven profile or in a more automated fashion simply add a .glv
file to the root of the
gremlin-python
module which will signify to Maven that the environment is Python-ready. The .glv
file need not have
any contents and is ignored by Git. A standard mvn clean install
will then build gremlin-python
in full.
The .glv
file in gremlin-python
also activates the "console-integration-tests" Maven profile to run gremlin-console
integration tests. Alternatively, this profile can be activated manually. Note that unlike gremlin-python
the tests
are actually integration tests and therefore must be actively switched on with -DskipIntegrationTests=false
:
mvn clean install -pl gremlin-console -DskipIntegrationTests=false
Tip
|
For those who do not have a full Maven environment, please see this section for how Docker can be used to help run tests. |
Tip
|
Consider use of pyenv to better manage Python versions and environments. |
See the Release Environment section for more information on release manager configurations.
The build optionally requires .NET SDK (>=8.0) to work with the
gremlin-dotnet
module. If .NET SDK is not installed, TinkerPop will still build with Maven, but .NET projects
will be skipped.
As of TinkerPop 3.5.5, gremlin-dotnet
uses Docker for running all test projects inside of Maven, and .NET SDK will not
be required to run gremlin-dotnet
tests through Maven. Please make sure Docker is installed and running on your system.
gremlin-dotnet
can be built and tested from the command line with:
mvn clean install -Pgremlin-dotnet
which enables the "gremlin-dotnet" Maven profile or in a more automated fashion simply add a .glv
file to the src
and test
directories of the gremlin-dotnet
module which will signify to Maven that the environment is .NET-ready.
The .glv
file need not have any contents and is ignored by Git. A standard mvn clean install
will then build
gremlin-dotnet
in full.
In order to pack the Gremlin.Net.Template project, it is also necessary to install Mono.
The template can still be built and tested without Mono but packing will be skipped.
To pack the template (which will also download the NuGet CLI tool)
the nuget
property has to be set:
mvn clean install -Dnuget
Tip
|
For those who do not have a full Maven environment, please see this section for how Docker can be used to help run tests. |
See the Release Environment section for more information on release manager configurations.
When building gremlin-javascript
, mvn command will include a local copy of Node.js runtime and npm inside your project
using com.github.eirslett:frontend-maven-plugin
plugin. This copy of the Node.js runtime will not affect any
other existing Node.js runtime instances in your machine.
To run the development and build scripts of gremlint
and its corresponding web page docs/gremlint
, Node.js and npm
have to be installed. When generating or publishing the TinkerPop website, the docs/gremlint
web page has to be
built. Consequently, the scripts bin/generate-home.sh
and bin/publish-home.sh
require that Node.js and npm are
installed. Version 8.x or newer of npm is required. This is covered in more detail in the Site section.
As of TinkerPop 3.5.5, gremlin-javascript
uses Docker for all tests inside of Maven. Please make sure Docker is
installed and running on your system.
Important
|
Beware of unexpected or unwanted changes on package-lock.json files when committing and merging.
|
Tip
|
For those who do not have a full Maven environment, please see this section for how Docker can be used to help run tests. |
Tip
|
Consider using nvm to manage node.js versions and environments. |
See the Release Environment section for more information on release manager configurations.
The build optionally requires Go (>=1.22) to work with the gremlin-go
module. Creating an
empty .glv
file will enable running of tests inside of Maven. If .glv
file does not exist, TinkerPop
will still build with Maven, but Go projects will be skipped.
gremlin-go
can be built and tested from the command line with:
mvn clean install -pl gremlin-go
Alternatively, after installing Go, gremlin-go
can be built from the command line with:
go build
Docker allows you to test the driver without installing any dependencies. The following command can be used to run docker:
docker compose up --exit-code-from gremlin-go-integration-tests
See the Release Environment section for more information on release manager configurations.
The build optionally requires Docker to build Docker images of Gremlin Server and Gremlin Console. The Docker images can be built from the command line with:
mvn clean install -pl gremlin-server,gremlin-console -DdockerImages
which enables the "docker-images" Maven profile.
If confronted with "Permission denied" errors on Linux, it may be necessary to do the following:
sudo groupadd docker
sudo usermod -aG docker $USER
newgrp docker
sudo chmod 666 /var/run/docker.sock
As of TinkerPop 3.5.5, a docker image of the Gremlin Server will be built automatically with mvn clean install
, which
is use for GLV tests inside of Docker. To skip building this image, append the -DskipImageBuild
flag to Maven commands.
This section is only useful to TinkerPop release managers and describes prerequisites related to deploying an official release of TinkerPop.
Maven needs to be configured to deploy maven artifacts. Apache LDAP credentials can be used for this. Release
managers should encrypt their Apache LDAP password as described
in the Apache Maven docs.
The encrypted password can then be configured in the settings.xml
as described in the section
"Set up your development environment" of this Apache Infra article.
This configuration will be used by mvn deploy
.
For Python releases, uploading to pypi uses twine which is automatically
installed by the build process in maven. Twine refers to HOME/.pypirc
file for configuration on the pypi deploy
environments and username and password combinations. The file typically looks like this:
[distutils]
index-servers=
pypi
pypitest
[pypitest]
username = <username>
password =
[pypi]
username = <username>
password =
The approach above assumes basic authentication and a blank password will in .pyirc
will have Maven prompt for a
password. That said, it is preferred that your account be secured with 2FA, in which case, basic authentication will
no longer be allowed. Instead, an API token is required. It can be generated under your
account settings and then the username in .pyirc
is set to token
and the password to the token value. Please see PyPI documentation for more
details.
For .NET releases, install Mono. The release process is known to work with 6.12.0,
so it is best to probably install that version. Release managers should probably also do an install of
nuget 3.4.4 as it will help with environmental setup.
To get an environment ready to deploy to NuGet, it is necessary to have a NuGet API key. First, create an account with
nuget and request that a PMC member add your account to the Gremlin.Net and
the Gremlin.Net.Template package in nuget so that you can deploy. Next, generate an API key for your account on the
nuget website. The API key should be added to NuGet.Config
with the following:
mono nuget.exe setApiKey [your-api-key]
This should update ~/.config/NuGet/NuGet.Config
a file with an entry containing the encrypted API key. On
mvn deploy
, this file will be referenced on the automated nuget push
.
To deploy JavaScript / TypeScript artifacts on the npm registry, the release manager must
set the authentication information on the ~/.npmrc file. The easiest way to do that is to use the npm adduser
command. This must be done only once, as the auth token doesn’t have an expiration date and it’s stored on your file
system. If this account is newly created then request that a PMC member add your account to the "gremlin" package on
npm.
Deploying Docker images to Docker Hub requires an account that is a member of the TinkerPop
organization. So if you don’t already have an account on Docker Hub then create one and request that
a PMC member adds your account to the TinkerPop organization. Afterwards, authentication information needs to be added to
the ~/.docker/config.json
file. This information can simply be added with the docker login
command which will ask for
credentials. This must be done only once. Finally, docker push
can be used to push images to Docker Hub which will
be done automatically on mvn deploy
or it can be triggered manually with mvn docker:push
.
The following commands are a mix of Maven flags and shell scripts that handle different build operations
-
Build project:
mvn clean install
-
Build a specific module (e.g.
gremlin-server
) within the project:mvn clean install -pl gremlin-server
-
Build without assertions for "iterator leaks" which are enabled by default:
mvn clean install -DtestIteratorLeaks=false
-
Specify specific tests in a TinkerPop Suite to run with the
GREMLIN_TESTS
environment variable, along with the Maven project list argument, e.g.:export GREMLIN_TESTS='org.apache.tinkerpop.gremlin.process.traversal.step.map.PathTest$Traversals,org.apache.tinkerpop.gremlin.process.traversal.PathTest' mvn -Dmaven.javadoc.skip=true --projects tinkergraph-gremlin test
-
Clean the
.groovy/grapes/org.apache.tinkerpop
directory on build:mvn clean install -DcleanGrapes
-
Turn off "heavy" logging in the "process" tests:
mvn clean install -DargLine="-DmuteTestLogs=true"
-
The test suite for
neo4j-gremlin
is disabled by default - to turn it on:mvn clean install -DincludeNeo4j
-
-
Regenerate toy graph data (only necessary given changes to IO classes):
mvn clean install -Dio
fromtinkergraph-gremlin
directory-
If there are changes to the Gryo format, it may be necessary to generate the Grateful Dead dataset from GraphSON (see
IoDataGenerationTest.shouldWriteGratefulDead
)
-
-
Start Gremlin Server with Docker using the standard test configuration:
docker/gremlin-server.sh
-
Check license headers are present:
mvn apache-rat:check
-
Build AsciiDocs (see Documentation Environment):
bin/process-docs.sh
-
Build AsciiDocs (but don’t evaluate code blocks):
bin/process-docs.sh --dryRun
-
Build AsciiDocs (but don’t evaluate code blocks in specific files):
bin/process-docs.sh --dryRun docs/src/reference/the-graph.asciidoc,docs/src/tutorial/getting-started,…
-
Build AsciiDocs (but evaluate code blocks only in specific files):
bin/process-docs.sh --fullRun docs/src/reference/the-graph.asciidoc,docs/src/tutorial/getting-started,…
-
Process a single AsciiDoc file: docs/preprocessor/preprocess-file.sh `pwd`/gremlin-console/target/apache-tinkerpop-gremlin-console-*-standalone "" "*" `pwd`/docs/src/xyz.asciidoc
-
-
Build JavaDocs/JSDoc:
mvn process-resources -Djavadoc
-
Javadoc to
target/site/apidocs
directory -
JSDoc to the
gremlin-javascript/src/main/javascript/gremlin-javascript/doc/
directory
-
-
Specify the seed used for
Random
in testsmvn clean install -DtestSeed
- useful when a test fails, the seed will be printed in the build output so that the test can run with the same version of random (look for "TestHelper" logger in output) -
Check for newer dependencies:
mvn versions:display-dependency-updates
ormvn versions:display-plugin-updates
-
Check the effective
pom.xml
:mvn -pl gremlin-python -Pglv-python help:effective-pom -Doutput=withProfilePom.xml
-
Deploy JavaDocs/AsciiDocs:
bin/publish-docs.sh svn-username
-
Integration Tests:
mvn verify -DskipIntegrationTests=false
-
Execute with the
-DincludeNeo4j
option to include neo4j-gremlin tests. As of 3.7.0, transactional tests on Gremlin Server, Gremlin Driver, and GLVs will run automatically with integration tests against the TinkerTransactionGraph. -
Execute with the
-DuseEpoll
option to try to use Netty native transport (works on Linux, but will fallback to Java NIO on other OS).
-
-
Benchmarks:
mvn verify -DskipBenchmarks=false
-
Reports are generated to the console and to
gremlin-tools/gremlin-benchmark/target/reports/benchmark
.
-
-
Test coverage report:
mvn clean install -Dcoverage
- note that theinstall
is necessary because report aggregation is bound to that part of the lifecycle.-
Reports are generated to
gremlin-tools/gremlin-coverage/target/site
.
-
-
cd site
-
Generate web site locally:
bin/generate-home.sh
-
Publish web site:
bin/publish-home.sh <username>
-
The following steps must be taken in order to build TinkerPop on Windows:
-
Install winutils for Hadoop
-
Download the latest version of winutils binaries for Hadoop. The binaries
winutils.exe
andhadoop.dll
are required.-
Note that these libraries require Microsoft Visual C Redistributable 2015-2022 to be installed. We've tested it on Windows 10 and Windows 11 with Microsoft Visual C Redistributable 2015-2022 14.32.31326.
-
-
Place contents of the bin folder on your local driver in the following folder structure:
-
e.g.
hadoop-3.3.1/bin/winutils.exe
-
-
Set
HADOOP_HOME
to point to thehadoop-3.3.1
folder -
Add
%HADOOP_HOME%\bin
to yourPATH
-
-
Run
mvn clean install
from root of tinkerpop -
Follow IDE specific steps if applicable:
You should now be able to work with TinkerPop on Windows.
TinkerPop provides a shell script, that can start several build tasks within a Docker container. The required Docker images will be built automatically if they don’t exist yet. Thus the first invocation of the Docker script is expected to take some time.
The script can be found under PROJECT_HOME/docker/build.sh
. The following tasks are currently
supported:
-
run standard test suite
-
run integration tests
-
build Java docs
-
build user docs
A list of command line options is provided by docker/build.sh --help
. The container will install,
configure and start all required dependencies, such as Hadoop.
By default, this script will run every module in the project. However, if you are planning on working on just a small set of the modules (e.g. the GLVs) then you can use the script options to reduce the modules included by specifically selecting which modules you want. This behavior is currently supported for the non-Java GLVs and gremlin-console. This option will include only the selected modules as well as gremlin-server, gremlin-test, neo4j-gremlin and all their dependencies. This is the minimum set of modules required to build and test the GLVs.
./docker/build.sh --tests --integration-tests --python --golang
Options can be passed to Docker by setting the TINKERPOP_DOCKER_OPTS
environment variable. A speed boost can
be gained at the expense of memory by using tmpfs and the special directory /usr/src/tinkermem
.
TINKERPOP_DOCKER_OPTS="--tmpfs /usr/src/tinkermem:exec,mode=0755,rw,noatime,size=2000m"
TINKERPOP_DOCKER_OPTS="--sysctl net.ipv6.conf.all.disable_ipv6=1 --sysctl net.ipv6.conf.default.disable_ipv6=1"
A custom maven settings.xml can be supplied, for example, to point to a local proxy. Copy the settings.xml
to the
PROJECT_HOME/
directory. The Docker script will detect and copy it to the running container.
If the container is used to generate the user docs, it will start a web server and show the URL that is used to host the HTML docs.
After finishing all tasks, the script will immediately destroy the container.
Docker can also be helpful to developers who do not want to run tests from a Maven environment, which may be a bit opaque when dealing with test failures and largely unhelpful for debugging. This situation is typically case for developers doing work on Gremlin Language Variants (e.g. Python). To help alleviate this problem, developers can start a standalone Gremlin Server with its standard test configuration that is used in the standard Maven build.
Generally speaking, most developers will want to test their code against the latest build of Gremlin Server in the TinkerPop repository. To do that, first be sure to build a Docker image of the current code:
mvn clean install -DskipTests
Next, generate the a Docker image for Gremlin Server with:
mvn clean install -pl :gremlin-server -DdockerImages -DskipTests
Important
|
If changes are made to the repository that need to be reflected in the Gremlin Server Docker image then the old image should be removed and then the above commands re-executed. |
Finally, start the server with:
docker/gremlin-server.sh
Starting Gremlin Server this way makes it possible to run Gremlin Language Variant tests without Maven (for example, directly from a debugger) which should greatly reduce development friction for these environments.
It is also possible to specify the exact version of Gremlin Server to run with the test configuration. This version should be an existing Docker image version and must be an explicit version that maps to an actual TinkerPop artifact:
docker/gremlin-server.sh 3.4.2
To be a bit more clear, the version can not be a Docker tag like "latest" because there is no such TinkerPop artifact that has been published with that version number.
Currently the modules gremlin-go, gremlin-javascript, gremlin-dotnet, gremlin-python and gremlin-console can be tested through Docker.
Please make sure Docker is installed and running on your system. You will need to install both Docker Engine and Docker Compose, which are included in Docker Desktop.
The following environment variables used by Docker Compose will automatically be set when running through Maven.
The docker compose environment variable GREMLIN_SERVER
specifies the Gremlin server docker image to use, i.e. an
image with the tag tinkerpop/gremlin-server:$GREMLIN_SERVER
, and is a required environment variable. This also
requires the specified docker image to exist, either locally or in Docker Hub.
Running mvn clean install -pl gremlin-server -DskipTests -DskipIntegrationTests=true -Dci -am
in the main tinkerpop
directory will automatically build a local SNAPSHOT Gremlin server image. If your OS Platform cannot build a local
SNAPSHOT Gremlin server through maven
, it is recommended to use the latest released server version from
Docker Hub (do not use GREMLIN_SERVER=latest
, use actual
version number, e.g. GREMLIN_SERVER=3.5.x
or GREMLIN_SERVER=3.6.x
).
The docker compose environment variable HOME
specifies the user home directory for mounting volumes during test image
set up. This variable is set by default in Unix/Linux, but will need to be set for Windows, for example, run
$env:HOME=$env:USERPROFILE
in PowerShell.
There are different ways to launch the test suite and set the GREMLIN_SERVER
environment variable depending on
your Platform:
-
Run Maven commands, e.g.
mvn clean install
inside of project folder e.g.tinkerpop/gremlin-go
, ormvn clean install -pl gremlin-go
inside oftinkerpop
(platform-agnostic - recommended) -
Add
GREMLIN_SERVER=<server-image-version>
andHOME=<user-home-directory>
to an.env
file inside project folder and rundocker compose up --exit-code-from gremlin-go-integration-tests
(Platform-agnostic). -
Run
GREMLIN_SERVER=<server-image-version> docker compose up --exit-code-from gremlin-go-integration-tests
in Unix/Linux. -
Run
$env:GREMLIN_SERVER="<server-image-version>";$env:HOME=$env:USERPROFILE;docker compose up --exit-code-from gremlin-go-integration-tests
in Windows PowerShell.
You should see exit code 0 upon successful completion of the test suites. Run docker compose down
to remove the
service containers (not needed if you executed Maven commands or run.sh
), or docker compose down --rmi all
to
remove the service containers while deleting all used images.
Note for running docker with MacOS on ARM processors: Docker’s performance is extremely poor on ARM Mac’s in its default configuration. It is recommended to enable both the "New Virtualization Framework" and "VirtioFS" under Docker Desktop Settings → Experimental Features.
Most core TinkerPop developers are using Intellij for their work so this section helps describe the mechanisms for best working with it as an IDE.
Installation and basic configuration of Intellij is beyond the scope of this writing and it is assumed that the TinkerPop GitHub repository has been cloned and the root of the repository is open in Intellij. From there, we can begin to look at configuration options specifically relevant to TinkerPop itself.
TinkerPop has a module called gremlin-shaded
which contains shaded dependencies for some libraries that are widely
used and tend to introduce conflicts. To ensure that Intellij properly interprets this module after importing the
Maven pom.xml
perform the following steps:
-
Build
gremlin-shaded
from the command line withmvn clean install
. -
Right-click on the
gremlin-shaded
module in the project viewer of Intellij and select "Remove module". If this menu option is not available (as is the case in newer versions of Intellij - first noticed in 13.1.5), then open the "Maven Projects" side panel, right click thegremlin-shaded
module and select "Ignore Project". -
In the "Maven Projects" Tool window and click the tool button for "Reimport All Maven projects" (go to
View | Tool Windows | Maven Projects
on the main menu if this panel is not activated). -
At this point it should be possible to compile and run the tests within Intellij, but in the worst case, use
File | Invalidate Caches/Restart
to ensure that indices properly rebuild.
Note that it may be necessary to re-execute these steps if the gremlin-shaded
pom.xml
is ever updated.
You will initially see lots of errors related to the Gremlin lexer/parser. The gremlin-language
module requires
ANTLR processing. While this processing is configured to execute with Maven, it can also be setup to generate parser
files within Intellij itself on command:
-
Install the ANTLR4 Grammar Plugin for Intellij
-
Right-click on the
Gremlin.g4
file and "Configure ANTLR" -
Set "Output directory where all output is generated" to
target/generated-sources/antlr4
-
Set "Grammar file encoding" to
utf-8
-
Set "Package/namespace for the generated code" to
org.apache.tinkerpop.gremlin.language.grammar
-
Set "Language" to
Java
-
Set "Case transformation in the Preview window" should be "Leave as-is"
-
The "generate parse tree listener" should be unchecked and the "generate parse tree visitor" should be checked.
With these settings it should be possible to right-click Gremlin.g4
and "Generate ANTLR Recognizer" which will place
the generated code in where specified at target/generated-sources/antlr4
. Be sure to right-click the antlr4
directory and "Mark directory as" "Generated Sources Root" which should allow Intellij to recognize it.
The gremlin-groovy
module uses a Java annotation processor to help support DSLs. Annotation processing in Intellij
should be set up by the Maven import, but if this is set up incorrectly you will see a cryptic error message when
building: java: Compilation failed: internal java compiler error
. To fix this, search for the Intellij setting
Annotation Processors
. Make sure that annotation processing is enabled for gremlin-groovy
but disabled for
gremlin-annotations
. This should fix the internal java compile error. Next you may see compile errors complaining
that the jsr223
classes __
, `CredentialTraversal
, CredentialTraversalSource
, and DefaultCredentialTraversal
cannot be found. Those classes are generated by annotations. To fix these errors, make sure mark the
directory gremlin-groovy/target/generated-sources/annotations
as "Generated Sources Root".
Developers working on the neo4j-gremlin
module should enabled the include-neo4j
Maven profile in Intellij.
This will ensure that tests will properly execute within the IDE.
It is generally assumed that JVM-based debugging of TinkerPop code in Intellij is a relatively straightforward task for most developers, but it is worth pointing out a few important points related to it and to drill into some specifics for the non-JVM languages.
There are generally no complexities to running the debugger for any JVM-based test in the repository, but the following tips are helpful to know when doing so:
-
It is not possible to run the tests in
gremlin-test
without anGraph
implementation. For example, it is not possible to just right-clickorg.apache.tinkerpop.gremlin.process.traversal.step.filter.CoinTest
and then selectDebug 'CoinTest'
. Instead, running that test would require openingtinkergraph-gremlin
and executing theTinkerGraphProcessStandardTest
(as an example) which runs the full Gremlin test suite to includeCoinTest
. -
To run just
CoinTest
, set aGREMLIN_TESTS
environment variable with the fully qualified path name to the test configuration in Intellij. Note that the fully qualified test name is reallyorg.apache.tinkerpop.gremlin.process.traversal.step.filter.CoinTest$Traversals
. -
Gherkin tests have a similar pattern in that they require a
Graph
implementation to execute them. Therefore, debugging entails going totinkergraph-gremlin
and runningTinkerGraphFeatureTest
in the debugger. -
It is possible to filter the Gherkin tests by adding a system property to the debug configuration that specifies the tags to use or ignore. For example to just run the
coin()
tests:-Dcucumber.filter.tags="@StepCoin"
Debugging Python within this mainly JVM-based project structure requires a bit of configuration. The following steps will help get Intellij prepared for this task:
-
Install the Python plugin from JetBrains which should provide PyCharm like functionality.
-
Right-click the "tinkerpop" top-level module in Intellij’s project explorer and "Open Module Settings".
-
Select "Platform Settings | SDKs" and then click the "+" to "Add Python SDK…"
-
Choose a "Virtualenv Environment" and a "New environment". Set the "Location" to "<project-root>/gremlin-python/src/main/python". Select a "Base interpreter" that matches the version required by
gremlin-python
and click "OK". -
Select "Project Settings | Modules" and then select "gremlin-python" from the listing. Change the "Module SDK" to the newly added Python SDK.
-
Open a terminal to
gremlin-python/scr/main/python
and dovenv/bin/pip3 install -e .
to pull in all of thegremlin-python
dependencies.
At this stage, it should be possible to run unit tests in Python:
-
Pull down the drop down in the toolbar for "Run/Debug Configurations" and select "Edit Configurations…"
-
Click the "+" to "Python Tests | pytest" and browse to a test to supply the "Script path".
-
Go to "Python interpreter" and select "Use SDK of module" and choose "gremlin-python" in the drop-down and click OK.
-
The test should not appear in the "Run/Debug Configurations" and can be executed.
Note
|
When the Debug button is pressed, Intellij may display a notification that using the debugger requires some additional downloads - confirm those installations as required. |
Many of the tests in gremlin-python
require Gremlin Server. They are effectively integration tests. TinkerPop makes
it easy to debug these tests by providing a Docker based test server which is rigged up with all the configurations
required for the tests to execute. Start this server with docker/gremlin-server.sh -n
where the -n
will enable
Neo4j for transaction based tests.