Migrating From 1.x
If you aren't coming from an existing deployment of Horizon, feel free to skip this section and move on to installing Horizon!
Starting with version v1.6.0, Horizon allows using Stellar Core in "captive" mode for ingestion. This mode has been enabled by default since Horizon 2.0, so even though you can enable captive mode on 1.6+, this migration guide is catered towards upgrading to 2.x due to the stability and configuration improvements introduced in the later versions.
Please note that Horizon team will support the previous non-captive mode for the time being. To use the previous method, set
ENABLE_CAPTIVE_CORE_INGESTION=false in your ingesting instances. After 6 months, this compatibility flag will be removed.
Please start with the blog post to understand the major changes that Horizon 2.0 introduces with the Captive Core architecture. In summary, Captive Core is a specialized, narrowed-down Stellar-Core instance with the sole aim of emitting transaction metadata to Horizon. It means:
- no separate Stellar Core instance
- no Core database: everything done in-memory
- much faster ingestion
Captive Stellar Core completely eliminates all Horizon issues caused by connecting to Stellar Core's database, but it requires extra time to initialize and manage its Stellar Core subprocess. Captive Core can be used in both reingestion (
horizon db reingest range) and normal Horizon operation (
horizon serve). In fact, using Captive Core to reingest historical data is considerably faster than without it.
How It Works
The blog post linked above gives a high-level overview, while this section dives a little deeper into the technical differences relative to Horizon's relationship with standalone, "Watcher" Core.
When using Captive Core, Horizon runs the
stellar-core binary as a subprocess. Then, both processes communicate over filesystem pipe: Core sends
xdr.LedgerCloseMeta structs with information about each ledger and Horizon reads it.
The behaviour is slightly different when reingesting old ledgers and when reading recently closed ledgers:
When reingesting, Stellar Core is started in a special
catchupmode that simply replays the requested range of ledgers. This mode requires an additional 3GiB of RAM because all ledger entries are stored in memory, making it extremely fast. This mode only depends on the history archives, so a Captive Core configuration (see below) is not required.
When reading recently closed ledgers, Core is started with a normal
runcommand. This mode also requires an additional 3GiB of RAM for in-memory ledger entries. In this case, a configuration file (again, read on below) is required in order to configure a quorum set so that it can connect to the Stellar network.
As discussed earlier, Captive Core provides much better decoupling for Horizon at the expense of persistence. You should be aware of the following consequences:
- Captive Core requires a couple of minutes to complete the "apply buckets" stage first time Horizon is started, but it should reuse the cached buckets on subsequent restarts (as of Horizon 2.5 and Core 17.1).
- If the Horizon process terminates, Stellar Core is also terminated.
- Running Horizon now requires more RAM and less disk space. You can refer to the earlier Prerequisites page for details.
To hedge against these limitations, we recommend running multiple ingesting Horizon servers in a single cluster. This allows other ingesting instances to maintain service without interruptions if a Captive Core instance is restarted.
Now, we'll discuss migrating existing systems running the pre-2.0 versions of Horizon to the new Captive Core world.
The first major change from 1.x is how you will configure Horizon. You will no longer need your Stellar Core configuration, but will rather need to craft a configuration file describing Captive Core's behavior. Read this section to understand what the stub should contain.
Your old configuration cannot be used directly: Horizon needs special settings for Captive Core. Otherwise, running Horizon may fail with the following error, or errors like it:
Invalid captive core toml file: LOG_FILE_PATH in captive core config file does not match Horizon captive-core-log-path flag
Again, while the Captive Core configuration file may appear to just be a subset of Stellar Core's configuration, you shouldn't think about it that way and treat it as its own format. It may diverge in the future, and not all of Core's options are available to Captive Core.
You should pass the location of this new TOML configuration to the
CAPTIVE_CORE_CONFIG_PATH command-line flag / environmental variable.
If you want to continue to have access to the underlying Stellar Core subprocess (like you did previously with a standalone Watcher Core), you should set the
HTTP_PORT field in your configuration file accordingly.
Once you have a configuration file ready, you should also modify your Horizon configuration to include Captive Core parameters. Within
/etc/default/stellar-horizon, you should add:
# Captive Core Ingestion Config
# end Captive Core
You may need to adjust these accordingly, for example by pointing
CAPTIVE_CORE_CONFIG_PATH to your configuration file and possibly
CAPTIVE_CORE_STORAGE_PATH to where you'd like Captive Core to store its bucket files (but keep in mind the disk space and permissions requirements).
Finally, the process for upgrading both Stellar Core and Horizon is covered here.
Depending on the version you're migrating from, you may need to include an additional step here: manual reingestion. This can still be accomplished with Captive Core; see below.
Now, we can stop Core and restart Horizon:
sudo systemctl stop stellar-core
sudo systemctl restart stellar-horizon
After a few moments, the logs should show Captive Core running successfully as a subprocess, and eventually Horizon will be running as usual except with Captive Core rapidly generating transaction metadata in-memory!
If you want your Captive Core instance to connect to a private Stellar network, you will need to specify the validator(s) of the private network in the Captive Core configuration file.
Assuming the validator of your private network has a public key of
GD5KD2KEZJIGTC63IGW6UMUSMVUVG5IHG64HUTFWCHVZH2N2IBOQN7PS and can be accessed at
private1.validator.com, then the Captive Core config would consist of the following:
FAILURE_SAFETY=0 are required when there are too few validators in the private network to form a quorum.
You will also need to set
RUN_STANDALONE=false in the Stellar Core configuration for the validator. Otherwise, the validator will not accept connections on its peer port, which means Captive Core will not be able to connect to the validator.
On a new Stellar network, the first history archive snapshot is published after ledger 63 is closed. Captive Core depends on the history archives, which means that Horizon ingestion via Captive Core will not begin until after ledger 63 is closed. Assuming the standard 5 second delay in between ledgers, it will take ~5 minutes for the network to progress from the genesis ledger to ledger 63.
There are cases where you may need to repeatedly create new private networks (e.g. spawning a private network during integration tests) and this 5 minute delay is too costly. In that case, you can consider including
ARTIFICIALLY_ACCELERATE_TIME_FOR_TESTING=true in both the validator configuration and the Captive Core configuration. When this parameter is set, Stellar Core will publish a new ledger every second. It will also publish history archive snapshots every 8 ledgers, so you will need to set Horizon's checkpoint frequency parameter (
CHECKPOINT_FREQUENCY) to 8.
After migrating to the Captive Core world, you will assuredly need to reingest your history again.
The Ingestion guide should refresh your memory on this: nothing has really changed aside from how quickly reingestion gets done. For example, a full reingestion of the entire network only takes ~1.5 days (as opposed to weeks previously) on an m5.8xlarge instance.