External Workers¶

Tierkreis does not put any restrictions on workers except the general contract. This means that you can also write workers in your preferred language and interface it with Tierkreis. To incorporate the worker into a Tierkreis graph, you can either describe the interface using a subset of TypeSpec to generate stubs for your worker or use the untyped functions in the graph builder. To run the worker you may use one of the existing executors; In the common case when you can run your code as a commandline script, the ShellWorker is appropriate. Alternatively you can write your own executor to launch the worker process.

For a full example without stubs you can have a look at the signing_graph.py example.

IDL¶

A TypeSpec file can be used to define the inputs and outputs of the tasks in your workers.
Tierkreis is restricted to a subset of the full specification:

Interfaces: defining the available tasks in your worker. Currently, Tierkreis allows only one interface definition for a worker, which must have the same name as the directory it is placed in.
Models: to define the types of the task in- and outputs Additionally, you may use the @portmapping decorator on models to indicate tasks with multiple outputs.

For example you could define the following interface in TestNamespace/namespace.tsp

@portmapping
model Person {
  name: string;
  age: uint8;
}

model Dog{
  name: string;
  color: string;
}

@portmapping
model Creature<T> {
  name: string;
  t: T;
}
interface TestNamespace {
  foo(age: integer, name: string): Person;
  bar(): Dog;
  baz<T>(c: Creature<T>): Creature<T>;
}

From which you could generate stubs. This is currently only available through a python script:

# /// script
# requires-python = ">=3.12"
# dependencies = ["tierkreis"]
# ///

from pathlib import Path
from tierkreis import Worker
from tierkreis.namespace import Namespace


if __name__ == "__main__":
    tsp_path = Path("<path to your spec file>")
    namespace = Namespace.from_spec_file(tsp_path)
    namespace.write_stubs(tsp_path.parent / "stubs.py")

Parsing Inputs and Outputs¶

There are two possible scenarios to parse input and outputs for workers. You can either define a custom executor in python; this has the advantage of interacting with the storage from Tierkreis, making it simpler to read inputs. But you also have to make sure the outputs are written to the correct places. Alternatively you can use an existing executor, which means you have to do the parsing inside the worker. Tierkreis typically hands over a single argument to the invoked script: the NodeCallArgs This contains all the information specified in the the contract of a worker. Parsing this file will provide the locations of inputs and expected outputs.

C++ Example¶

This example shows how to parse the following interface:

interface MyWorker {
    double(value: Array<integer>): Array<integer>;
}

using nlohmann/json.

void parse_json(const std::string &call_args_path, json &call_args)
{
  std::ifstream call_args_file(call_args_path);
  if (!call_args_file.is_open()) // check if open is necessary for << syntax
  {
    std::cerr << "Error: Could not open configuration file: " << call_args_path << std::endl;
    MPI_Abort(MPI_COMM_WORLD, 1);
    return;
  }
  try
  {
    call_args = json::parse(call_args_file);
  }
  catch (json::parse_error &e)
  {
    std::cerr << "Error: JSON parsing failed: " << e.what() << std::endl;
    MPI_Abort(MPI_COMM_WORLD, 1);
    return;
  }
  call_args_file.close();
}

void parse_input(const json &call_args, std::vector<int> &input_data)
{
  json inputs = call_args["inputs"];
  std::cout << inputs << std::endl;
  for (auto &[key, path] : inputs.items())
  {
    if (key != "value")
      continue; // we only want to load value

    auto abs_path = CHECKPOINTS_DIRECTORY + path.template get<std::string>();
    std::cout << "parsing input: " << key << " at: " << abs_path << std::endl;
    json data;
    parse_json(abs_path, data);
    std::cout << "Read data" << data << std::endl;
    auto tmp = data.template get<std::string>();
    std::vector<int> ret = json::parse(tmp);
    input_data.resize(ret.size());
    std::copy(ret.begin(), ret.end(), input_data.begin());
  }
}

void write_output(const json &call_args, const std::vector<int> &gathered_data)
{

  json outputs = call_args["outputs"];
  for (auto &[key, path] : outputs.items())
  {
    if (key != "value")
      continue; // we only want to write value
    std::cout << "writing output: " << key << " at: " << path << std::endl;
    auto out_path = CHECKPOINTS_DIRECTORY + path.template get<std::string>();
    std::ofstream output_file(out_path);
    if (!output_file.is_open())
    {
      std::cerr << "Error: Could not open output file for writing: " << out_path << std::endl;
      MPI_Abort(MPI_COMM_WORLD, 1);
      return;
    }
    json data = gathered_data;
    output_file << data << std::endl;
    output_file.close();
  }

  auto done_file_path = CHECKPOINTS_DIRECTORY + call_args["done_path"].template get<std::string>();
  std::ofstream done_file(done_file_path);
  if (done_file.is_open())
  {
    done_file.close();
  }
  else
  {
    std::cerr << "Error: Could not create done file at " << done_file_path << std::endl;
  }
}