pub struct IndexerExecutor<P> { /* private fields */ }Expand description
The Executor of the main ingestion pipeline process.
This struct orchestrates the execution of multiple worker pools, handling
checkpoint distribution, progress tracking, and shutdown. It utilizes
ProgressStore for persisting checkpoint progress and provides metrics
for monitoring the indexing process.
§Example
use async_trait::async_trait;
use iota_data_ingestion_core::{
DataIngestionMetrics, FileProgressStore, IndexerExecutor, IngestionError, ReaderOptions,
Worker, WorkerPool,
};
use iota_types::full_checkpoint_content::CheckpointData;
use prometheus::Registry;
use tokio_util::sync::CancellationToken;
use std::{path::PathBuf, sync::Arc};
struct CustomWorker;
#[async_trait]
impl Worker for CustomWorker {
type Message = ();
type Error = IngestionError;
async fn process_checkpoint(
&self,
checkpoint: Arc<CheckpointData>,
) -> Result<Self::Message, Self::Error> {
// custom processing logic.
println!(
"Processing Local checkpoint: {}",
checkpoint.checkpoint_summary.to_string()
);
Ok(())
}
}
#[tokio::main]
async fn main() {
let concurrency = 5;
let progress_store = FileProgressStore::new("progress.json").await.unwrap();
let mut executor = IndexerExecutor::new(
progress_store,
1, // number of registered WorkerPools.
DataIngestionMetrics::new(&Registry::new()),
CancellationToken::new(),
);
// register a worker pool with 5 workers to process checkpoints in parallel
let worker_pool = WorkerPool::new(CustomWorker, "local_reader".to_string(), concurrency, Default::default());
// register the worker pool to the executor.
executor.register(worker_pool).await.unwrap();
// run the ingestion pipeline.
executor
.run(
PathBuf::from("./chk".to_string()), // path to a local directory where checkpoints are stored.
None,
vec![], // optional remote store access options.
ReaderOptions::default(), // remote_read_batch_size.
)
.await
.unwrap();
}Implementations§
Source§impl<P: ProgressStore> IndexerExecutor<P>
impl<P: ProgressStore> IndexerExecutor<P>
pub fn new( progress_store: P, number_of_jobs: usize, metrics: DataIngestionMetrics, token: CancellationToken, ) -> Self
Sourcepub async fn register<W: Worker + 'static>(
&mut self,
pool: WorkerPool<W>,
) -> IngestionResult<()>
pub async fn register<W: Worker + 'static>( &mut self, pool: WorkerPool<W>, ) -> IngestionResult<()>
Registers new worker pool in executor.
Sourcepub fn shutdown_when<F>(&mut self, f: F)
pub fn shutdown_when<F>(&mut self, f: F)
Registers a predicate callback that determines when the ingestion process should stop.
This function f will be called for every incoming checkpoint
before it’s sent to the worker pool.
Based on the returned ShutdownAction the executor will evaluate
whether to continue or stop the ingestion process by initiating the
graceful shutdown process.
Once a shutdown action is triggered, the executor will stop sending new checkpoints and will wait for all previously sent checkpoints to be processed by workers before initiating graceful shutdown process.
Note:
Calling this method after
with_ingestion_limit replaces the
earlier predicate, and vice versa. They are not cumulative.
Sourcepub fn with_ingestion_limit(&mut self, limit: IngestionLimit)
pub fn with_ingestion_limit(&mut self, limit: IngestionLimit)
Adds an upper‑limit policy that determines when the ingestion process should stop.
This is a convenience method, it internally uses
shutdown_when by registering a predicate
derived from the provided IngestionLimit.
Note:
Calling this method after shutdown_when
replaces the earlier predicate, and vice versa. They are not cumulative.
pub async fn update_watermark( &mut self, task_name: String, watermark: CheckpointSequenceNumber, ) -> IngestionResult<()>
pub async fn read_watermark( &mut self, task_name: String, ) -> IngestionResult<CheckpointSequenceNumber>
Sourcepub async fn run(
self,
path: PathBuf,
remote_store_url: Option<String>,
remote_store_options: Vec<(String, String)>,
reader_options: ReaderOptions,
) -> IngestionResult<HashMap<String, CheckpointSequenceNumber>>
pub async fn run( self, path: PathBuf, remote_store_url: Option<String>, remote_store_options: Vec<(String, String)>, reader_options: ReaderOptions, ) -> IngestionResult<HashMap<String, CheckpointSequenceNumber>>
Main executor loop.
§Error
Returns an IngestionError::EmptyWorkerPool if no worker pool was
registered.
Sourcepub async fn run_with_config(
self,
config: CheckpointReaderConfig,
) -> IngestionResult<HashMap<String, CheckpointSequenceNumber>>
pub async fn run_with_config( self, config: CheckpointReaderConfig, ) -> IngestionResult<HashMap<String, CheckpointSequenceNumber>>
Alternative main executor loop. Uses the new iteration of the
CheckpointReader supporting syncing checkppints from hybrid historical
store.
§Error
Returns an IngestionError::EmptyWorkerPool if no worker pool was
registered.
Auto Trait Implementations§
impl<P> Freeze for IndexerExecutor<P>where
P: Freeze,
impl<P> !RefUnwindSafe for IndexerExecutor<P>
impl<P> Send for IndexerExecutor<P>where
P: Send,
impl<P> !Sync for IndexerExecutor<P>
impl<P> Unpin for IndexerExecutor<P>where
P: Unpin,
impl<P> !UnwindSafe for IndexerExecutor<P>
Blanket Implementations§
§impl<U> As for U
impl<U> As for U
§fn as_<T>(self) -> Twhere
T: CastFrom<U>,
fn as_<T>(self) -> Twhere
T: CastFrom<U>,
self to type T. The semantics of numeric casting with the as operator are followed, so <T as As>::as_::<U> can be used in the same way as T as U for numeric conversions. Read more§impl<'a, T, E> AsTaggedExplicit<'a, E> for Twhere
T: 'a,
impl<'a, T, E> AsTaggedExplicit<'a, E> for Twhere
T: 'a,
§impl<'a, T, E> AsTaggedImplicit<'a, E> for Twhere
T: 'a,
impl<'a, T, E> AsTaggedImplicit<'a, E> for Twhere
T: 'a,
Source§impl<T> BorrowMut<T> for Twhere
T: ?Sized,
impl<T> BorrowMut<T> for Twhere
T: ?Sized,
Source§fn borrow_mut(&mut self) -> &mut T
fn borrow_mut(&mut self) -> &mut T
§impl<T> Conv for T
impl<T> Conv for T
§impl<T> FmtForward for T
impl<T> FmtForward for T
§fn fmt_binary(self) -> FmtBinary<Self>where
Self: Binary,
fn fmt_binary(self) -> FmtBinary<Self>where
Self: Binary,
self to use its Binary implementation when Debug-formatted.§fn fmt_display(self) -> FmtDisplay<Self>where
Self: Display,
fn fmt_display(self) -> FmtDisplay<Self>where
Self: Display,
self to use its Display implementation when
Debug-formatted.§fn fmt_lower_exp(self) -> FmtLowerExp<Self>where
Self: LowerExp,
fn fmt_lower_exp(self) -> FmtLowerExp<Self>where
Self: LowerExp,
self to use its LowerExp implementation when
Debug-formatted.§fn fmt_lower_hex(self) -> FmtLowerHex<Self>where
Self: LowerHex,
fn fmt_lower_hex(self) -> FmtLowerHex<Self>where
Self: LowerHex,
self to use its LowerHex implementation when
Debug-formatted.§fn fmt_octal(self) -> FmtOctal<Self>where
Self: Octal,
fn fmt_octal(self) -> FmtOctal<Self>where
Self: Octal,
self to use its Octal implementation when Debug-formatted.§fn fmt_pointer(self) -> FmtPointer<Self>where
Self: Pointer,
fn fmt_pointer(self) -> FmtPointer<Self>where
Self: Pointer,
self to use its Pointer implementation when
Debug-formatted.§fn fmt_upper_exp(self) -> FmtUpperExp<Self>where
Self: UpperExp,
fn fmt_upper_exp(self) -> FmtUpperExp<Self>where
Self: UpperExp,
self to use its UpperExp implementation when
Debug-formatted.§fn fmt_upper_hex(self) -> FmtUpperHex<Self>where
Self: UpperHex,
fn fmt_upper_hex(self) -> FmtUpperHex<Self>where
Self: UpperHex,
self to use its UpperHex implementation when
Debug-formatted.§fn fmt_list(self) -> FmtList<Self>where
&'a Self: for<'a> IntoIterator,
fn fmt_list(self) -> FmtList<Self>where
&'a Self: for<'a> IntoIterator,
§impl<T> Instrument for T
impl<T> Instrument for T
§fn instrument(self, span: Span) -> Instrumented<Self>
fn instrument(self, span: Span) -> Instrumented<Self>
§fn in_current_span(self) -> Instrumented<Self>
fn in_current_span(self) -> Instrumented<Self>
Source§impl<T> IntoEither for T
impl<T> IntoEither for T
Source§fn into_either(self, into_left: bool) -> Either<Self, Self>
fn into_either(self, into_left: bool) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moreSource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more§impl<T> IntoRequest<T> for T
impl<T> IntoRequest<T> for T
§fn into_request(self) -> Request<T>
fn into_request(self) -> Request<T>
T in a Request§impl<T> IntoRequest<T> for T
impl<T> IntoRequest<T> for T
§fn into_request(self) -> Request<T>
fn into_request(self) -> Request<T>
T in a tonic::Request§impl<L> LayerExt<L> for L
impl<L> LayerExt<L> for L
§fn named_layer<S>(&self, service: S) -> Layered<<L as Layer<S>>::Service, S>where
L: Layer<S>,
fn named_layer<S>(&self, service: S) -> Layered<<L as Layer<S>>::Service, S>where
L: Layer<S>,
Layered].§impl<T> Pipe for Twhere
T: ?Sized,
impl<T> Pipe for Twhere
T: ?Sized,
§fn pipe<R>(self, func: impl FnOnce(Self) -> R) -> Rwhere
Self: Sized,
fn pipe<R>(self, func: impl FnOnce(Self) -> R) -> Rwhere
Self: Sized,
§fn pipe_ref<'a, R>(&'a self, func: impl FnOnce(&'a Self) -> R) -> Rwhere
R: 'a,
fn pipe_ref<'a, R>(&'a self, func: impl FnOnce(&'a Self) -> R) -> Rwhere
R: 'a,
self and passes that borrow into the pipe function. Read more§fn pipe_ref_mut<'a, R>(&'a mut self, func: impl FnOnce(&'a mut Self) -> R) -> Rwhere
R: 'a,
fn pipe_ref_mut<'a, R>(&'a mut self, func: impl FnOnce(&'a mut Self) -> R) -> Rwhere
R: 'a,
self and passes that borrow into the pipe function. Read more§fn pipe_borrow<'a, B, R>(&'a self, func: impl FnOnce(&'a B) -> R) -> R
fn pipe_borrow<'a, B, R>(&'a self, func: impl FnOnce(&'a B) -> R) -> R
§fn pipe_borrow_mut<'a, B, R>(
&'a mut self,
func: impl FnOnce(&'a mut B) -> R,
) -> R
fn pipe_borrow_mut<'a, B, R>( &'a mut self, func: impl FnOnce(&'a mut B) -> R, ) -> R
§fn pipe_as_ref<'a, U, R>(&'a self, func: impl FnOnce(&'a U) -> R) -> R
fn pipe_as_ref<'a, U, R>(&'a self, func: impl FnOnce(&'a U) -> R) -> R
self, then passes self.as_ref() into the pipe function.§fn pipe_as_mut<'a, U, R>(&'a mut self, func: impl FnOnce(&'a mut U) -> R) -> R
fn pipe_as_mut<'a, U, R>(&'a mut self, func: impl FnOnce(&'a mut U) -> R) -> R
self, then passes self.as_mut() into the pipe
function.§fn pipe_deref<'a, T, R>(&'a self, func: impl FnOnce(&'a T) -> R) -> R
fn pipe_deref<'a, T, R>(&'a self, func: impl FnOnce(&'a T) -> R) -> R
self, then passes self.deref() into the pipe function.§impl<T> Pointable for T
impl<T> Pointable for T
§impl<T> PolicyExt for Twhere
T: ?Sized,
impl<T> PolicyExt for Twhere
T: ?Sized,
§impl<T> Tap for T
impl<T> Tap for T
§fn tap_borrow<B>(self, func: impl FnOnce(&B)) -> Self
fn tap_borrow<B>(self, func: impl FnOnce(&B)) -> Self
Borrow<B> of a value. Read more§fn tap_borrow_mut<B>(self, func: impl FnOnce(&mut B)) -> Self
fn tap_borrow_mut<B>(self, func: impl FnOnce(&mut B)) -> Self
BorrowMut<B> of a value. Read more§fn tap_ref<R>(self, func: impl FnOnce(&R)) -> Self
fn tap_ref<R>(self, func: impl FnOnce(&R)) -> Self
AsRef<R> view of a value. Read more§fn tap_ref_mut<R>(self, func: impl FnOnce(&mut R)) -> Self
fn tap_ref_mut<R>(self, func: impl FnOnce(&mut R)) -> Self
AsMut<R> view of a value. Read more§fn tap_deref<T>(self, func: impl FnOnce(&T)) -> Self
fn tap_deref<T>(self, func: impl FnOnce(&T)) -> Self
Deref::Target of a value. Read more§fn tap_deref_mut<T>(self, func: impl FnOnce(&mut T)) -> Self
fn tap_deref_mut<T>(self, func: impl FnOnce(&mut T)) -> Self
Deref::Target of a value. Read more§fn tap_dbg(self, func: impl FnOnce(&Self)) -> Self
fn tap_dbg(self, func: impl FnOnce(&Self)) -> Self
.tap() only in debug builds, and is erased in release builds.§fn tap_mut_dbg(self, func: impl FnOnce(&mut Self)) -> Self
fn tap_mut_dbg(self, func: impl FnOnce(&mut Self)) -> Self
.tap_mut() only in debug builds, and is erased in release
builds.§fn tap_borrow_dbg<B>(self, func: impl FnOnce(&B)) -> Self
fn tap_borrow_dbg<B>(self, func: impl FnOnce(&B)) -> Self
.tap_borrow() only in debug builds, and is erased in release
builds.§fn tap_borrow_mut_dbg<B>(self, func: impl FnOnce(&mut B)) -> Self
fn tap_borrow_mut_dbg<B>(self, func: impl FnOnce(&mut B)) -> Self
.tap_borrow_mut() only in debug builds, and is erased in release
builds.§fn tap_ref_dbg<R>(self, func: impl FnOnce(&R)) -> Self
fn tap_ref_dbg<R>(self, func: impl FnOnce(&R)) -> Self
.tap_ref() only in debug builds, and is erased in release
builds.§fn tap_ref_mut_dbg<R>(self, func: impl FnOnce(&mut R)) -> Self
fn tap_ref_mut_dbg<R>(self, func: impl FnOnce(&mut R)) -> Self
.tap_ref_mut() only in debug builds, and is erased in release
builds.§fn tap_deref_dbg<T>(self, func: impl FnOnce(&T)) -> Self
fn tap_deref_dbg<T>(self, func: impl FnOnce(&T)) -> Self
.tap_deref() only in debug builds, and is erased in release
builds.