Class AbstractAsyncStateStreamOperatorV2<OUT>
- java.lang.Object
-
- org.apache.flink.streaming.api.operators.AbstractStreamOperatorV2<OUT>
-
- org.apache.flink.runtime.asyncprocessing.operators.AbstractAsyncStateStreamOperatorV2<OUT>
-
- All Implemented Interfaces:
Serializable,org.apache.flink.api.common.state.CheckpointListener,KeyContext,StreamOperator<OUT>,StreamOperatorStateHandler.CheckpointedStreamOperator,AsyncKeyOrderedProcessing,AsyncKeyOrderedProcessingOperator
@Internal public abstract class AbstractAsyncStateStreamOperatorV2<OUT> extends AbstractStreamOperatorV2<OUT> implements AsyncKeyOrderedProcessingOperator
This operator is an abstract class that give theAbstractStreamOperatorV2the ability to performAsyncKeyOrderedProcessing. The aim is to make any subclass ofAbstractStreamOperatorV2could manipulate async state with only a change of base class.- See Also:
- Serialized Form
-
-
Field Summary
Fields Modifier and Type Field Description protected DeclarationManagerdeclarationManager-
Fields inherited from class org.apache.flink.streaming.api.operators.AbstractStreamOperatorV2
combinedWatermark, config, lastRecordAttributes, latencyStats, metrics, output, processingTimeService, stateHandler, timeServiceManager
-
-
Constructor Summary
Constructors Constructor Description AbstractAsyncStateStreamOperatorV2(StreamOperatorParameters<OUT> parameters, int numberOfInputs)
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description <K> voidasyncProcessWithKey(K key, org.apache.flink.util.function.ThrowingRunnable<Exception> processing)Asynchronously process a code with a key provided.voidbeforeInitializeStateHandler()Initialize necessary state components forAbstractStreamOperatorV2.voidclose()This method is called at the very end of the operator's life, both in the case of a successful completion of the operation, and in the case of a failure and canceling.voidfinish()This method is called at the end of data processing.ObjectgetCurrentKey()RecordContextgetCurrentProcessingContext()DeclarationManagergetDeclarationManager()Get the declaration manager for user-logic declaring.ElementOrdergetElementOrder()Get theElementOrderof this operator.<K,N>
InternalTimerService<N>getInternalTimerService(String name, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, Triggerable<K,N> triggerable)Returns aInternalTimerServicethat can be used to query current processing time and event time and to set timers.protected <N,S extends org.apache.flink.api.common.state.v2.State,T>
SgetOrCreateKeyedState(N defaultNamespace, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, org.apache.flink.api.common.state.v2.StateDescriptor<T> stateDescriptor)Create new state (v2) based on new state descriptor.<T> org.apache.flink.util.function.ThrowingConsumer<StreamRecord<T>,Exception>getRecordProcessor(int inputId)Get the record processor that could process record from input, which is the only entry for async processing.StateExecutionController<?>getStateExecutionController()protected <T> voidinternalSetKeyContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T,?> selector)booleanisAsyncKeyOrderedProcessingEnabled()Indicates whether this operator is enabling the async state.voidnewKeySelected(Object newKey)A hook that will be invoked after a new key is selected.voidpostProcessElement()A callback that will be triggered after an element finishesprocessElement.voidpostProcessWatermark(Watermark watermark)A hook that will be invoked after finishing advancing the watermark.voidprepareSnapshotPreBarrier(long checkpointId)This method is called when the operator should do a snapshot, before it emits its own checkpoint barrier.WatermarkpreProcessWatermark(Watermark watermark)A hook that will be triggered when receiving a watermark.voidpreserveRecordOrderAndProcess(org.apache.flink.util.function.ThrowingRunnable<Exception> processing)Check the order of same-key record, and then process the record.protected voidprocessNonRecord(org.apache.flink.util.function.ThrowingRunnable<? extends Exception> triggerAction, org.apache.flink.util.function.ThrowingRunnable<? extends Exception> finalAction)Process a non-record event.voidprocessRecordAttributes(RecordAttributes recordAttributes, int inputId)voidprocessWatermark(Watermark mark)Process a watermark when receiving it.voidprocessWatermarkStatus(WatermarkStatus watermarkStatus, int inputId)protected voidreportOrForwardLatencyMarker(LatencyMarker marker)<T> voidsetAsyncKeyedContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T,?> keySelector)Set key context for async state processing.-
Methods inherited from class org.apache.flink.streaming.api.operators.AbstractStreamOperatorV2
getExecutionConfig, getKeyedStateBackend, getKeyedStateStore, getMetricGroup, getOperatorConfig, getOperatorID, getOperatorName, getOperatorStateBackend, getOrCreateKeyedState, getPartitionedState, getPartitionedState, getProcessingTimeService, getRuntimeContext, getTimeServiceManager, getUserCodeClassloader, initializeState, initializeState, isUsingCustomRawKeyedState, notifyCheckpointAborted, notifyCheckpointComplete, open, reportWatermark, setCurrentKey, setKeyContextElement1, setKeyContextElement2, snapshotState, snapshotState, useSplittableTimers
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.flink.streaming.api.operators.StreamOperator
getOperatorAttributes
-
-
-
-
Field Detail
-
declarationManager
protected DeclarationManager declarationManager
-
-
Constructor Detail
-
AbstractAsyncStateStreamOperatorV2
public AbstractAsyncStateStreamOperatorV2(StreamOperatorParameters<OUT> parameters, int numberOfInputs)
-
-
Method Detail
-
beforeInitializeStateHandler
public final void beforeInitializeStateHandler()
Initialize necessary state components forAbstractStreamOperatorV2.- Overrides:
beforeInitializeStateHandlerin classAbstractStreamOperatorV2<OUT>
-
isAsyncKeyOrderedProcessingEnabled
public boolean isAsyncKeyOrderedProcessingEnabled()
Description copied from class:AbstractStreamOperatorV2Indicates whether this operator is enabling the async state. Can be overridden by subclasses.- Specified by:
isAsyncKeyOrderedProcessingEnabledin interfaceAsyncKeyOrderedProcessing- Overrides:
isAsyncKeyOrderedProcessingEnabledin classAbstractStreamOperatorV2<OUT>- Returns:
- ture if async state processing is enabled.
-
getElementOrder
public ElementOrder getElementOrder()
Description copied from interface:AsyncKeyOrderedProcessingOperatorGet theElementOrderof this operator.- Specified by:
getElementOrderin interfaceAsyncKeyOrderedProcessingOperator
-
setAsyncKeyedContextElement
public final <T> void setAsyncKeyedContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T,?> keySelector) throws Exception
Description copied from interface:AsyncKeyOrderedProcessingOperatorSet key context for async state processing.- Specified by:
setAsyncKeyedContextElementin interfaceAsyncKeyOrderedProcessingOperator- Type Parameters:
T- the type of the record.- Parameters:
record- the record.keySelector- the key selector to select a key from record.- Throws:
Exception
-
newKeySelected
public void newKeySelected(Object newKey)
A hook that will be invoked after a new key is selected. It is not recommended to perform async state here. Only some synchronous logic is suggested.- Parameters:
newKey- the new key selected.
-
internalSetKeyContextElement
protected <T> void internalSetKeyContextElement(StreamRecord<T> record, org.apache.flink.api.java.functions.KeySelector<T,?> selector) throws Exception
- Overrides:
internalSetKeyContextElementin classAbstractStreamOperatorV2<OUT>- Throws:
Exception
-
getCurrentKey
public Object getCurrentKey()
- Specified by:
getCurrentKeyin interfaceKeyContext- Overrides:
getCurrentKeyin classAbstractStreamOperatorV2<OUT>
-
postProcessElement
public final void postProcessElement()
Description copied from interface:AsyncKeyOrderedProcessingOperatorA callback that will be triggered after an element finishesprocessElement.- Specified by:
postProcessElementin interfaceAsyncKeyOrderedProcessingOperator
-
preserveRecordOrderAndProcess
public final void preserveRecordOrderAndProcess(org.apache.flink.util.function.ThrowingRunnable<Exception> processing)
Description copied from interface:AsyncKeyOrderedProcessingOperatorCheck the order of same-key record, and then process the record. Mainly used when theAsyncKeyOrderedProcessingOperator.getElementOrder()returnsElementOrder.RECORD_ORDER.- Specified by:
preserveRecordOrderAndProcessin interfaceAsyncKeyOrderedProcessingOperator- Parameters:
processing- the record processing logic.
-
asyncProcessWithKey
public <K> void asyncProcessWithKey(K key, org.apache.flink.util.function.ThrowingRunnable<Exception> processing)Description copied from interface:AsyncKeyOrderedProcessingOperatorAsynchronously process a code with a key provided.- Specified by:
asyncProcessWithKeyin interfaceAsyncKeyOrderedProcessingOperator- Type Parameters:
K- the type of key.- Parameters:
key- the specified key.processing- the process logic.
-
getDeclarationManager
public final DeclarationManager getDeclarationManager()
Description copied from interface:AsyncKeyOrderedProcessingOperatorGet the declaration manager for user-logic declaring.- Specified by:
getDeclarationManagerin interfaceAsyncKeyOrderedProcessingOperator
-
getRecordProcessor
public final <T> org.apache.flink.util.function.ThrowingConsumer<StreamRecord<T>,Exception> getRecordProcessor(int inputId)
Description copied from interface:AsyncKeyOrderedProcessingGet the record processor that could process record from input, which is the only entry for async processing.- Specified by:
getRecordProcessorin interfaceAsyncKeyOrderedProcessing- Parameters:
inputId- the input identifier, start from 1. Borrow the design fromorg.apache.flink.streaming.api.operators.AbstractInput#inputId. This is only relevant if there is multiple inputs for the instance.
-
processNonRecord
protected void processNonRecord(@Nullable org.apache.flink.util.function.ThrowingRunnable<? extends Exception> triggerAction, @Nullable org.apache.flink.util.function.ThrowingRunnable<? extends Exception> finalAction)Process a non-record event. This method is used to process events that are not related to records, such as watermarks or latency markers. It is used to ensure that the async state processing is performed in the correct order. Subclasses could override this method to inject some async state processing logic.- Parameters:
triggerAction- the action that will be performed when the event is triggered.finalAction- the action that will be performed when the event is finished considering the epoch control.
-
getOrCreateKeyedState
protected <N,S extends org.apache.flink.api.common.state.v2.State,T> S getOrCreateKeyedState(@Nonnull N defaultNamespace, @Nonnull org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, @Nonnull org.apache.flink.api.common.state.v2.StateDescriptor<T> stateDescriptor) throws ExceptionCreate new state (v2) based on new state descriptor.- Throws:
Exception
-
prepareSnapshotPreBarrier
public void prepareSnapshotPreBarrier(long checkpointId) throws ExceptionDescription copied from interface:StreamOperatorThis method is called when the operator should do a snapshot, before it emits its own checkpoint barrier.This method is intended not for any actual state persistence, but only for emitting some data before emitting the checkpoint barrier. Operators that maintain some small transient state that is inefficient to checkpoint (especially when it would need to be checkpointed in a re-scalable way) but can simply be sent downstream before the checkpoint. An example are opportunistic pre-aggregation operators, which have small the pre-aggregation state that is frequently flushed downstream.
Important: This method should not be used for any actual state snapshot logic, because it will inherently be within the synchronous part of the operator's checkpoint. If heavy work is done within this method, it will affect latency and downstream checkpoint alignments.
- Specified by:
prepareSnapshotPreBarrierin interfaceStreamOperator<OUT>- Overrides:
prepareSnapshotPreBarrierin classAbstractStreamOperatorV2<OUT>- Parameters:
checkpointId- The ID of the checkpoint.- Throws:
Exception- Throwing an exception here causes the operator to fail and go into recovery.
-
getInternalTimerService
public <K,N> InternalTimerService<N> getInternalTimerService(String name, org.apache.flink.api.common.typeutils.TypeSerializer<N> namespaceSerializer, Triggerable<K,N> triggerable)
Description copied from class:AbstractStreamOperatorV2Returns aInternalTimerServicethat can be used to query current processing time and event time and to set timers. An operator can have several timer services, where each has its own namespace serializer. Timer services are differentiated by the string key that is given when requesting them, if you call this method with the same key multiple times you will get the same timer service instance in subsequent requests.Timers are always scoped to a key, the currently active key of a keyed stream operation. When a timer fires, this key will also be set as the currently active key.
Each timer has attached metadata, the namespace. Different timer services can have a different namespace type. If you don't need namespace differentiation you can use
VoidNamespaceSerializeras the namespace serializer.- Overrides:
getInternalTimerServicein classAbstractStreamOperatorV2<OUT>N- The type of the timer namespace.- Parameters:
name- The name of the requested timer service. If no service exists under the given name a new one will be created and returned.namespaceSerializer-TypeSerializerfor the timer namespace.triggerable- TheTriggerablethat should be invoked when timers fire
-
reportOrForwardLatencyMarker
protected void reportOrForwardLatencyMarker(LatencyMarker marker)
- Overrides:
reportOrForwardLatencyMarkerin classAbstractStreamOperatorV2<OUT>
-
preProcessWatermark
public Watermark preProcessWatermark(Watermark watermark) throws Exception
A hook that will be triggered when receiving a watermark. Some async state can safely go within this method. Return the watermark that should be normally processed.- Parameters:
watermark- the receiving watermark.- Returns:
- the watermark that should be processed. Null if there is no need for following processing.
- Throws:
Exception
-
postProcessWatermark
public void postProcessWatermark(Watermark watermark) throws Exception
A hook that will be invoked after finishing advancing the watermark. It is not recommended to perform async state here. Only some synchronous logic is suggested.- Parameters:
watermark- the advanced watermark.- Throws:
Exception
-
processWatermark
public final void processWatermark(Watermark mark) throws Exception
Process a watermark when receiving it. Do not override this method since the async processing is difficult to write. Please override the hooks, seepreProcessWatermark(Watermark)andpostProcessWatermark(Watermark). The basic logic of processWatermark with hooks in sync form would be:Watermark watermark = preProcessWatermark(mark); if (watermark != null) { super.processWatermark(watermark); postProcessWatermark(watermark); }- Overrides:
processWatermarkin classAbstractStreamOperatorV2<OUT>- Throws:
Exception
-
processWatermarkStatus
public void processWatermarkStatus(WatermarkStatus watermarkStatus, int inputId) throws Exception
- Overrides:
processWatermarkStatusin classAbstractStreamOperatorV2<OUT>- Throws:
Exception
-
processRecordAttributes
public void processRecordAttributes(RecordAttributes recordAttributes, int inputId) throws Exception
- Overrides:
processRecordAttributesin classAbstractStreamOperatorV2<OUT>- Throws:
Exception
-
getStateExecutionController
@VisibleForTesting public StateExecutionController<?> getStateExecutionController()
-
getCurrentProcessingContext
@VisibleForTesting public RecordContext getCurrentProcessingContext()
-
finish
public void finish() throws ExceptionDescription copied from interface:StreamOperatorThis method is called at the end of data processing.The method is expected to flush all remaining buffered data. Exceptions during this flushing of buffered data should be propagated, in order to cause the operation to be recognized as failed, because the last data items are not processed properly.
After this method is called, no more records can be produced for the downstream operators.
WARNING: It is not safe to use this method to commit any transactions or other side effects! You can use this method to flush any buffered data that can later on be committed e.g. in a
CheckpointListener.notifyCheckpointComplete(long).NOTE:This method does not need to close any resources. You should release external resources in the
StreamOperator.close()method.- Specified by:
finishin interfaceStreamOperator<OUT>- Overrides:
finishin classAbstractStreamOperatorV2<OUT>- Throws:
Exception- An exception in this method causes the operator to fail.
-
close
public void close() throws ExceptionDescription copied from interface:StreamOperatorThis method is called at the very end of the operator's life, both in the case of a successful completion of the operation, and in the case of a failure and canceling.This method is expected to make a thorough effort to release all resources that the operator has acquired.
NOTE:It can not emit any records! If you need to emit records at the end of processing, do so in the
StreamOperator.finish()method.- Specified by:
closein interfaceStreamOperator<OUT>- Overrides:
closein classAbstractStreamOperatorV2<OUT>- Throws:
Exception
-
-