T - The type of the events/records produced by this source.SplitT - The subclass type of the FileSourceSplit used by the source implementation.@PublicEvolving public abstract class AbstractFileSource<T,SplitT extends FileSourceSplit> extends Object implements org.apache.flink.api.connector.source.Source<T,SplitT,PendingSplitsCheckpoint<SplitT>>, org.apache.flink.api.java.typeutils.ResultTypeQueryable<T>
FileSource, which
also has the majority of the documentation.
To read new formats, one commonly does NOT need to extend this class, but should implement a
new Format Reader (like StreamFormat, BulkFormat and use it with the FileSource.
The only reason to extend this class is when a source needs a different type of split,
meaning an extension of the FileSourceSplit to carry additional information.
| Modifier and Type | Class and Description |
|---|---|
protected static class |
AbstractFileSource.AbstractFileSourceBuilder<T,SplitT extends FileSourceSplit,SELF extends AbstractFileSource.AbstractFileSourceBuilder<T,SplitT,SELF>>
The generic base builder.
|
| Modifier | Constructor and Description |
|---|---|
protected |
AbstractFileSource(org.apache.flink.core.fs.Path[] inputPaths,
FileEnumerator.Provider fileEnumerator,
FileSplitAssigner.Provider splitAssigner,
BulkFormat<T,SplitT> readerFormat,
ContinuousEnumerationSettings continuousEnumerationSettings) |
| Modifier and Type | Method and Description |
|---|---|
org.apache.flink.api.connector.source.SplitEnumerator<SplitT,PendingSplitsCheckpoint<SplitT>> |
createEnumerator(org.apache.flink.api.connector.source.SplitEnumeratorContext<SplitT> enumContext) |
org.apache.flink.api.connector.source.SourceReader<T,SplitT> |
createReader(org.apache.flink.api.connector.source.SourceReaderContext readerContext) |
FileSplitAssigner.Provider |
getAssignerFactory() |
org.apache.flink.api.connector.source.Boundedness |
getBoundedness() |
ContinuousEnumerationSettings |
getContinuousEnumerationSettings() |
org.apache.flink.core.io.SimpleVersionedSerializer<PendingSplitsCheckpoint<SplitT>> |
getEnumeratorCheckpointSerializer() |
protected FileEnumerator.Provider |
getEnumeratorFactory() |
org.apache.flink.api.common.typeinfo.TypeInformation<T> |
getProducedType() |
abstract org.apache.flink.core.io.SimpleVersionedSerializer<SplitT> |
getSplitSerializer() |
org.apache.flink.api.connector.source.SplitEnumerator<SplitT,PendingSplitsCheckpoint<SplitT>> |
restoreEnumerator(org.apache.flink.api.connector.source.SplitEnumeratorContext<SplitT> enumContext,
PendingSplitsCheckpoint<SplitT> checkpoint) |
protected AbstractFileSource(org.apache.flink.core.fs.Path[] inputPaths,
FileEnumerator.Provider fileEnumerator,
FileSplitAssigner.Provider splitAssigner,
BulkFormat<T,SplitT> readerFormat,
@Nullable
ContinuousEnumerationSettings continuousEnumerationSettings)
protected FileEnumerator.Provider getEnumeratorFactory()
public FileSplitAssigner.Provider getAssignerFactory()
@Nullable public ContinuousEnumerationSettings getContinuousEnumerationSettings()
public org.apache.flink.api.connector.source.Boundedness getBoundedness()
getBoundedness in interface org.apache.flink.api.connector.source.Source<T,SplitT extends FileSourceSplit,PendingSplitsCheckpoint<SplitT extends FileSourceSplit>>public org.apache.flink.api.connector.source.SourceReader<T,SplitT> createReader(org.apache.flink.api.connector.source.SourceReaderContext readerContext)
createReader in interface org.apache.flink.api.connector.source.SourceReaderFactory<T,SplitT extends FileSourceSplit>public org.apache.flink.api.connector.source.SplitEnumerator<SplitT,PendingSplitsCheckpoint<SplitT>> createEnumerator(org.apache.flink.api.connector.source.SplitEnumeratorContext<SplitT> enumContext)
createEnumerator in interface org.apache.flink.api.connector.source.Source<T,SplitT extends FileSourceSplit,PendingSplitsCheckpoint<SplitT extends FileSourceSplit>>public org.apache.flink.api.connector.source.SplitEnumerator<SplitT,PendingSplitsCheckpoint<SplitT>> restoreEnumerator(org.apache.flink.api.connector.source.SplitEnumeratorContext<SplitT> enumContext, PendingSplitsCheckpoint<SplitT> checkpoint)
restoreEnumerator in interface org.apache.flink.api.connector.source.Source<T,SplitT extends FileSourceSplit,PendingSplitsCheckpoint<SplitT extends FileSourceSplit>>public abstract org.apache.flink.core.io.SimpleVersionedSerializer<SplitT> getSplitSerializer()
getSplitSerializer in interface org.apache.flink.api.connector.source.Source<T,SplitT extends FileSourceSplit,PendingSplitsCheckpoint<SplitT extends FileSourceSplit>>public org.apache.flink.core.io.SimpleVersionedSerializer<PendingSplitsCheckpoint<SplitT>> getEnumeratorCheckpointSerializer()
getEnumeratorCheckpointSerializer in interface org.apache.flink.api.connector.source.Source<T,SplitT extends FileSourceSplit,PendingSplitsCheckpoint<SplitT extends FileSourceSplit>>Copyright © 2014–2025 The Apache Software Foundation. All rights reserved.