@PublicEvolving public class NonSplittingRecursiveEnumerator extends Object implements FileEnumerator
FileEnumerator enumerates all files under the given paths recursively. Each file
becomes one split; this enumerator does not split files into smaller "block" units.
The default instantiation of this enumerator filters files with the common hidden file prefixes '.' and '_'. A custom file filter can be specified.
FileEnumerator.Provider| Modifier and Type | Field and Description |
|---|---|
protected Predicate<org.apache.flink.core.fs.Path> |
fileFilter
The filter predicate to filter out unwanted files.
|
| Constructor and Description |
|---|
NonSplittingRecursiveEnumerator()
Creates a NonSplittingRecursiveEnumerator that enumerates all files except hidden files.
|
NonSplittingRecursiveEnumerator(Predicate<org.apache.flink.core.fs.Path> fileFilter)
Creates a NonSplittingRecursiveEnumerator that uses the given predicate as a filter for file
paths.
|
| Modifier and Type | Method and Description |
|---|---|
protected void |
addSplitsForPath(org.apache.flink.core.fs.FileStatus fileStatus,
org.apache.flink.core.fs.FileSystem fs,
ArrayList<FileSourceSplit> target) |
protected void |
convertToSourceSplits(org.apache.flink.core.fs.FileStatus file,
org.apache.flink.core.fs.FileSystem fs,
List<FileSourceSplit> target) |
Collection<FileSourceSplit> |
enumerateSplits(org.apache.flink.core.fs.Path[] paths,
int minDesiredSplits)
Generates all file splits for the relevant files under the given paths.
|
protected String |
getNextId() |
protected final Predicate<org.apache.flink.core.fs.Path> fileFilter
public NonSplittingRecursiveEnumerator()
public NonSplittingRecursiveEnumerator(Predicate<org.apache.flink.core.fs.Path> fileFilter)
public Collection<FileSourceSplit> enumerateSplits(org.apache.flink.core.fs.Path[] paths, int minDesiredSplits) throws IOException
FileEnumeratorminDesiredSplits is an optional hint indicating how many splits would be necessary to
exploit parallelism properly.enumerateSplits in interface FileEnumeratorIOExceptionprotected void addSplitsForPath(org.apache.flink.core.fs.FileStatus fileStatus,
org.apache.flink.core.fs.FileSystem fs,
ArrayList<FileSourceSplit> target)
throws IOException
IOExceptionprotected void convertToSourceSplits(org.apache.flink.core.fs.FileStatus file,
org.apache.flink.core.fs.FileSystem fs,
List<FileSourceSplit> target)
throws IOException
IOExceptionprotected final String getNextId()
Copyright © 2014–2025 The Apache Software Foundation. All rights reserved.