| Package | Description |
|---|---|
| org.apache.hudi.common.data | |
| org.apache.hudi.common.engine | |
| org.apache.hudi.common.function | |
| Modifier and Type | Method and Description |
|---|---|
| `default <O> HoodieData<T>` | `HoodieData.distinctWithKey(SerializableFunction<T,O> keyGetter, int parallelism)` |
| `<O> HoodieData<T>` | `HoodieListData.distinctWithKey(SerializableFunction<T,O> keyGetter, int parallelism)` |
| `HoodieData<T>` | `HoodieData.filter(SerializableFunction<T,Boolean> filterFunc)`<br>Returns a new `HoodieData` collection containing only the elements that match the provided `filterFunc` (i.e. those for which it returns `true`). |
| `HoodieData<T>` | `HoodieListData.filter(SerializableFunction<T,Boolean> filterFunc)` |
| `<O> HoodieData<O>` | `HoodieData.flatMap(SerializableFunction<T,Iterator<O>> func)`<br>Maps every element in the collection into a collection of new elements using the provided mapping `func`, then flattens the results (by concatenation) into a single collection. This is an intermediate operation. |
| `<O> HoodieData<O>` | `HoodieListData.flatMap(SerializableFunction<T,Iterator<O>> func)` |
| `<K,V> HoodiePairData<K,V>` | `HoodieData.flatMapToPair(SerializableFunction<T,Iterator<? extends Pair<K,V>>> func)`<br>Maps every element in the collection into a collection of `Pair`s of new elements using the provided mapping `func`, then flattens the results (by concatenation) into a single collection. NOTE: this operation converts the container from `HoodieData` to `HoodiePairData`. This is an intermediate operation. |
| `<K,V> HoodiePairData<K,V>` | `HoodieListData.flatMapToPair(SerializableFunction<T,Iterator<? extends Pair<K,V>>> func)` |
| `<W> HoodiePairData<K,W>` | `HoodieListPairData.flatMapValues(SerializableFunction<V,Iterator<W>> func)` |
| `<O> HoodieData<O>` | `HoodieListPairData.map(SerializableFunction<Pair<K,V>,O> func)` |
| `<O> HoodieData<O>` | `HoodiePairData.map(SerializableFunction<Pair<K,V>,O> func)`<br>Maps the key-value pairs of this `HoodiePairData` container using the provided mapper. NOTE: this returns `HoodieData`, not `HoodiePairData`. |
| `<O> HoodieData<O>` | `HoodieData.map(SerializableFunction<T,O> func)`<br>Maps every element in the collection using the provided mapping `func`. |
| `<O> HoodieData<O>` | `HoodieListData.map(SerializableFunction<T,O> func)` |
| `<O> HoodieData<O>` | `HoodieData.mapPartitions(SerializableFunction<Iterator<T>,Iterator<O>> func, boolean preservesPartitioning)`<br>Applies the provided mapping `func` to each partition of the collection (where partitioning applies), mapping the elements of that partition. This is an intermediate operation. |
| `<O> HoodieData<O>` | `HoodieListData.mapPartitions(SerializableFunction<Iterator<T>,Iterator<O>> func, boolean preservesPartitioning)` |
| `<W> HoodiePairData<K,W>` | `HoodieListPairData.mapValues(SerializableFunction<V,W> func)` |
| `<W> HoodiePairData<K,W>` | `HoodiePairData.mapValues(SerializableFunction<V,W> func)`<br>Maps the values of this `HoodiePairData` container using the provided mapper. |
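
The `filter`, `flatMap`, and `map` contracts described in the table above can be illustrated with a small list-backed sketch. Everything below is a stand-in written for this example: `ListDataSketch`, `ListData`, and the locally declared `SerializableFunction` are hypothetical and only assume that the functional interface is serializable and may throw a checked exception. The sketch evaluates eagerly and is not the Hudi implementation.

```java
import java.io.Serializable;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Illustrative stand-ins only; these are NOT the Hudi classes.
class ListDataSketch {

    // Assumed shape: a serializable function that may throw a checked exception.
    @FunctionalInterface
    interface SerializableFunction<I, O> extends Serializable {
        O apply(I input) throws Exception;
    }

    // Hypothetical list-backed container with eager map/filter/flatMap.
    static final class ListData<T> {
        private final List<T> items;

        ListData(List<T> items) {
            this.items = new ArrayList<>(items);
        }

        // Keeps only the elements for which filterFunc returns true.
        ListData<T> filter(SerializableFunction<T, Boolean> filterFunc) {
            List<T> out = new ArrayList<>();
            for (T item : items) {
                if (call(filterFunc, item)) {
                    out.add(item);
                }
            }
            return new ListData<>(out);
        }

        // Maps each element to an iterator and concatenates the results.
        <O> ListData<O> flatMap(SerializableFunction<T, Iterator<O>> func) {
            List<O> out = new ArrayList<>();
            for (T item : items) {
                Iterator<O> mapped = call(func, item);
                while (mapped.hasNext()) {
                    out.add(mapped.next());
                }
            }
            return new ListData<>(out);
        }

        // Maps each element one-to-one.
        <O> ListData<O> map(SerializableFunction<T, O> func) {
            List<O> out = new ArrayList<>();
            for (T item : items) {
                out.add(call(func, item));
            }
            return new ListData<>(out);
        }

        List<T> collectAsList() {
            return new ArrayList<>(items);
        }

        // Rethrows checked exceptions as unchecked ones for this sketch.
        private static <I, O> O call(SerializableFunction<I, O> func, I input) {
            try {
                return func.apply(input);
            } catch (Exception e) {
                throw new RuntimeException(e);
            }
        }
    }

    static List<String> demo() {
        ListData<Integer> data = new ListData<>(Arrays.asList(1, 2, 3, 4));
        return data
            .filter(n -> n % 2 == 0)                            // keeps 2 and 4
            .flatMap(n -> Arrays.asList(n, n * 10).iterator())  // 2, 20, 4, 40
            .map(n -> "v" + n)
            .collectAsList();
    }

    public static void main(String[] args) {
        System.out.println(demo()); // prints [v2, v20, v4, v40]
    }
}
```

A real engine-backed container would typically defer these operations, which is why the table calls them intermediate operations; the eager evaluation here is only for readability.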
| Modifier and Type | Method and Description |
|---|---|
| `abstract <I,O> List<O>` | `HoodieEngineContext.flatMap(List<I> data, SerializableFunction<I,Stream<O>> func, int parallelism)` |
| `<I,O> List<O>` | `HoodieLocalEngineContext.flatMap(List<I> data, SerializableFunction<I,Stream<O>> func, int parallelism)` |
| `abstract <I,O> List<O>` | `HoodieEngineContext.map(List<I> data, SerializableFunction<I,O> func, int parallelism)` |
| `<I,O> List<O>` | `HoodieLocalEngineContext.map(List<I> data, SerializableFunction<I,O> func, int parallelism)` |
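
To make the `map` and `flatMap` signatures above more concrete, here is a minimal sketch of how a local, in-process context could satisfy them with Java parallel streams. The class `LocalContextSketch`, the helper `unchecked`, and the locally declared `SerializableFunction` are hypothetical names for this example, and ignoring the `parallelism` hint is an assumption of the sketch rather than documented behaviour.

```java
import java.io.Serializable;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Illustrative stand-in; this is NOT HoodieLocalEngineContext.
class LocalContextSketch {

    // Assumed shape: a serializable function that may throw a checked exception.
    @FunctionalInterface
    interface SerializableFunction<I, O> extends Serializable {
        O apply(I input) throws Exception;
    }

    // Adapts the throwing function to java.util.function.Function.
    private static <I, O> Function<I, O> unchecked(SerializableFunction<I, O> func) {
        return input -> {
            try {
                return func.apply(input);
            } catch (Exception e) {
                throw new RuntimeException(e);
            }
        };
    }

    // The parallelism hint is accepted for signature parity but unused in this sketch.
    static <I, O> List<O> map(List<I> data, SerializableFunction<I, O> func, int parallelism) {
        return data.stream().parallel().map(unchecked(func)).collect(Collectors.toList());
    }

    // Each input yields a Stream; the streams are concatenated in encounter order.
    static <I, O> List<O> flatMap(List<I> data, SerializableFunction<I, Stream<O>> func, int parallelism) {
        return data.stream().parallel().flatMap(unchecked(func)).collect(Collectors.toList());
    }

    static List<Integer> lengths() {
        return map(Arrays.asList("a", "bb", "ccc"), String::length, 2);
    }

    static List<String> words() {
        return flatMap(Arrays.asList("a b", "c"), line -> Arrays.stream(line.split(" ")), 2);
    }

    public static void main(String[] args) {
        System.out.println(lengths()); // prints [1, 2, 3]
        System.out.println(words());   // prints [a, b, c]
    }
}
```

Collecting an ordered parallel stream into a list preserves encounter order, so the results line up with the input list even though elements may be processed concurrently.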
| Modifier and Type | Method and Description |
|---|---|
| `static <I,O> Function<I,Stream<O>>` | `FunctionWrapper.throwingFlatMapWrapper(SerializableFunction<I,Stream<O>> throwingFlatMapFunction)` |
| `static <I,O> Function<I,O>` | `FunctionWrapper.throwingMapWrapper(SerializableFunction<I,O> throwingMapFunction)` |
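
The wrapper signatures above suggest an adapter from a function that may throw a checked exception to a plain `java.util.function.Function`. The sketch below shows one way such an adapter could behave; `ThrowingWrapperSketch` and the local `SerializableFunction` are hypothetical, and the use of `RuntimeException` as the wrapping type is an assumption, since this page does not say which exception the library throws.

```java
import java.io.IOException;
import java.io.Serializable;
import java.util.function.Function;

// Illustrative stand-in; this is NOT the Hudi FunctionWrapper.
class ThrowingWrapperSketch {

    // Assumed shape: a serializable function that may throw a checked exception.
    @FunctionalInterface
    interface SerializableFunction<I, O> extends Serializable {
        O apply(I input) throws Exception;
    }

    static <I, O> Function<I, O> throwingMapWrapper(SerializableFunction<I, O> throwingMapFunction) {
        return input -> {
            try {
                return throwingMapFunction.apply(input);
            } catch (Exception e) {
                // Assumption: RuntimeException stands in for the library's own exception type.
                throw new RuntimeException("Error while executing map function", e);
            }
        };
    }

    // Runs a wrapped function on a failing input and reports the original cause.
    static String describeFailure() {
        Function<String, Integer> parse = throwingMapWrapper(s -> {
            if (s.isEmpty()) {
                throw new IOException("empty input");
            }
            return Integer.parseInt(s);
        });
        try {
            parse.apply("");
            return "no failure";
        } catch (RuntimeException e) {
            Throwable cause = e.getCause();
            return cause.getClass().getSimpleName() + ": " + cause.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(describeFailure()); // prints IOException: empty input
    }
}
```

Keeping the original exception as the cause means callers of the stream pipeline can still inspect what went wrong inside the mapping function.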
Copyright © 2024 The Apache Software Foundation. All rights reserved.