Software
Placeholder

tfdatasets

R interface to TensorFlow Datasets API

R

The tfdatasets package provides an R interface to TensorFlow’s Dataset API for creating scalable input pipelines that feed data into TensorFlow and Keras models.

This package enables efficient data handling for machine learning workflows through streaming interfaces that can process arbitrarily large datasets without loading everything into memory. It supports multiple data formats including CSV and TFRecords, allows transformations like mapping, shuffling, and batching, and executes these operations as TensorFlow graph operations in C++ for parallel processing alongside model training. This makes it particularly valuable for building production-scale training pipelines where data preprocessing needs to be fast and memory-efficient.

Contributors