Welcome to Xpark!#

Xpark is a multimodal AI data processing platform designed to streamline and optimize data workflows for AI applications. It provides comprehensive capabilities for data handling, transformation, and seamless integration with AI workflows.

Processing Multimodal Data with Xpark#

from xpark.dataset import TextEmbedding, from_items
from xpark.dataset.expressions import col

ds = from_items(
   [
      "what is the advantage of using the GPU rendering options in Android?",
      "Blank video when converting uncompressed AVI files with ffmpeg",
   ]
)
ds = ds.with_column(
   "embedding",
   TextEmbedding(
      # Local embedding model.
      "Qwen/Qwen3-Embedding-0.6B",
   )
   .options(num_workers={{"CPU": 1}})
   .with_column(col("item")),
)

output = ds.take_all()

Next Steps#