← 返回首页
fix: Fix SparkRetrievalJob.persist() failing for SparkSource by ntkathole · Pull Request #6410 · feast-dev/feast · GitHub
Skip to content

Navigation Menu

Toggle navigation
Sign in
Appearance settings
Search or jump to...

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Appearance settings
Resetting focus

fix: Fix SparkRetrievalJob.persist() failing for SparkSource#6410

Open
ntkathole wants to merge 1 commit into
feast-dev:masterfrom
ntkathole:fix_6261
Open

fix: Fix SparkRetrievalJob.persist() failing for SparkSource#6410
ntkathole wants to merge 1 commit into
feast-dev:masterfrom
ntkathole:fix_6261

Conversation

Copy link
Copy Markdown
Member

What this PR does / why we need it:

Fixes #6261

SparkRetrievalJob.persist() failed in two scenarios:

  1. Remote offline store path: When using type: remote in feature_store.yaml pointing to a Spark offline server, the server calls SavedDatasetStorage.from_data_source(data_source) to convert the registered SparkSource into storage. This raised ValueError because SparkSource was not registered in the _DATA_SOURCE_TO_SAVED_DATASET_STORAGE mapping, and SavedDatasetSparkStorage lacked a from_data_source() method.

  2. Path-based SparkSource: When using a path-based SparkSource (e.g., S3 with parquet), persist() required a table name and raised ValueError if one wasn't provided, even though the storage had a valid path configured.

ntkathole self-assigned this May 16, 2026
ntkathole requested a review from a team as a code owner May 16, 2026 15:58
Signed-off-by: ntkathole <nikhilkathole2683@gmail.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters. Learn more about bidirectional Unicode characters
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

SparkRetrievalJob.persist() fails due to missing SparkSource mapping in SavedDatasetStorage.from_data_source

1 participant

Footer

© 2026 GitHub, Inc.