fix: FeatureView serialization with cycle detection#5502

HaoXuAI

What this PR does / why we need it:

Follow up on PR: #5482.

Which issue(s) this PR fixes:

Misc

Copilot

Pull Request Overview

Adds support for on-demand feature sources, enforces cycle detection in FeatureView (de)serialization, standardizes the topological sort API, and refreshes related documentation.

Introduce OnDemandSourceType alias and refactor sources signature in OnDemandFeatureView.
Rename all topo_sort functions/methods to topological_sort and update callers.
Enhance FeatureView.to_proto/from_proto with cycle detection and update __copy__/__eq__.
Add DAG module README and update compute-engine reference docs.

Reviewed Changes

Copilot reviewed 8 out of 9 changed files in this pull request and generated 1 comment.

Show a summary per file File Description

sdk/python/feast/on_demand_feature_view.py	Add OnDemandSourceType alias and clean up sources hints
sdk/python/feast/infra/compute_engines/feature_resolver.py	Rename resolver method from topo_sort to topological_sort
sdk/python/feast/infra/compute_engines/feature_builder.py	Update builder calls to topological_sort
sdk/python/feast/infra/compute_engines/algorithms/topo.py	Rename functions topo_sort[_multiple] to topological_sort[_multiple]
sdk/python/feast/infra/compute_engines/dag/README.md	Add high-level DAG documentation
sdk/python/feast/feature_view.py	Implement cycle detection in (de)serialization, update copy/eq
docs/reference/compute-engine/README.md	Refresh compute-engine table and links
docs/getting-started/concepts/batch-feature-view.md	New guide for BatchFeatureView

Comments suppressed due to low confidence (1)

docs/reference/compute-engine/README.md:22

The markdown link for ExecutionPlan is nested as [link]([link](...)). It should be formatted as a single link, e.g. [link](URL).

| `ExecutionPlan` | Executes nodes in dependency order and stores intermediate outputs | [link]([link](https://github.com/feast-dev/feast/blob/master/sdk/python/feast/infra/compute_engines/dag/README.md)) |

Somehow this was breaking mypy, so put a fix on this.

is this exclusively limited to PySpark?

not exclusively to Pyspark, will update it

for some reason i thought we agreed on naming it sink.

Yeah, on a second thought, I think sink_source is more explicit for the user to know it is passing a data source to this config.

nit: maybe we should make it avg since summing a conversion rate is weird

missing a word here. ?

franciscojavierarceo

lgtm, had some small suggestions that would be great to incorporate though. thanks for the docs!!!

HaoXuAI added 2 commits July 9, 2025 22:41

update …

cdda0a0

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

update …

a482e1c

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

HaoXuAI requested a review from a team as a code owner July 10, 2025 05:43

HaoXuAI requested a review from Copilot July 10, 2025 05:43

This comment was marked as outdated.

Sign in to view

HaoXuAI added 2 commits July 9, 2025 23:46

fix linting …

d6c2704

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

fix doc …

3962b73

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

HaoXuAI requested a review from Copilot July 10, 2025 06:48

HaoXuAI added the ok-to-test label Jul 10, 2025

HaoXuAI changed the title Compute engine feaeture view serde fix: FeatureView serialization with cycle detection Jul 10, 2025

Copilot AI reviewed Jul 10, 2025

View reviewed changes

Comment thread sdk/python/feast/infra/compute_engines/feature_resolver.py Show resolved Hide resolved

HaoXuAI added 4 commits July 12, 2025 21:29

fix doc …

0237706

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

fix doc …

af000e7

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

fix linting …

20182b7

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

Merge branch 'master' into compute-engine-feaeture-view-serde

ca075bc

HaoXuAI commented Jul 13, 2025

View reviewed changes

franciscojavierarceo reviewed Jul 14, 2025

View reviewed changes

Comment thread docs/getting-started/concepts/batch-feature-view.md Outdated Show resolved Hide resolved

franciscojavierarceo reviewed Jul 14, 2025

View reviewed changes

Comment thread docs/reference/compute-engine/README.md Outdated Show resolved Hide resolved

franciscojavierarceo reviewed Jul 14, 2025

View reviewed changes

Comment thread docs/reference/compute-engine/README.md Outdated Show resolved Hide resolved

franciscojavierarceo approved these changes Jul 14, 2025

View reviewed changes

HaoXuAI and others added 5 commits July 13, 2025 23:52

Update docs/reference/compute-engine/README.md …

2803839

Co-authored-by: Francisco Arceo <arceofrancisco@gmail.com>

Update docs/reference/compute-engine/README.md …

a554718

Co-authored-by: Francisco Arceo <arceofrancisco@gmail.com>

Update docs/getting-started/concepts/batch-feature-view.md …

9919bcd

Co-authored-by: Francisco Arceo <arceofrancisco@gmail.com>

update doc …

4725d1b

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

update doc …

dde5028

Signed-off-by: HaoXuAI <sduxuhao@gmail.com>

franciscojavierarceo mentioned this pull request Jan 14, 2026

Add blog post on new transformation framework #5856

Open

Copilot AI mentioned this pull request Jan 14, 2026

docs: Add blog post on unified transformation framework #5858

Draft

                               for feature in on_demand_feature_view_proto.spec.features
                           ],
-                          sources=sources,
+                          sources=cast(List[OnDemandSourceType], sources),

+                      *,
+                      name: str,
+                      source: Union[DataSource, FeatureView, List[FeatureView]],
+                      sink_source: Optional[DataSource] = None,

+                      Field(name="conv_rate", dtype=Float32),
+                  ],
+                  aggregations=[
+                      Aggregation(column="conv_rate", function="sum", time_window=timedelta(days=1)),

+              ## Feature resolver and builder
+              The `FeatureBuilder` initialize a `FeatureResolver` that extracts a DAG from the `FeatureView` definitions, resolving dependencies and ensuring correct execution order. \
+              The FeatureView represents a logical data source, while DataSource represents the physical data source (e.g., BigQuery, Spark, etc.). \
+              When defines the FeatureView, the source can be a physical DataSource, a derived FeatureView, or a list of FeatureViews.

+              ## ✅ Key Capabilities
+              - **Composable DAG of FeatureViews**: Supports defining a `BatchFeatureView` on top of one or more other `FeatureView`s.
+              - **Transformations**: Apply PySpark-based transformation logic (`feature_transformation` or `udf`) to raw data source, can also be used to deal with multiple data sources.

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: FeatureView serialization with cycle detection#5502