[SPARK-49409] CONNECT_SESSION_PLAN_CACHE_SIZE is too small for certain programming patterns - ASF JIRA

XML

Word

Printable

JSON

Example:

```

df_1 = df_a.filter(col('X').isNotNull())

df_2 = df_b.filter(col('SAFE_SU_Conv').isNotNull())

....

df_x = ...

for _ in range(0, 5):

df_x = df_x.select(...)

...

df_3 = df_1.join(df_2, ...)

```

=> df_x completely invalidates all the cached entries.

links to

GitHub Pull Request #47937