Details

    • Type: Sub-task
    • Status: Closed
    • Priority: Major
    • Resolution: Fixed
    • Affects Version/s: None
    • Fix Version/s: 0.10
    • Component/s: None

    Description

      Right now we use spark-app-selector as the label for the applicationID for Spark pods: https://github.com/apache/incubator-yunikorn-k8shim/blob/master/pkg/common/utils/utils.go#L87

      When linking the Spark pod group to the CRD we use the application ID, but for the CRD we use the namespace-name convention as the ApplicationID, while for the Spark pod group we use spark-app-selector. This results in two different applications internally: one for the CRD and one for the Spark pod group.

      I think we should change this label to something else that we can modify easily without any side effects.
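
      For illustration, a minimal Go sketch of the two conventions described above; the helpers are hypothetical and not the actual utils.go code:

      package example

      import v1 "k8s.io/api/core/v1"

      // Label that Spark sets on driver/executor pods; currently used as the app ID.
      const SparkLabelAppID = "spark-app-selector"

      // appIDFromSparkPod mirrors the current label-based lookup (hypothetical helper).
      func appIDFromSparkPod(pod *v1.Pod) (string, bool) {
          id, ok := pod.Labels[SparkLabelAppID]
          return id, ok
      }

      // appIDFromCRD mirrors the namespace-name convention used for the app CRD
      // (hypothetical helper). The two helpers can return different IDs for the
      // same workload, which is the duplication described above.
      func appIDFromCRD(namespace, name string) string {
          return namespace + "-" + name
      }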

      Attachments

        1. spark-driver.PNG
          78 kB
          huang ting yao

          Activity

            wwei Weiwei Yang added a comment - - edited

            hi kmarton

            Looks like we can solve this by adding an annotation to spark driver/executor pods. There are 2 cases.
            1. for spark jobs submitted by spark-submit
            We need to add the CLI option spark.kubernetes.driver.label.[LabelName], something like spark.kubernetes.driver.label.applicationId (same for the executor). And in our code, we can identify the appID first by that label, then by spark-app-selector.

            2. for spark jobs submitted by spark-k8s-operator
            We can do something like this. I believe options set here will be picked up by spark pods, see the logic: here

            Huang Ting Yao, kmarton Please let me know your thoughts
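
            A rough sketch of the lookup order described in case 1; the label name applicationId comes from the suggestion above, and the helper is illustrative, not existing code:

            package example

            import v1 "k8s.io/api/core/v1"

            const (
                // Label that would be set via spark.kubernetes.driver.label.applicationId
                // (and the matching executor option); the name is the one suggested above.
                LabelApplicationID = "applicationId"
                // Label that Spark itself sets on driver/executor pods.
                SparkLabelAppID = "spark-app-selector"
            )

            // getAppID checks the explicit applicationId label first and only then
            // falls back to the Spark-generated spark-app-selector label.
            func getAppID(pod *v1.Pod) (string, bool) {
                if id, ok := pod.Labels[LabelApplicationID]; ok {
                    return id, true
                }
                if id, ok := pod.Labels[SparkLabelAppID]; ok {
                    return id, true
                }
                return "", false
            }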

            tingyao TingYao Huang added a comment - - edited

            wwei Agree with case 2. The driver pod can already get the application name in the Annotations part (not in labels).

            In our code, we can move the annotation part before the labels part, and add the constant YunikornApplication/Name in constants.go.

            Let it first check whether the pod belongs to a YuniKorn CRD; if the annotation doesn't match, then check the label part.

            And return the appId in namespace + name format.
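
            Roughly, the order of checks could look like this; the annotation key, constant, and helper name are illustrative only:

            package example

            import (
                "fmt"

                v1 "k8s.io/api/core/v1"
            )

            // Illustrative annotation key carrying the app CRD name on driver pods.
            const AnnotationAppName = "yunikorn.apache.org/app-name"

            // getAppID checks the annotation first (pod created for a YuniKorn app CRD),
            // builds the ID in namespace-name format, and only then falls back to labels.
            func getAppID(pod *v1.Pod) (string, bool) {
                if name, ok := pod.Annotations[AnnotationAppName]; ok {
                    return fmt.Sprintf("%s-%s", pod.Namespace, name), true
                }
                if id, ok := pod.Labels["spark-app-selector"]; ok {
                    return id, true
                }
                return "", false
            }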

             

            wwei Weiwei Yang added a comment -

            hi Huang Ting Yao

            OK. Can we use "yunikorn.apache.org/app-id" as the annotation name? We need to comply with the naming convention of K8s annotation fields.
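
            A tiny illustration of that key as a constant; the constant name and its placement are illustrative, not the actual constants.go entry:

            package example

            // Annotation key following the K8s <prefix>/<name> annotation naming convention.
            const AnnotationApplicationID = "yunikorn.apache.org/app-id"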

            tingyao TingYao Huang added a comment - - edited

            hi wwei 

            Sure. Just need to modify the YuniKorn part in the Spark operator.

            wwei Weiwei Yang added a comment -

            Sounds great. Please try this approach and let me know if this can work or not.
            Thank you Huang Ting Yao, kmarton.

            kmarton Kinga Marton added a comment -

            I think it can work. I will implement the YK part today.

            kmarton Kinga Marton added a comment -

            I have one note for the part where we populate the application ID:

            • The application ID should be in the form of namespace-name, where both the namespace and the name must be the values from the app CRD, not the names of the Spark pods.
            • when setting the name for the CRD, we have to make sure that the name does not contain "-", since we use it as a separator (see the small sketch below).
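
            A runnable sketch of why the separator matters; the helper name is illustrative:

            package main

            import "fmt"

            // buildAppID composes the ID from the app CRD namespace and name,
            // using "-" as the separator, as described above.
            func buildAppID(namespace, name string) string {
                return namespace + "-" + name
            }

            func main() {
                fmt.Println(buildAppID("default", "wordcount")) // default-wordcount: unambiguous
                // A "-" inside the CRD name makes the composed ID ambiguous to split back:
                // "default-word-count" could be ("default", "word-count") or ("default-word", "count").
                fmt.Println(buildAppID("default", "word-count"))
            }
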
            wwei Weiwei Yang added a comment -

            hi kmarton

            I just realize there might be a problem. Spark operator supports 2 types of jobs, SparkApplication and ScheduledSparkApplication.
            The latter one can run a job on a certain schedule, e.g. @10minutes. This means the ScheduledSparkApplication CRD will have a name, and every interval it will create a Spark job with a different spark-app-id. If we generate the app CRD based on the ScheduledSparkApplication CRD, these jobs will all be considered the same app inside YuniKorn. I guess we can probably live with that. Just bringing this up in case we are missing anything in the design.

            kmarton Kinga Marton added a comment -

            Hi wwei, from the YuniKorn point of view, can it cause trouble when scheduling those jobs if they all belong to the same application?

            I think that since we are talking about one single Spark application launching a bunch of jobs, it would be logical to handle it as one single YuniKorn application as well.

            wwei Weiwei Yang added a comment -

            PR committed, thanks kmarton


            People

              Assignee: kmarton Kinga Marton
              Reporter: kmarton Kinga Marton
              Votes: 0
              Watchers: 2

              Dates

                Created:
                Updated:
                Resolved: