Spark operator support scheduled log write to S3 #2716

@melin

Description

What feature you would like to be added?

Log collection is one of the most challenging parts of running Spark on Kubernetes. It would greatly help users if the Spark operator could periodically write logs to object storage, similar to the logging in AWS EMR Serverless for Spark: https://docs.aws.amazon.com/emr/latest/EMR-Serverless-UserGuide/logging.html

  1. While the driver and executor pods are running, their full console logs are retrieved periodically (every 15 seconds) and written to S3.
  2. Before the driver and executor pods are terminated, their full console logs are retrieved one last time and written to S3.
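
The two behaviors above could be sketched roughly as follows. This is only an illustration of the requested behavior, not existing Spark operator code: `PeriodicLogShipper`, `fetch_logs`, `upload`, and the object key are hypothetical names. In a real setup, `fetch_logs` might wrap the Kubernetes client's `read_namespaced_pod_log` and `upload` might wrap an S3 `put_object` call, but both are passed in as plain callables here so the sketch runs without a cluster or a bucket.

```python
import threading
from typing import Callable


class PeriodicLogShipper:
    """Hypothetical helper: re-uploads a pod's full console log on a fixed
    interval while the pod runs, and once more when it is stopped."""

    def __init__(
        self,
        fetch_logs: Callable[[], str],        # assumed: returns the full console log so far
        upload: Callable[[str, str], None],   # assumed: upload(key, body) overwrites the object
        key: str,                             # assumed object key, e.g. "logs/driver/stdout"
        interval_seconds: float = 15.0,       # the 15-second period from the request
    ) -> None:
        self._fetch_logs = fetch_logs
        self._upload = upload
        self._key = key
        self._interval = interval_seconds
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def _ship_once(self) -> None:
        # Fetch the whole log and overwrite the stored copy.
        self._upload(self._key, self._fetch_logs())

    def _run(self) -> None:
        # Event.wait returns False on timeout (ship and keep looping)
        # and True once stop() has been called (exit the loop).
        while not self._stop.wait(self._interval):
            self._ship_once()

    def start(self) -> None:
        """Step 1: begin periodic uploads in a background thread."""
        self._thread.start()

    def stop(self) -> None:
        """Step 2: stop the loop, then do one final upload before termination.
        Must be called after start()."""
        self._stop.set()
        self._thread.join()
        self._ship_once()
```

Something would call `start()` when the driver or executor pod begins running and `stop()` just before it is terminated, for example from a pod lifecycle hook or an operator-side watcher. Where that call should live is left open here.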

Why is this needed?

No response

Describe the solution you would like

No response

Describe alternatives you have considered

No response

Additional context

No response

Love this feature?

Give it a 👍. We prioritize the features with the most 👍.
