Details
- New Feature
- Status: Resolved
- Major
- Resolution: Duplicate
- trunk
- None
- None
- None
Description
On a busy cluster with long-running coordinator jobs, streaming the logs can be very slow. In this situation the user is typically interested only in the most recent logs, not in older ones. We can speed this up by adding an optional parameter to the log streaming API that fetches only the logs from the last x hours. Because the log files roll over every hour, selecting the relevant files should be straightforward. We can also prepend a message to the streamed logs stating that only the last x hours are included. The existing behavior of fetching all of the logs should remain unchanged.
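The file selection described above could be sketched roughly as follows. This is only an illustration, not Oozie's implementation: the function names `select_recent_logs` and `truncation_notice` are hypothetical, and the sketch assumes that rolled files are named `oozie.log-YYYY-MM-DD-HH` (with the suffix marking the start of the hour the file covers) while the active file is plain `oozie.log`.

```python
import re
from datetime import datetime, timedelta

# Assumption: rolled files are named "oozie.log-YYYY-MM-DD-HH" and the
# suffix is the start of the hour covered; the active file is "oozie.log".
ACTIVE_LOG = "oozie.log"
ROLLED = re.compile(r"^oozie\.log-(\d{4}-\d{2}-\d{2}-\d{2})$")


def _chronological(name):
    # Rolled files sort by their timestamp suffix; the active file goes last.
    return (name == ACTIVE_LOG, name)


def select_recent_logs(filenames, now, last_hours=None):
    """Return the log files to stream, oldest first.

    With last_hours=None every file is returned (the existing behavior).
    Otherwise only files whose hour overlaps the last `last_hours` hours
    before `now` are kept, plus the active file.
    """
    if last_hours is None:
        return sorted(filenames, key=_chronological)

    cutoff = now - timedelta(hours=last_hours)
    selected = []
    for name in filenames:
        if name == ACTIVE_LOG:
            selected.append(name)
            continue
        match = ROLLED.match(name)
        if match is None:
            continue  # not a log file this sketch recognizes
        hour_start = datetime.strptime(match.group(1), "%Y-%m-%d-%H")
        # Keep the file if any part of its hour falls after the cutoff.
        if hour_start + timedelta(hours=1) > cutoff:
            selected.append(name)
    return sorted(selected, key=_chronological)


def truncation_notice(last_hours):
    """Message to prepend to the stream when the logs were limited."""
    return f"Only the last {last_hours} hour(s) of logs are included."
```

Since the files roll hourly, a file is either skipped entirely or streamed in full, so the result may contain slightly more than x hours of logs. Leaving `last_hours` unset returns every file, which keeps the existing behavior intact.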
Attachments
Issue Links
- duplicates OOZIE-1737 Oozie log streaming is slow (Closed)