Feathr currently supports Databricks on AWS, but it does not support EMR as a Spark launcher.
Contributor guide
Tech stack
scala, aws
Domain
backend, cloud, data
Issue type
feature
Difficulty: estimated implementation difficulty for new contributors, where 1 is a very small change and 5 is expert-level work.
4
Estimated time: rough time range for an experienced contributor to investigate, implement, test, and prepare a pull request.
over 1 week
Activity status: how open the issue currently is to participation (fresh, active, stale, blocked, or waiting for maintainer input).
stale
Clarity: whether the issue clearly states the expected change, acceptance criteria, and next steps.
mostly clear
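Prerequisites
Basic knowledge of Spark, AWS EMR, and Scala programming
Beginner friendliness: an estimated score from 1 to 100 indicating how approachable the issue is for first-time contributors.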
30
Research directions
Examine the existing Databricks launcher implementation in the Feathr codebase (likely in Scala). Understand the EMR API and how to submit Spark jobs to EMR. Look for any existing discussions or PRs related to EMR support. The maintainer may need to clarify the desired approach (e.g., using EMR steps or a custom launcher).
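To make the "EMR steps" option above concrete, here is a minimal sketch in Python of submitting a Spark job to a running EMR cluster as a step. It is not Feathr's implementation, and the maintainers may prefer a different approach. The helper names (build_spark_step, submit_step, get_step_state), the main class, and the S3 paths are hypothetical. The client is assumed to expose the boto3 EMR operations add_job_flow_steps and describe_step, and is passed in as a parameter so the logic can be exercised without AWS access.

```python
from typing import Any, Dict, List


def build_spark_step(name: str, main_class: str, jar_uri: str,
                     job_args: List[str]) -> Dict[str, Any]:
    """Build an EMR step definition that runs spark-submit via command-runner.jar."""
    return {
        "Name": name,
        # Keep the cluster alive if the step fails (an assumption for this sketch).
        "ActionOnFailure": "CONTINUE",
        "HadoopJarStep": {
            "Jar": "command-runner.jar",
            "Args": ["spark-submit", "--deploy-mode", "cluster",
                     "--class", main_class, jar_uri, *job_args],
        },
    }


def submit_step(emr_client: Any, cluster_id: str, step: Dict[str, Any]) -> str:
    """Submit a single step to an existing cluster and return its step ID."""
    response = emr_client.add_job_flow_steps(JobFlowId=cluster_id, Steps=[step])
    return response["StepIds"][0]


def get_step_state(emr_client: Any, cluster_id: str, step_id: str) -> str:
    """Return the current state string of a step, for example PENDING or COMPLETED."""
    response = emr_client.describe_step(ClusterId=cluster_id, StepId=step_id)
    return response["Step"]["Status"]["State"]
```

With boto3 installed and AWS credentials configured, the client would typically be created with boto3.client("emr") and passed as emr_client. A launcher built on this would poll get_step_state until the step reaches a terminal state.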
Add EMR on AWS as a spark launcher · feathr-ai/feathr#446 | Good First Issue