Feathr currently supports Databricks on AWS, but it does not support EMR as a Spark launcher.
Contributor guide
Tech stack
scala, aws
Domain
backend, cloud, data
Issue type
feature
Difficulty: estimated implementation difficulty for new contributors, where 1 is a very small change and 5 is expert-level work.
4
Estimated time: rough time range for an experienced contributor to investigate, implement, test, and prepare a pull request.
over 1 week
Activity status: how open the issue currently is to participation (fresh, active, stale, blocked, or waiting for maintainer input).
stale
Clarity: whether the issue clearly states the expected change, acceptance criteria, and next steps.
mostly clear
Prerequisites
Basic knowledge of Spark, AWS EMR, Scala programming
Beginner friendliness: an estimated score from 1 to 100 indicating how friendly the issue is to first-time contributors.
30
Research direction
Examine the existing Databricks launcher implementation in the Feathr codebase (likely in Scala). Understand the EMR API and how to submit Spark jobs to EMR. Look for any existing discussions or PRs related to EMR support. The maintainer may need to clarify the desired approach (e.g., using EMR steps or a custom launcher).
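As a rough illustration of the "EMR steps" option mentioned above, the sketch below builds a Spark step definition and submits it to an existing cluster through the EMR `AddJobFlowSteps` API. It is written in Python with a boto3-style client only for brevity; it is not taken from the Feathr codebase. The function names `build_spark_step` and `submit_step`, the bucket path, the main class, and the cluster ID are hypothetical placeholders, and the maintainers may prefer a different approach.

```python
from typing import Any, Dict, List, Optional


def build_spark_step(
    name: str,
    jar_uri: str,
    main_class: str,
    job_args: Optional[List[str]] = None,
    deploy_mode: str = "cluster",
) -> Dict[str, Any]:
    """Build an EMR step that runs spark-submit via command-runner.jar.

    All argument values are supplied by the caller; nothing here is
    specific to Feathr (hypothetical helper for illustration).
    """
    args = [
        "spark-submit",
        "--deploy-mode", deploy_mode,
        "--class", main_class,
        jar_uri,
    ]
    # Application arguments follow the jar on the spark-submit command line.
    args.extend(job_args or [])
    return {
        "Name": name,
        "ActionOnFailure": "CONTINUE",  # keep the cluster alive if the step fails
        "HadoopJarStep": {"Jar": "command-runner.jar", "Args": args},
    }


def submit_step(emr_client: Any, cluster_id: str, step: Dict[str, Any]) -> str:
    """Submit one step to a running EMR cluster and return its step ID.

    `emr_client` is expected to behave like boto3.client("emr").
    """
    response = emr_client.add_job_flow_steps(JobFlowId=cluster_id, Steps=[step])
    return response["StepIds"][0]
```

With boto3 installed and AWS credentials configured, usage might look like `submit_step(boto3.client("emr"), "j-EXAMPLE", build_spark_step("feathr-job", "s3://example-bucket/job.jar", "com.example.Main"))`, where every value shown is a placeholder. Passing the client in as a parameter keeps the step-building logic testable without AWS access.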
Add EMR on AWS as a spark launcher · feathr-ai/feathr#446 | Good First Issue