Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

这篇文档("0.1 Spark Streaming 实现思路与模块概述.md")存在描述错误的地方 #45

Open
highfei2011 opened this issue May 2, 2018 · 1 comment

Comments

@highfei2011
Copy link

如下:
DStream 和 RDD 的关系
既然 DStream 是 RDD 的模板,而且 DStream 和 RDD 具有相同的 transformation 操作,比如 map(), filter(), reduce() ……等等(正是这些相同的 transformation 使得 DStreamGraph 能够忠实记录 RDD DAG 的计算逻辑),那 RDD 和 DStream 有什么不一样吗?

此处描述有误:reduce()是action操作,而不是transformation操作

@Marcus366
Copy link

Marcus366 commented Nov 22, 2018

同意,按官网列举。
https://spark.apache.org/docs/latest/rdd-programming-guide.html
reduce()是action,reduceByKey()才是transformation
不过原文原意应该是想列reduceByKey作为例子

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants