Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Narrow dependencies-第二章第二节图FullDependency: N : N #43

Open
feitang0 opened this issue Nov 6, 2015 · 5 comments
Open

Narrow dependencies-第二章第二节图FullDependency: N : N #43

feitang0 opened this issue Nov 6, 2015 · 5 comments

Comments

@feitang0
Copy link

feitang0 commented Nov 6, 2015

Narrow dependencies: each partition of the parent RDD is used by at most one partition of the child RDD
第二章第二节中 FullDependency: N : N 那张图, 父RDD中的一个分区被子RDD的两个分区依赖, 不能被称为Narrow Dependency吧, 为啥说FullDenpency是NarrowDepency呢?

@zzl0
Copy link

zzl0 commented Nov 18, 2015

+1

2 similar comments
@pzz2011
Copy link

pzz2011 commented Mar 29, 2016

+1

@wojiaohgl
Copy link

+1

@JerryLead
Copy link
Owner

Narrow指的是完全依赖,parentRDD中每个p中的数据不需要再进行partition后发给childRDD。下面的cartesian(otherRDD)展示了N:N的Narrow Dependency,整个计算过程不需要shuffle。

@liqul
Copy link

liqul commented Feb 27, 2017

@JerryLead
个人觉得这里narrow vs. wide定义不是很清楚,感觉作者原意是想把确定的和随机的分开,所以如果中间有shuffle操作则为wide,否则为narrow。其实际的含义是确定和不确定的区别(即给定一个子partition,其父partition是完全确定的),而不是full还是partial。尤其,"essentially" 建议改为"typically",否则意思上也是有自相矛盾的地方。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

6 participants