2018 03 28

Inference Framework
- Verify the correctness of resnet50
- Analyze the profiling data of Fluid and TensorRT
- Start the work of integrating TensorRT
Mobile
- Support the MDL group

guosheng

NMT:
- Decouple the program desc from batch_size in Transformer.
  - https://github.com/PaddlePaddle/models/pull/783
- Refine the ReshapeOp enhancement.
  - https://github.com/PaddlePaddle/Paddle/pull/9008
- Work related to Transformer on the NIST dataset.

zhaochengduo

PR
- Add CUDAPinnedPlace
  - https://github.com/PaddlePaddle/Paddle/pull/9380
- Add SE-ResNeXt-152_parallel_exe
  - https://github.com/dzhwinter/benchmark/pull/91
- Add cos and sin
  - https://github.com/PaddlePaddle/Paddle/pull/9449
- Fix concat_op[merged]
  - https://github.com/PaddlePaddle/Paddle/pull/9337
- Add pinned memory[merged]
  - https://github.com/PaddlePaddle/Paddle/pull/9216
Review
- Cpp parallel executor
  - https://github.com/PaddlePaddle/Paddle/pull/9080
- Fix the order of reads and writes from buffered channel
  - https://github.com/PaddlePaddle/Paddle/pull/9423
- Fluid channels should match the semantics of Go Channels
  - https://github.com/PaddlePaddle/Paddle/pull/9265
- Improve layer_norm speed
  - https://github.com/PaddlePaddle/Paddle/pull/9355

qiaolongfei

fluid

Fluid support for Abacus (discussed with @wuyi @yanxu @helin @wangyi @lidong)
Project: https://github.com/PaddlePaddle/Paddle/projects/56
Fluid implementation: TODO: https://github.com/PaddlePaddle/Paddle/issues/9211
1. support empty tensor https://github.com/PaddlePaddle/Paddle/pull/9338
2. add split ids op https://github.com/PaddlePaddle/Paddle/pull/9370
3. fix compile send_op on mac https://github.com/PaddlePaddle/Paddle/pull/9360
4. WIP prefetch_op
Others:
- Fix data transform when inplace https://github.com/PaddlePaddle/Paddle/pull/9450
  - Paddle/python/paddle/fluid/tests/book/test_label_semantic_roles.py reports an error under CUDA after a regularization term is added to the CRF layer https://github.com/PaddlePaddle/Paddle/issues/9234
  - Using the CRF layer on multi-threaded GPU fails if batch_size is not 1 https://github.com/PaddlePaddle/Paddle/issues/9261
- change boost download url to speed up download https://github.com/PaddlePaddle/Paddle/pull/9331

fengjiayi

Profiling of C++ Reader:

Throughput (instances/sec):

Net Config	Simple Demo Net	VGG16
V2 Reader	819.11	57.49
V2 Reader with cache	-	58.9
C++ Reader	1629.88	61.44
C++ Reader with DoubleBuffer	2382.13	DOING
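
As a rough reading aid, the relative speedups implied by the figures above can be computed as in the sketch below. The numbers are copied from the table; the names `THROUGHPUT` and `speedup_over_baseline` are hypothetical and exist only for this illustration, and taking the V2 Reader as the baseline is an assumption.

```python
# Throughput figures (instances/sec) copied from the table above.
# None marks entries that were not measured ("-") or are still in progress ("DOING").
THROUGHPUT = {
    "V2 Reader": {"Simple Demo Net": 819.11, "VGG16": 57.49},
    "V2 Reader with cache": {"Simple Demo Net": None, "VGG16": 58.9},
    "C++ Reader": {"Simple Demo Net": 1629.88, "VGG16": 61.44},
    "C++ Reader with DoubleBuffer": {"Simple Demo Net": 2382.13, "VGG16": None},
}


def speedup_over_baseline(table, baseline="V2 Reader"):
    """Return {reader: {net: ratio}} relative to the baseline reader.

    A ratio is None when either the reader or the baseline has no measurement.
    """
    result = {}
    for reader, nets in table.items():
        result[reader] = {}
        for net, value in nets.items():
            base = table[baseline][net]
            if value is None or base is None:
                result[reader][net] = None
            else:
                result[reader][net] = round(value / base, 2)
    return result


if __name__ == "__main__":
    for reader, ratios in speedup_over_baseline(THROUGHPUT).items():
        print(reader, ratios)
```

On these figures the C++ Reader is roughly 2x the V2 Reader on the simple demo net but only about 1.07x on VGG16, which suggests (but does not prove) that the reader is not the main bottleneck for VGG16.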

Kernels for increment_op:
- https://github.com/PaddlePaddle/Paddle/pull/9428
Reviews:
- [support empty tensor] https://github.com/PaddlePaddle/Paddle/pull/9338
- [SSD API Update] https://github.com/PaddlePaddle/Paddle/pull/9396
- [activation in place by default] https://github.com/PaddlePaddle/Paddle/pull/9417
- [Channel bug fix] https://github.com/PaddlePaddle/Paddle/pull/9423

gongweibao

gRPC throughput test:
- https://github.com/gongweibao/tests/tree/develop/grpc_test
Add drop_out_op unit test
- https://github.com/PaddlePaddle/Paddle/pull/9364
SendOp can't capture sendop time:
- https://github.com/PaddlePaddle/Paddle/pull/9345

Xin Pan

Improve LayerNorm speed by 3x-4x; Transformer speeds up by 15%-20%
- https://github.com/PaddlePaddle/Paddle/pull/9355
Follow up on P40 machines and configuration
- Have enough machines to develop and evaluate performance
- Have the same configuration as Paddle Cloud machines
- Have 1 machine for continuous model evaluation.
Follow up on 5.1 Paddle Cloud goals
Review ParallelExecutor and ParallelGPUExecutor and profile speed

dongzhihong

[Speed] ~1x acceleration of sequence expand/grad op by merging CUDA kernels.
- https://github.com/PaddlePaddle/Paddle/pull/9289
[Speed] ~8x acceleration of sequence pooling op (max, average, ...) by merging CUDA kernels
- https://github.com/PaddlePaddle/Paddle/pull/9217
[Speed] Accelerate sequence softmax op by merging CUDA kernels
- https://github.com/PaddlePaddle/Paddle/pull/9357
[Benchmark] migrate the benchmark repo into paddle main repo
- https://github.com/PaddlePaddle/Paddle/pull/9462
[Benchmark] add scripts for model CI
- https://github.com/dzhwinter/benchmark/pull/92
polish init code
- https://github.com/PaddlePaddle/Paddle/pull/9318
fix bug in parallel do
- https://github.com/PaddlePaddle/Paddle/pull/9318
fix bug in dropout
- https://github.com/PaddlePaddle/Paddle/pull/9318

helinwang

multiple GPU executor implementation and testing with YangYang: https://github.com/PaddlePaddle/Paddle/pull/9035
Discuss possible fluid imperative programming paradigms: https://github.com/PaddlePaddle/Paddle/issues/9466
PR and issue reviews:

cs2be(thuan)

PR:

Create go_op design doc (https://github.com/PaddlePaddle/Paddle/pull/9389)
Add is_copy attribute to SelectCase (https://github.com/PaddlePaddle/Paddle/pull/9393)
Add channel design document (https://github.com/PaddlePaddle/Paddle/pull/9463)

Discussions:

Initial discussions about back propagation for CSP ops
Discuss possible fluid imperative programming paradigms: https://github.com/PaddlePaddle/Paddle/issues/9466

jetfuel(Jeff Wang)

PR:

Create Text storage backend component: https://github.com/PaddlePaddle/VisualDL/pull/333
Create Text frontend UI Vue component: https://github.com/PaddlePaddle/VisualDL/pull/337
Connect Text backend and frontend component with real data: https://github.com/PaddlePaddle/VisualDL/pull/341
Fix Travis CI script: https://github.com/PaddlePaddle/VisualDL/pull/336
Fix time format issue and disappearing slider issue: https://github.com/PaddlePaddle/VisualDL/pull/343

Research and Demo

Embedding Visualization: https://github.com/PaddlePaddle/VisualDL/issues/247#issuecomment-376629893, https://github.com/PaddlePaddle/VisualDL/issues/247#issuecomment-377047373

nickyfantasy

PR:

Create Audio preview feature API: https://github.com/PaddlePaddle/VisualDL/pull/344
Add Audio API unit tests: https://github.com/PaddlePaddle/VisualDL/pull/345

daming-lu

VisualDL:
1. Switched from Cytoscape to D3+Dagre as the latter is more robust and can build more complex nodes
2. Can distinguish different nodes (input, operator, output) and will add different info for different nodes
3. Helped ECharts to market VisualDL: http://www.iqiyi.com/w_19rwr76q69.html
- https://github.com/PaddlePaddle/VisualDL/pull/348
- https://github.com/PaddlePaddle/VisualDL/pull/338
PaddlePaddle.org:
Code Review:

Release Notes
