Skip to content
Siu Kwan Lam edited this page Dec 13, 2018 · 1 revision

Numba Meeting: 2018-11-27

Attendees: Stuart, Todd, Siu, Ehsan

0. Feature Discussion

  • 0.41.0 Release status
  • 0.42.0 Release Planning
    • Shorter cycle
      • RC Dec 10; Final Dec 17
    • LLVM 7
      • 1 windows bug
    • Other string features/fixes
    • Caching bugs:
      • parallel=True no-threads caching issues

1. New issues

  • #3542 - Anything else than os.cpu_count() for NUMBA_NUM_THREADS threated as 1
    • could be import order
  • #3541 - numba.errors.LoweringError: Failed in nopython mode pipeline (step: nopython mode backend)
    • Invalid List.append pass type check
  • #3537 - Numba 0.41.0 final release checklist
  • #3534 - Parfors reordering print statement in fusion.
    • issues with aliasing of slice
  • #3533 - merging signatures and layout in guvectorized decorator
    • feature request
  • #3529 - Lowering error when using njit on inner function and not passing arguments
    • because some types can't be compile-time constants
    • documented behavior
    • need error message improvements
  • #3528 - numpy.median gives error in stencil
    • codegen bug
    • not median specific. issue with any reduction function it seems.
  • #3525 - "This should not have happened" ~ checking for value in numpy array
    • bug in error checking/reporting
  • #3524 - yield + for issue
    • bug confirmed
  • #3522 - prange induces race conditions, parallelization slows down code dramatically with range
    • another reason for parallel diagnostic, add prange in range warning
    • and more documentation/tutorial on parallel-accelerator
  • #3521 - Numba stencils unexpectedly upconvert float32 to float64 differently than numpy
  • #3518 - @stencil ignores cval when out kwarg supplied.
    • PR: #3520
  • #3517 - Unused functions abort compilation

Already Closed

  • #3540 - numba.errors.LoweringError: Failed in nopython mode pipeline (step: nopython mode backend)
  • #3523 - @njit is mentioned, but not explained, in chapter 1 of the user manual

2. Open PRs

New

  • #3538 - Avoid future C-level assertion error due to invalid visibility
  • #3532 - Unregister temporarily pinned host arrays at once
    • Siu will review
  • #3531 - Fix jitclass method inlining and avoid unnecessary increfs
    • Siu will review
  • #3520 - Fix @stencil ignoring cval if out kwarg supplied.
  • #3519 - WIP: fix-3457 support of numpy repeat.
  • #3516 - [WIP] Typeof dtype values
    • Siu will review

Already Merged

  • #3539 - Update changelog with 0.41.0 RC commits
  • #3536 - Sets list dtor linkage to linkonce_odr to fix visibility in AOT.
  • #3530 - This adds arg an check for the pattern supplied to Parfors.
  • #3527 - Fix complex math sqrt implementation for large -ve values.
  • #3526 - Fix 5 min guide with respect to @njit decl.
  • #3514 - Add missing unicode != unicode

Old

  • #3468 - Add support for np.clip and ndarray.clip.

    • depends on fix on @overload_method for kwargs
  • 3437 - Changes to accommodate LLVM 7.0.x

  • 3450 - [WIP] generated_jit for CUDA kernels

  • 3392 - Launch and attach gdb directly from Numba.

    • needs work
  • 3390 - typeinfer: use unknown_loc object instead of string literal

  • 3162 - Support constant dtype string in nopython mode in functions like numpy.empty.

    • Need to resolve #3195
  • 3160 - First attempt at parallel diagnostics

    • Stuart will implement Todd's suggestion
  • 3134 - [WIP] Cfunc x86 abi

    • Needs re-review
  • 3046 - Pairwise sum implementation.

  • #2999 - Support LowLevelCallable

  • #2942 - Fix linkage nature (declspec(dllexport)) of some test functions

  • #2894: [WIP] Implement jitclass default constructor arguments.

  • #2817: [WIP] Emit LLVM optimization remarks

Merged old PRs

  • 3449 - [WIP] Allow matching non-array objects in find_callname()
    • merged
  • 3414 - [WIP] Refactor Const type
    • merged
  • 3399 - Add max_registers Option to cuda.jit
    • merged
  • 3397 - Fix with-objmode warning
    • merged
  • #2950 - Fix dispatcher to only consider contiguous-ness.
    • merged
  • 3385 - conda recipe: whitelist libiomp5.dylib
    • merged
  • 3382 - CUDA_ERROR_MISALIGNED_ADDRESS Using Multiple Const Arrays
    • merged

===========================

4. Next Release: Version 0.41, RC=Nov 19, Final=Nov 26, 2018

  • Type refactoring
  • Parallel diagnostics
  • LLVM 7
  • Initial string support
  • Finishing off stalled PRs
  • Usual collection of bug fixes and small features
Clone this wiki locally