Skip to content

Latest commit

 

History

History
23 lines (18 loc) · 3.77 KB

ROADMAP.md

File metadata and controls

23 lines (18 loc) · 3.77 KB

Orange Roadmap

In the next few years, we plan to expand Orange in the following directions:

  • Scalability. Analyzing very large data in Orange is a challenge. Orange is an interactive data analytic tool with a flexible, user-defined workflow, which makes optimization for large data more complex than in the case of tools, which are either non-interactive or have a fixed data processing flow. We plan to address the following issues:

    • Memory efficiency. Orange can currently address data of up to around 500 MB.
    • Speed-up of complex operations. Hour-long processing is not acceptable for interactive data analysis.
    • Interruptibility of processes and progress updates. Operations that take too long are not always interruptible and may require shutting down the application; those same operations do not provide updates on progress reports.
  • Smoothness of the learning curve for new users. This is not about fixing a problem but about building on Orange’s existing strength.

    • Provide a tutorial for first-time users upon starting Orange. An excellent example is a guide in Open Street Map editor.
    • Simplification of the interface. Base Orange currently has under 200 components, and we keep this number low by design. Still, beginners should see fewer and be provided with assistance in further exploring.
    • Gamification. We are already exploring ways to implement self-paced tutorials within Orange and design ways to incorporate guidance for workflow construction, problem-driven exercises, and progress badges.
    • Video content. We will update and refresh existing YouTube videos and record new ones.
    • Lecture notes. We have lecture notes for almost a hundred hands-on workshops carried out around the world. They were typeset in Pages, which worked nicely in the beginning, but now the management of about 300 pages of material became cumbersome. We are converting the lecture notes to LaTeX, enabling fast assembly of notes for a custom-designed hands-on workshop. The current solution needs improvements in design, and text needs updates and additional proofreading.
  • Improved functionality in several application fields, with particular priority being

    • Spectroscopy: we are making continuous progress with an add-on, thanks to support from Synchrotron SOLEIL.
    • Text mining: our goal is to design components for document characterization, summarization, and explanation of point-based visualizations. This work is in progress, thanks to the support by the Slovene Ministry of Public Administration.
    • Network analysis: speed-up and beautify existing visualizations and extend the toolbox with new network analysis procedures. This is being done in collaboration with the International Laboratory for Applied Network Research, Higher Schools of Economics, Moscow.
    • Time series analysis: replace HighCharts with Qt-based visualizations, enable simultaneous analysis of a group of signals, improve modeling procedures, implement model evaluation techniques, and incorporate deep learning and auto-encoding for time series characterization, feature engineering, and prediction.

We need to address funding. The development of Orange relies on a core team sponsored by research and development grants. We will continue searching for new funding opportunities and seeking international collaborations to enrich Orange with new functionalities and improved interfaces. We seek funds both in developing Orange as a tool to democratize data science, assist in training machine learning, and support problem-solving in science and industry.