Skip to content
View HaolingZHANG's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@TranslationalBioinformaticsUnit @BGI-SynBio
Block or Report

Block or report HaolingZHANG

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
HaolingZHANG/README.md

Hi, there 👋

I'm Haoling (Horus) Zhang, a senior algorithm engineer at BGI-Research and a member of George Church Institute of Regenesis. I led or participated in research of coding algorithms, architectures, software packages, and standardization in the field of DNA storage, published 4 papers including one in Nature Computational Science, under the guidance of Prof. George Church at Harvard Medical School and Dr. Yue Shen at BGI-Research. Besides, with my experience in engineering, mathematics and genomics, I have effectively undertaken the cooperation with coding scientists such as Prof. Gennian Ge at Capital Normal University and Prof. Yiwei Zhang at Shandong University.

Concurrently, I received valuable training from Prof. Jesper Tegnér at King Abdullah University of Science and Technology, Prof. ‪Hector Zenil‬ at University of Oxford, Prof. Narsis Kiani at Karolinska Institute, and Dr. Chao-Han Huck Yang at Georgia Institute of Technology on machine learning and causal discovery. Moreover, I have collaborated with Dr. Pin-Yu Chen at IBM Research on adversarial robustness.


I’m a full-stack developer. Since I have mainly worked in the scientific research community, most of my open-source projects are programmed in Python. I have also reprogrammed or reconstructed some other projects with low-level programming language into Python for better use by relevant scientists. In the past three years, I have developed more than 25 algorithms/frameworks and have coded more than 400,000 lines. The 19 open-source repositories are as follows, the remaining repositories will be released with sufficient tests and/or papers published.

Some key open-source repositories are:

  1. Chamaeleo: an integrated evaluation platform for DNA-based data storage.
  2. DNASpiderWeb: a general DNA-based data storage architecture and benchmark, the implementation of the work SPIDER-WEB generates coding algorithms with superior error tolerance and real-time information retrieval capacity.
  3. DNA-storage-YYC: an implementation of the work Towards Practical and Robust DNA-based Data Archiving using the Yin–Yang Codec System (published in Nature Computational Science).
  4. pyHEDGES: a rewritten implementation of the work HEDGES Error-Correcting Code for DNA Storage Corrects Indels and Allows Sequence Constraints in the Python platform (C++ originally, published in Proceedings of the National Academy of Sciences).
  5. ReverseEncodingTree: an implementation of the work Evolving Neural Network through the Reverse Encoding Tree.
  6. FasterHammingCalculator: A large-scale Hamming distance calculation accelerator using the customized three-dimensional matrix (10 ~ 100 times).
  7. pymolBE: Customized batch exhibition of three-dimensional molecules based on PyMOL framework.
  8. StructureCluster: It provides some scripts for protein comparison (clustering) and format conversion.
  9. GenomeCompact: a lossless compression algorithm for T4 Phage.

Pinned

  1. DNASpiderWeb DNASpiderWeb Public

    SPIDER-WEB generates coding algorithms with superior error tolerance and real-time information retrieval capacity

    Python 7 3

  2. FasterHammingCalculator FasterHammingCalculator Public

    Do massive Hamming distance calculation rapidly.

    Python 2

  3. ntpz870817/Chamaeleo ntpz870817/Chamaeleo Public

    BGI DNA Storage Kit

    Python 33 7

  4. ntpz870817/DNA-storage-YYC ntpz870817/DNA-storage-YYC Public

    Program designed for DNA storage using Yin-Yang transcoding algorithm

    Python 19 6

  5. pyHEDGES pyHEDGES Public

    HEDGES Error-Correcting Code for DNA (using Python platform)

    Python 4 1

  6. BGI-SynBio/PyMOL-PUB BGI-SynBio/PyMOL-PUB Public

    Rapid generation of high-quality structure figures for publication with PyMOL-PUB

    Python 37 2