🌟 Project Candidates #1

raynardj · 2023-10-19T18:39:44Z

🌟 Possible Targets for Vanilla Events

Here we maintain a list of ideas. Each idea will be a comment starting with the light bulb emoji 💡

PLEASE USE 👍🏻 TO VOTE ON IDEAS

raynardj · 2023-10-19T18:40:56Z

💡 build your own tree 🌳

Decision tree from scratch

raynardj · 2023-10-19T18:41:56Z

💡 your own PCA or t-SNE 🗺

raynardj · 2023-10-19T18:43:55Z

💡 Mel-Spectrogram 🎙

Build a function to transform a WAV audio signal into a mel-spectrogram, and another function to transform it back

raynardj · 2023-10-19T18:58:11Z

💡 Our own stock picker pipeline

lrthomps · 2023-10-19T19:11:55Z

> 💡 build your own tree 🌳
>
> Decision tree from scratch

and I could leverage some code I created for histogram-based decision trees at work?

raynardj · 2023-10-19T21:43:17Z

> > 💡 build your own tree 🌳
> >
> > Decision tree from scratch
>
> and I could leverage some code I created for histogram-based decision trees at work?

Why not? Do you need an MIT license to use it more comfortably? We can put one in.

Histogram-based decision trees: aren't those already in many libraries, even sklearn?

lrthomps · 2023-10-19T23:29:13Z

> > > 💡 build your own tree 🌳
> > >
> > > Decision tree from scratch
> >
> > and I could leverage some code I created for histogram-based decision trees at work?
>
> Why not? Do you need an MIT license to use it more comfortably? We can put one in.
>
> Histogram-based decision trees: aren't those already in many libraries, even sklearn?

Yup, but it was fun to implement anyway and we were going to re-implement it in C++ to be faaaast

raynardj · 2023-10-20T05:37:51Z

> > > > 💡 build your own tree 🌳
> > > >
> > > > Decision tree from scratch
> > >
> > > and I could leverage some code I created for histogram-based decision trees at work?
> >
> > Why not? Do you need an MIT license to use it more comfortably? We can put one in.
> >
> > Histogram-based decision trees: aren't those already in many libraries, even sklearn?
>
> Yup, but it was fun to implement anyway and we were going to re-implement it in C++ to be faaaast

I'm pretty sure my C++ worthiness is less; I'm more of a Rust person.

RESPECT~~~

lrthomps · 2023-10-26T18:07:05Z

💡 4. Download stock prices from your favorite online finance website over a period of at least three years. Create a dataset for testing portfolio selection algorithms by creating price-return vectors. Implement the OGD and ONS algorithms and benchmark them on your data. (From the textbook Introduction to Online Convex Optimization.)

elasticsearcher · 2023-10-26T18:55:03Z

I love all the projects here, but right now number 4 is my absolute favourite and I shamelessly encourage everyone to vote for it!! 🔥🔥🔥

For those who haven’t been reading the OCO textbook:

- OGD stands for Online Gradient Descent. It’s similar to regular, “offline” gradient descent, but OGD isn’t trained on a fixed training set. Instead, it keeps updating in real time, taking one gradient step after each new observation, so it adjusts steadily in response to unpredictable, even adversarial, real-world events 🔥🔥🔥
- ONS stands for Online Newton Step, an online convex optimization algorithm with a very tight, i.e. logarithmic in T, upper bound on the total regret (for exp-concave losses), expressed as a function of the hyper-parameters gamma and epsilon and the number of training steps T.
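To make the OGD description a bit more concrete, here is a minimal sketch for the portfolio setting, not a reference implementation. It assumes the loss at each step is the negative log of the portfolio's one-period return and uses a constant step size `eta` for simplicity (a decaying step size is the more usual choice). The function names, `eta`, and the made-up input are illustrative assumptions.

```python
import numpy as np

def project_to_simplex(v):
    """Euclidean projection of v onto the probability simplex (sort-based method)."""
    n = v.size
    u = np.sort(v)[::-1]                        # sort descending
    css = np.cumsum(u) - 1.0
    k = np.arange(1, n + 1)
    rho = np.nonzero(u - css / k > 0)[0][-1]    # last index satisfying the condition
    theta = css[rho] / (rho + 1)
    return np.maximum(v - theta, 0.0)

def ogd_portfolio(returns, eta=0.05):
    """Online gradient descent on the loss f_t(x) = -log(r_t . x).

    returns: (T, n) array of price relatives (assumed input format).
    Returns the final wealth and the portfolios played at each step.
    """
    T, n = returns.shape
    x = np.full(n, 1.0 / n)                     # start from the uniform portfolio
    wealth = 1.0
    history = []
    for r in returns:
        history.append(x.copy())                # commit to x before seeing r
        gain = r @ x
        wealth *= gain
        grad = -r / gain                        # gradient of -log(r . x)
        x = project_to_simplex(x - eta * grad)  # gradient step, then project back
    return wealth, np.array(history)

# Made-up data: asset 0 always gains 10%, asset 1 always loses 10%.
toy_returns = np.tile([1.1, 0.9], (50, 1))
final_wealth, portfolios = ogd_portfolio(toy_returns)
```

On this toy input the weight on asset 0 should drift upward over time, since it is the only asset that gains.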

This project is both self-contained and super straightforward to implement, consisting of clearly demarcated tasks:

1. Create an “online” dataset of real historical stock price data covering a period of at least 3 years, which will be used to simulate an online setting to test our online learning algorithms
2. Create a separate, much smaller, “debug” dataset that we can use as cannon fodder while developing and debugging our algorithms; this is optional but I think it’s more fun to separate the development and the “production” phases of the project
3. Implement the general Online Gradient Descent algorithm
4. Implement the Online Newton Step algorithm
5. Benchmark both algorithms on the “production” dataset and make plots to report the results
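As a rough sketch of the dataset step, assuming "price-return vectors" means the ratio of consecutive prices per asset, something like the following could turn a price table into those vectors and compute a simple baseline to benchmark against. The helper names and the tiny price table are made up for illustration; real data would come from whichever finance site we pick.

```python
import numpy as np

def price_relatives(prices):
    """Turn a (T+1, n) array of prices into a (T, n) array of price relatives.

    Entry [t, i] is prices[t+1, i] / prices[t, i], i.e. the growth factor of
    asset i over period t.
    """
    prices = np.asarray(prices, dtype=float)
    return prices[1:] / prices[:-1]

def uniform_crp_wealth(relatives):
    """Final wealth of a uniform constant-rebalanced portfolio (a simple baseline)."""
    n = relatives.shape[1]
    return float(np.prod(relatives @ np.full(n, 1.0 / n)))

# Tiny made-up "debug" dataset: 3 days of prices for 2 assets.
debug_prices = np.array([
    [100.0, 50.0],
    [110.0, 45.0],
    [121.0, 45.0],
])
debug_relatives = price_relatives(debug_prices)
baseline = uniform_crp_wealth(debug_relatives)
```

Any online algorithm that takes a `(T, n)` array of price relatives can then be run on the debug set first and on the full set afterwards.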

raynardj · 2023-10-26T21:12:08Z

> 💡 4. Download stock prices from your favorite online finance website over a period of at least three years. Create a dataset for testing portfolio selection algorithms by creating price-return vectors. Implement the OGD and ONS algorithms and benchmark them on your data. (From the textbook Introduction to Online Convex Optimization.)

Well, this is all just great.

My wife built something that can scrape financial data and analyze things in a very simple way, and I asked whether she could make it more useful by adding something beyond "asking chatgpt if this stock is going to rise". We got stuck there, so I guess your suggestion is exactly our answer: her homework.

raynardj · 2023-10-26T21:12:50Z

@elasticsearcher u must be Andrew

tianyimasf · 2023-10-26T21:18:15Z

suggestion: MLP, ANN, Markov chain, reinforcement learning
also if anyone knows probabilistic graphical models...

raynardj · 2023-10-26T21:28:48Z

> suggestion: MLP, ANN, Markov chain, reinforcement learning
> also if anyone knows probabilistic graphical models...

Good suggestions, can you make them more specific?

e.g.

Create an MLP with well-defined back-propagation using numpy, etc.

and lead with 💡 so we can vote on it! 🌟

tianyimasf · 2023-10-26T21:38:42Z

💡 MLP with back-propagation and inference using numpy
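For a sense of scope, here is a minimal sketch of what this could look like: one hidden tanh layer, a sigmoid output, binary cross-entropy loss, and plain full-batch gradient descent. The layer sizes, learning rate, and the XOR toy data are assumptions picked for illustration only.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_params(n_in, n_hidden, rng):
    return {
        "W1": rng.normal(0, 0.5, (n_in, n_hidden)), "b1": np.zeros(n_hidden),
        "W2": rng.normal(0, 0.5, (n_hidden, 1)), "b2": np.zeros(1),
    }

def forward(p, X):
    """Inference: returns the hidden activations and the predicted probabilities."""
    h = np.tanh(X @ p["W1"] + p["b1"])
    y_hat = sigmoid(h @ p["W2"] + p["b2"])
    return h, y_hat

def loss_and_grads(p, X, y):
    """Binary cross-entropy loss and its gradients via back-propagation."""
    h, y_hat = forward(p, X)
    m = X.shape[0]
    eps = 1e-12                                   # avoid log(0)
    loss = -np.mean(y * np.log(y_hat + eps) + (1 - y) * np.log(1 - y_hat + eps))
    d_out = (y_hat - y) / m                       # gradient w.r.t. output pre-activation
    d_h = (d_out @ p["W2"].T) * (1 - h ** 2)      # back through tanh
    grads = {
        "W2": h.T @ d_out, "b2": d_out.sum(axis=0),
        "W1": X.T @ d_h, "b1": d_h.sum(axis=0),
    }
    return loss, grads

def train(X, y, n_hidden=4, lr=0.5, steps=2000, seed=0):
    rng = np.random.default_rng(seed)
    p = init_params(X.shape[1], n_hidden, rng)
    losses = []
    for _ in range(steps):
        loss, g = loss_and_grads(p, X, y)
        losses.append(loss)
        for k in p:
            p[k] -= lr * g[k]                     # gradient descent update
    return p, losses

# Toy data: XOR, which a network without a hidden layer cannot fit.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)
params, losses = train(X, y)
```

A numerical gradient check against `loss_and_grads` is a cheap way to confirm the back-propagation is right before training on anything bigger.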

tianyimasf · 2023-10-26T21:39:12Z

💡 2-Layer ANN with back-propagation and inference function using numpy

tianyimasf · 2023-10-26T21:48:08Z

💡 A hidden Markov model with an adjustable number of hidden states.

Train it with the Expectation Maximization (EM) algorithm, and empirically investigate applications using the Forward-Backward (sum-product) and Viterbi (max-product) algorithms.

It should accept command-line arguments for the path to the training data, the number of hidden states to use, and the maximum number of iterations of EM to apply. By default, it should simply “do EM on the dataset” and print out the overall likelihood at initialization and again after each iteration of EM.

Evaluate accuracy when predicting “into the future”. You may calculate the accuracy when predicting the “next state”, averaged over all states in the training data. You may also explore how the accuracy drops off when predicting t steps into the future.

https://github.com/tianyimasf/sequence-hmm/blob/main/sequenceProject.pdf
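The "print out the overall likelihood" part can be sketched with a scaled forward pass, which is also the first half of Forward-Backward. This is only an illustration for a discrete-observation HMM; the parameter names (`pi`, `A`, `B`) and the toy values are assumptions, not taken from the linked project description.

```python
import numpy as np

def forward_loglik(pi, A, B, obs):
    """Scaled forward algorithm for a discrete HMM.

    pi:  (K,)   initial state distribution
    A:   (K, K) transitions, A[i, j] = P(next state j | current state i)
    B:   (K, M) emissions,   B[i, o] = P(observation o | state i)
    obs: sequence of integer observation symbols
    Returns the log-likelihood of obs and the final filtered state distribution.
    """
    alpha = pi * B[:, obs[0]]
    c = alpha.sum()                      # scaling factor, avoids underflow
    loglik = np.log(c)
    alpha = alpha / c
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]    # predict one step, then weight by emission
        c = alpha.sum()
        loglik += np.log(c)
        alpha = alpha / c
    return loglik, alpha

# Toy model with 2 hidden states and 2 symbols, all probabilities uniform.
pi = np.array([0.5, 0.5])
A = np.full((2, 2), 0.5)
B = np.full((2, 2), 0.5)
loglik, final_alpha = forward_loglik(pi, A, B, [0, 1, 0])
```

Multiplying the final filtered distribution by `A` gives the predicted distribution over the next state, which is one way to set up the "predicting into the future" evaluation.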

tianyimasf · 2023-10-26T21:53:24Z

💡 RL with Q-learning -- training & prediction using numpy
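A minimal sketch of tabular Q-learning in numpy, assuming a made-up 5-state chain environment where the agent starts at the left end and gets a reward of 1 for reaching the right end. The environment, the hyper-parameters, and the helper names are all illustrative assumptions.

```python
import numpy as np

N_STATES, N_ACTIONS, GOAL = 5, 2, 4      # toy chain: action 0 = left, 1 = right

def step(state, action):
    """Deterministic chain dynamics; reward 1.0 only when the goal is reached."""
    nxt = max(state - 1, 0) if action == 0 else min(state + 1, GOAL)
    reward = 1.0 if nxt == GOAL else 0.0
    return nxt, reward, nxt == GOAL

def epsilon_greedy(q_row, epsilon, rng):
    """Random action with probability epsilon, else greedy with random tie-breaking."""
    if rng.random() < epsilon:
        return int(rng.integers(N_ACTIONS))
    best = np.flatnonzero(q_row == q_row.max())
    return int(rng.choice(best))

def q_learning(episodes=500, alpha=0.5, gamma=0.9, epsilon=0.1, max_steps=100, seed=0):
    rng = np.random.default_rng(seed)
    Q = np.zeros((N_STATES, N_ACTIONS))
    for _ in range(episodes):
        s = 0
        for _ in range(max_steps):
            a = epsilon_greedy(Q[s], epsilon, rng)
            s2, r, done = step(s, a)
            # Off-policy target: bootstrap from the best action in the next state.
            target = r + (0.0 if done else gamma * Q[s2].max())
            Q[s, a] += alpha * (target - Q[s, a])
            s = s2
            if done:
                break
    return Q

Q = q_learning()
policy = Q.argmax(axis=1)                # "prediction": greedy action per state
```

After training, the greedy policy should pick "right" in every non-goal state of this toy chain.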

tianyimasf · 2023-10-26T21:53:32Z

💡 RL with SARSA -- training & prediction using numpy

tianyimasf · 2023-10-26T21:54:24Z

> > suggestion: MLP, ANN, Markov chain, reinforcement learning
> > also if anyone knows probabilistic graphical models...
>
> Good suggestions, can you make them more specific?
>
> e.g.
>
> Create an MLP with well-defined back-propagation using numpy, etc.
>
> and lead with 💡 so we can vote on it! 🌟

idk anything about probabilistic graphical models, so I'll leave it to others to suggest the details.

raynardj added the good first issue, help wanted, and question labels Oct 19, 2023

raynardj self-assigned this Oct 19, 2023

raynardj changed the title ~~🌟 possible targets~~ 🌟 Project Candidates Oct 19, 2023
