Financial Machine Learning

AI in financial Economics Capstone Project Page

Marcos Lopez De Prado의 Advances in Financial Machine Learning을 참조하여 제작하였습니다
서강대학교 경제학부 AI Finance의 프로젝트 페이지 입니다
개인 사용 및 프로젝트 목적으로 만들었으므로 틀린 부분이 있을 수 있습니다

Introduce FinancialMachineLearning Library

주요 함수에 대해 설명합니다

Bar Sampling
BarSampling 함수를 사용해 간편하게 Sampling이 가능합니다

import FinancialMachineLearning as fml
dollar_df = fml.BarSampling(df, 'dv', dollar_M)

Fractionally Differencing
fracDiff 함수를 사용하여 분수 미분이 가능합니다

frac_diff = fml.fracDiff_FFD(dollar_df, min_ffd, thres = 1e-5)

CUSUM Filtering
getTEvents 함수를 이용해 fractionally Differentiated Dollar bar price 계열로부터 이벤트 추출 Sampling이 가능합니다

tEvents = fml.getTEvents(frac_diff.price, h = dfx2.std().iat[0] * 2)

Rolling Volatility
getDailyVolatility 함수를 사용하여 지정된 기간동안의 변동성 추정이 가능합니다(default = 100)

dailyVol = fml.getDailyVolatility(dollar_df)

Vertical Barrier
addVerticalBarrier 함수를 이용하여 수직 배리어 구축이 가능합니다

t1 = fml.addVerticalBarrier(tEvents, dollar_df, numDays = 5)

Concurrency
getConcurrentBar : 중첩 레이블 상태에 있는 Bar 추출 가능
getAvgLabelUniq : Label의 평균 고유도를 추정
mpPandasObj를 통해서 python MultiProcessing이 가능

numCoEvents = fml.mpPandasObj(fml.getConcurrentBar, ('molecule', events.index), cpus, closeIdx = feature_Mat.index, t1 = events['t1'])
numCoEvents = numCoEvents.loc[~numCoEvents.index.duplicated(keep = 'last')]
numCoEvents = numCoEvents.reindex(feature_Mat.index).fillna(0)
out = pd.DataFrame()
out['tW'] = fml.mpPandasObj(fml.getAvgLabelUniq, ('molecule', events.index), cpus, t1 = events['t1'], numCoEvents = numCoEvents)

Cross Validation Score
cvScore함수를 통해 순차적 데이터에 대한 교차 검증 점수를 계산할 수 있습니다

rf = RandomForestClassifier(n_estimators = 1000, criterion = "entropy", bootstrap = True,
                                n_jobs=1, random_state=42, class_weight = 'balanced_subsample', oob_score=False)
cv_gen = KFold(n_splits = 10, shuffle = False)
score = fml.cvScore(rf, X, y, sample_weight = cweight, scoring = 'neg_log_loss', cv = None, cvGen = cv_gen, pctEmbargo = 0)

단, 여기서 사용되는 sample_weight는 행렬 X,y와 길이가 같아야 합니다. 또한, pandas version은 1.5.3으로 맞춰야 합니다

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
.idea		.idea
.ipynb_checkpoints		.ipynb_checkpoints
Data		Data
catboost_info		catboost_info
.gitattributes		.gitattributes
AIFinance.ipynb		AIFinance.ipynb
Chapter 02.ipynb		Chapter 02.ipynb
Chapter 03.ipynb		Chapter 03.ipynb
Chapter 04.ipynb		Chapter 04.ipynb
Chapter 05.ipynb		Chapter 05.ipynb
Chapter 06.ipynb		Chapter 06.ipynb
Chapter 07.ipynb		Chapter 07.ipynb
Chapter 08.ipynb		Chapter 08.ipynb
Chapter 09.ipynb		Chapter 09.ipynb
Chapter 10.ipynb		Chapter 10.ipynb
Chapter 11.ipynb		Chapter 11.ipynb
Chapter 12.ipynb		Chapter 12.ipynb
Chapter 13.ipynb		Chapter 13.ipynb
Chapter 14.ipynb		Chapter 14.ipynb
Chapter 15.ipynb		Chapter 15.ipynb
Chapter 16.ipynb		Chapter 16.ipynb
DRB test.ipynb		DRB test.ipynb
FinancialMachineLearning.py		FinancialMachineLearning.py
KOSPI200_future.ipynb		KOSPI200_future.ipynb
KOSPI_future200.ipynb		KOSPI_future200.ipynb
README.md		README.md
Real project.ipynb		Real project.ipynb
brownian_motion.py		brownian_motion.py
kosdaq.ipynb		kosdaq.ipynb
sp500feature.csv		sp500feature.csv
sp500featureBin.csv		sp500featureBin.csv
test.py		test.py

Tommylee1013/Advances-in-Financial-Machine-Learning

Folders and files

Latest commit

History

Repository files navigation

Financial Machine Learning

Introduce FinancialMachineLearning Library

About

Topics

Resources

Stars

Watchers

Forks

Languages