Statmanager-kr

Available Operating Systems

Availabe Python Versions

Statmanager-kr

Open-source Statistical Package for Python based on the Pandas.

Python과 Pandas 사용자를 위한 오픈소스 통계 패키지

Statmanager-kr is developed for researchers, data scientists, psychologist, studends, and anyone who need statistical analysis to validate their hypothesis. The statmanager-kr aims to organize packages that are "convenient to use", "uncompliated to use", and "convenient to see results". The end goal of statmanager-kr is to be a simple and useful package that can be used by people who don't know much about Python and Pandas.

Statmanager-kr은 가설을 검증하기 위해 통계 분석이 필요한 연구원, 데이터분석가, 심리학자, 학생 등을 위해 개발되었습니다. statmanager-kr은 사용이 쉽고, 복잡하지 않은 통계 패키지를 목표로 지속적으로 개발됩니다. statmanager-kr 개발의 최종 목표는 Python과 Pandas를 잘 알지 못하는 사람도 이용할 수 있는 매우 간편하면서도 유용한 통계 패키지를 만드는 것입니다.

Currently, KOREAN and ENGLISH are supported.
현재 지원하는 언어 세팅은 한글과 영어입니다.

Documentaion

한글 공식 문서
Official Documentation

Notifications :

Source codes are available in the Github respository
소스코드는 깃헙 레포지토리에서 확인할 수 있습니다.

For updates, please see the notice in the documentation or the Github release.
업데이트 내역은 정식 문서 내 공지사항 혹은 Github release에서 확인하시기 바랍니다.

Contribution Guidelines

Please check the guidelines in official documentation.
공식 문서 내 기여 가이드라인을 확인해주시기 바랍니다.

Please use Github Discussion to let me know if you have any questions, bugs you encounter, suggestions, etc. Of course, you can also email the developer directly.
궁금하신 점, 발생하는 버그, 제안 사항 등 모든 것은 Github Discussion을 활용해서 알려주시면 감사하겠습니다. 물론, 개발자에게 직접 이메일을 보내셔도 됩니다.

Quick Start with sample jupyter notebook file
Available functions | 현재 사용 가능한 분석
- Read detailed instructions | 상세 사용법 열람
  1. Normality assumption | 정규성 가정
  2. Homoskedasticity assumption | 등분산성 가정
  3. Reliability | 신뢰도 확인
  4. Frequency analysis | 빈도분석
  5. Correlation analysis | 상관분석
  6. Comparison (2) | 차이비교 (2)
  7. Comparison (3) | 차이비교 (3)
  8. Regression
Available functions to make figure or graph | 그래프 혹은 그림 제작에 활용되는 기능
- P-P plot
- Q-Q plot
- Histogram
- Histogram (cumulative)
- Pointplot (within differences)
- Boxplot (between group difference)

Dependency

pandas
statsmodels
scipy
numpy
matplotlib
seaborn
XlsxWriter

It is recommended to use the latest versions of these libraries and packages to avoid unexpected errors. 예상치 못한 오류를 방지하기 위하여, 위 패키지 및 라이브러리는 항상 최신 버전으로 업데이트할 것을 권고합니다.

Recommendation

Using "Jupyter Notebook" is STRONGLY RECOMMENDED (Of course, statmanager-kr works just as well in a Python environment)
"주피터 노트북(Jupyter Notebook)" 사용을 강력하게 권고합니다. 물론, Python 환경에서도 statmanager-kr은 문제없이 작동합니다.

Installing statmanager-kr

pip install statmanager-kr

Updating statmanager-kr

pip install statmanager-kr --upgrade

Quick Start

Import

import pandas as pd
from statmanager import Stat_Manager

df = pd.read_csv('testdf.csv', index_col = 'id')
sm = Stat_Manager(df, language = 'eng')

Independent Samples T-test

sm.progress(method = 'ttest_ind', vars = 'age', group_vars = 'sex').figure()

Output (Click to See)

	female	male
n	15.00	15.00
mean	27.33	28.00
median	26.00	26.00
sd	4.88	6.94
min	21.00	20.00
max	39.00	39.00

dependent variable	t-value	degree of freedom	p-value	95% CI	Cohen'd
height	-0.304	28	0.763	[-5.153, 3.820]	-0.111

Dependent Samples T-test

sm.progress(method = 'ttest_rel', vars = ['prescore', 'postscore']).figure()

Output (Click to See)

	prescore	postscore
n	…	…
mean	5.13	4.23
median	5.50	4.00
sd	2.85	2.91
min	…	…
max	…	…

variables	t-value	degree of freedom	p-value	95% CI	Cohen's d
['prescore', 'postscore']	1.198	29	0.24	[-0.636, 2.436]	0.313

Pearson's Correlation

sm.progress(method = 'pearsonr', vars = ['income', 'prescore', 'age']).figure()

Output (Click to See)

	n	Pearson's r	p-value	95%_confidence_interval
income & prescore	30	-0.103	0.588	[-0.447, 0.267]
income & age	30	-0.051	0.789	[-0.404, 0.315]
prescore & age	30	-0.044	0.816	[-0.398, 0.321]

	income	prescore	age
income	1.000	-0.103	-0.051
prescore	-0.103	1.000	-0.044
age	-0.051	-0.044	1.000

One-way ANOVA with Post-hoc test

sm.progress(method = 'f_oneway', vars = 'age', group_vars = 'condition', posthoc = True).figure()

Output (Click to See)

	test_group	sham_group	control_group
n	10	10	10
mean	28.5	28.3	26.2
median	27	29	25.5
sd	6.57	5.56	5.88
min	…	…	…
max	…	…	…

	sum_sq	df	F	p-value	partial eta squared
Intercept	6864.4	1	189.469	0	0.872
C(condition)	32.467	2	0.448	0.644	0.004
Residual	978.2	27	NaN	NaN	0.124

Test Multiple Comparison ttest_ind FWER=0.05 method=bonf alphacSidak=0.02, alphacBonf=0.

group1	group2	stat	pval	pval_corr	reject
control_group	sham_group	-0.8204	0.4227	1	FALSE
control_group	test_group	-0.8246	0.4204	1	FALSE
sham_group	test_group	-0.0735	0.9422	1	FALSE

One-way Repeated Measure ANOVA with Post-hoc test

sm.progress(method = 'f_oneway_rm', vars = ['prescore','postscore','fupscore'], posthoc = True).figure()

Output (Click to See)

	prescore	postscore	fupscore
n	30.00	30.00	30.00
mean	5.13	4.23	4.37
median	5.50	4.00	4.00
sd	2.85	2.91	2.62
min	…	…	…
max	…	…	…

	F Value	Num DF	Den DF	p-value	partial etq squared
variable	1.079	2	58	0.347	0.02

Test Multiple Comparison ttest_ind FWER=0.05 method=bonf alphacSidak=0.02, alphacBonf=0.

group1	group2	stat	pval	pval_corr	reject
fupscore	postscore	0.1866	0.8526	1	FALSE
fupscore	prescore	-1.0849	0.2824	0.8473	FALSE
postscore	prescore	-1.2106	0.231	0.6929	FALSE

Related Software

As mentioned earlier, Statmanager-kr was developed to provide a user-friendly way to perform statistical analysis methods to test hypotheses, even if the researcher is not familiar with programming languages such as Python. As such, a related software that provides similar user-friendly features is Pingouin.

The main difference is that Statmanager-kr was developed with the goal of being a package that can be used by researchers who lack programming knowledge or experience. To this end, rather than implementing independent methods for each analysis, Statmanager-kr is designed to allow users to enter code in the same way at any time to perform statistical analysis and obtain the results. Of course, Pingouin also has user-friendly characteristics, but it is a package that is better suited for users with more programming experience and knowledge than Statmanager-kr. Due to this difference in characteristics, Statmanager-kr does not support the ability to fine-tune analysis methods by adjusting parameters, whereas Pingouin is useful for adjusting parameters to obtain more careful and suitable results.

In conclusion, Statmanager-kr is a good package for researchers who lack programming experience and knowledge and want to see results quickly. Pingouin, on the other hand, is a more suitable package for researchers with more programming experience and knowledge, who need a fine-tuned approach to each analysis method.

앞서 말했듯, Statmanager-kr은 Python과 같은 프로그래밍 언어에 익숙하지 않더라도, 가설을 검증하기 위한 통계 분석 방법을 유저 친화적으로 제공하기 위해 개발되었습니다. 이와 같이, 유저 친화적인 기능을 제공하는 유관 소프트웨어로는 대표적으로 Pingouin을 들 수 있습니다.

가장 큰 차이점은, Statmanager-kr은 프로그래밍 관련 지식이나 경험이 부족한 연구자도 사용할 수 있는 패키지를 목표로 개발되었다는 점입니다. 이를 위해 Statmanager-kr은 분석방법 별로 독립적인 메소드를 구현하기보다, 사용자가 언제든 하나의 메소드에 동일한 방식의 코드를 입력하여 통계 분석을 실행하고 결과를 얻을 수 있도록 설계되었습니다. Pingouin도 사용자 친화적인 특성을 공유하나, Statmanager-kr과 비교하였을 대에는 비교적 프로그래밍 경험과 지식이 많은 사용자에게 적합한 패키지입니다. 다만, 이러한 차이로 인해 Statmanager-kr은 매개 변수를 조정하여 분석 방법을 세밀하게 조율하는 기능을 지원할 수 없습니다. 반면, Pingouin은 매개 변수를 조정하여 보다 세심하고 적합한 결과를 확보하는 데 유용합니다.

결론적으로, Statmanager-kr은 프로그래밍 경험과 지식이 부족하며, 빠르게 결과를 확인하고자 하는 연구자에게 적합한 패키지입니다. 반면, Pingouin은 프로그래밍 경험과 지식이 비교적 풍부하며, 각 분석 방법별로 세밀한 조정과 접근이 필요한 연구자에게 보다 적합한 패키지입니다.

Name		Name	Last commit message	Last commit date
Latest commit History 203 Commits
.github/workflows		.github/workflows
article		article
doc		doc
statmanager		statmanager
test		test
testdata		testdata
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
test.ipynb		test.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Available Operating Systems

Availabe Python Versions

Statmanager-kr

Open-source Statistical Package for Python based on the Pandas.

Python과 Pandas 사용자를 위한 오픈소스 통계 패키지

Documentaion

Notifications :

Contribution Guidelines

Dependency

Recommendation

Installing statmanager-kr

Updating statmanager-kr

Quick Start

Import

Independent Samples T-test

Dependent Samples T-test

Pearson's Correlation

One-way ANOVA with Post-hoc test

One-way Repeated Measure ANOVA with Post-hoc test

Related Software

Development: Changseok Lee

Copyright (C) 2023 Changseok Lee

About

Releases 17

Contributors 2

Languages

License

ckdckd145/statmanager-kr

Folders and files

Latest commit

History

Repository files navigation

Available Operating Systems

Availabe Python Versions

Statmanager-kr

Open-source Statistical Package for Python based on the Pandas.

Python과 Pandas 사용자를 위한 오픈소스 통계 패키지

Documentaion

Notifications :

Contribution Guidelines

Dependency

Recommendation

Installing statmanager-kr

Updating statmanager-kr

Quick Start

Import

Independent Samples T-test

Dependent Samples T-test

Pearson's Correlation

One-way ANOVA with Post-hoc test

One-way Repeated Measure ANOVA with Post-hoc test

Related Software

Development: Changseok Lee

Copyright (C) 2023 Changseok Lee

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 17

Contributors 2

Languages