Skip to content

kadeep47/EDGAR-Explorer-Analyzing-Management-Discussions-in-10-K-Filings

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Retrieve 10-K Filings from Edgar Website

This project retrieves 10-K filings from the Edgar website for all the companies in the S&P 500 for the years 2011 to 2022. The code is written in R and uses the quantmod and edgar packages.

Getting Started

To get started, clone this repository to your local machine. Then, install the quantmod and edgar packages.


install.packages("quantmod")
install.packages("edgar")

source("retrieve_10k_filings.R")

This will download the 10-K filings for all the companies in the S&P 500 for the years 20XX to 20XX and save them to a file called cik_year.rds, where cik is the company's CIK number and year is the year of the filing.

Data

The data is stored in a file called cik_year.rds. This file is a compressed R data file and can be loaded into R using the loadRDS() function.

Analysis

The data can be analyzed using a variety of R packages. For example, the dplyr package can be used to clean and transform the data, and the ggplot2 package can be used to visualize the data.

Future Work

Expand the data set to include more companies filing 10-Ks with the SEC. Update the data set to include more recent filings for up-to-date analysis. Improve the data cleaning process using advanced techniques such as natural language processing (NLP). Develop new analysis methods using advanced statistical and machine learning techniques. These future work points aim to enhance the project by including a wider range of companies, capturing more recent data, improving data quality through advanced cleaning techniques, and applying advanced analysis methods. These improvements will help gain a deeper understanding of changing strategies, market share, and customer-centric improvements made by companies.

Contributing

If you would like to contribute to this project, please fork the repository and submit a pull request.

License

This project is licensed under the MIT License.

Acknowledgments

This project utilizes the following open-source libraries and resources:

An R package for interacting with the SEC's EDGAR filing search and retrieval system.

A tool for the U.S. SEC EDGAR retrieval and parsing of corporate filings.

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages