-
Notifications
You must be signed in to change notification settings - Fork 0
Home
Manuel A. Vázquez edited this page Dec 19, 2019
·
8 revisions
Welcome to the nihmporter wiki!
You can use make_conda_environment.sh to build a proper Anaconda environment (by default, named nih
).
Activate the above environment and run
# after activating the appropriate conda environement
./import.py
It should result in some feather/pickle (as of December 2019, feather files don't suport > 2GB columns) files, each one storing a Pandas dataframe.
connectivity_stats.py
allows to compute some statistics (such as "number of projects that are not associated with any publication", and such) from the CSV data generated by the main program. Hence, connectivity_stats.py
can only be run after import.py
.
Some useful links:
Some acronyms that are mentioned above:
- DUNS: Data Universal Numbering System: for identifying organizations