Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

pull main codebase #1

Merged
merged 108 commits into from
Sep 25, 2021
Merged

pull main codebase #1

merged 108 commits into from
Sep 25, 2021

Conversation

sa501428
Copy link
Owner

No description provided.

nchernia and others added 30 commits August 17, 2017 09:26
Update generate_site_positions.py
…riate place; bug fix in countligations for stem
This is a complete rewrite of generate_site_positions.py.
The new version runs under python3 as well as python2.
It corrects an error found in the previous version where the very last number in the result was wrong (it was one too many).
It can work with multiple test strings at the same time (they must be enumerated in a python list).
Wildcard search (using N, M, R, W, Y, S, K, H, B, V, D) is also possible with the new algorithm.
Compared to the previous version this program runs faster for simple test strings (strings without wildcards).
The script contains a dictionary for genome/filename mappings and another one for enzyme/test string mappings which users can modify easily according to their needs.
When there is no 50 bp bin with 1000 or more counts, the awk command failed to create the "sum" variable, so "bins1000" was NULL. I added a condition for this awk command to return 0 when this happens.
Update calculate_map_resolution.sh
Enable multithreading option for "bwa aln"
multithreading for "bwa aln"
cErikson and others added 28 commits August 8, 2019 14:27
Added a check were if ligation is a regex, and unquoted, then add quotes. Removed quotes from count_ligation.sh
Fixed issues with multiple ligation, add Arima ligation
A complete rewrite of generate_site_positions.py
Major:
- BWA now aligns in paired end mode. This requires BWA version 0.7.17 or higher; short read and short end mode
are now deprecated

Minor:
- The chimera handling script now includes the header and prints out tab-delimited, for better conversion to BAM; it also no longer looks for the /1, /2 but rather looks for the SAM flag
- We dedup collisions now
- An addition to the dups script makes it run faster and with less memory when there are a lot of duplicates
- Statistics updated in CPU to properly handle multiple ligations; also added scripts in CPU that were missing for mega
- Made the names correct in the stats_sub script
- Count ligations explicitly excludes the readname in the fastq file
…ace difference in split_rmdups.awk. Dups.awk now the same as rest of Juicer, with additional code block to exit out of loop when second read end gets too far away.
…icer versions, made fragment delimited maps no longer the default; can include via the -f flag
suppress too many args error if host returns >1 line
@sa501428 sa501428 merged commit 40261bf into sa501428:master Sep 25, 2021
sa501428 pushed a commit that referenced this pull request Jan 23, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
10 participants