Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sz dyst peaker #457

Open
wants to merge 13 commits into
base: main
Choose a base branch
from
Open

Sz dyst peaker #457

wants to merge 13 commits into from

Conversation

shadizaheri
Copy link
Collaborator

@shadizaheri shadizaheri commented Jul 24, 2024

  • Modified DystPeaker WDL to include SaveRLArray.gcs_path as a main output
  • Ensured that saveLRArrayOutput is available for copying to the Terra data table
  • The modified wdl is validated on Terra

SHuang-Broad and others added 13 commits November 8, 2023 12:08
  * update Hifiasm to version 0.19.5
  * update how Hifiasm outputs are compressed (bgz replacing gz), also
  * monitor hifiasm resources usage
  * update docker used in PBSV tasks to the version coming with official SMRTLink releases (2.9.0)
  * change how the 2-step PBSV process is done (following the recommended way now)
  * to version 2.0.7
  * using TRF bed
  * conditionally phase sv (requires phased bam)
  * generates its own vcf.gz and tbi
Overhaul how small variants are called in the WG pipelines

  * default to use DV to call small variants, Clair3 analysis needs to be requested explicitly
  * retire the Pepper toolchain completely from the CCS pipeline, using DV directly
  * for R10.4+ ONT data, also use DV directly
  * older ONT data would still use the PEPPER-DV-Margin pipeline
  * offers GPU version (though based on, it's not worth it yet)
  * update how bam haplotagging is done

Cleanup structural variants calling
  * experiment with SNF2 phasing SV calls (implicitly depends on small variants calling now)
  * tune PBSV calling
    - discover now supports --hifi
    - output vcf.gz and tbi
    - less verbose logging by default

Misc.:
  * optimizations to BAM merging and metrics workflow
  * updates coverage collection step
  * new R script to visualize log from vm_monitoring_script.sh
  * organize dockstore.yml file a bit

  * make WDL validation shell script more usable

  * update pbmm2 and pbindex to versions in SMRTLink

  * update GeneralUtils.wdl
    - two bash-like new tasks [CoerceMapToArrayOfPairs, CoerceArrayOfPairsToMap]
    - cleanup task CollapseArrayOfStrings

  * update resource allocations to tasks
    - NanoplotFromBam (also changes docker)
    - MosDepthWGS
  * incorporates gcloud cli (not just gsutil)
  * integrate libdeflate for more speedups
incorporate new tasks and optimize them

  * [CountMethylCallReads, GatherReadsWithoutMethylCalls]
    from sh_beans

  * [GetPileup, BamToRelevantPileup]
    from sh_more_atomic_qc

  * [GetReadGroupLines, GetSortOrder, SplitNameSortedUbam]
    from sh_ont_fc

  * [SamtoolsFlagStats, ParseFlagStatsJson]
    from sh_trvial_stats

  * [FilterBamByLen, InferSampleName]
    from sh_seqkit

  * [CountAlignmentRecords, StreamingBamErrored, CountAlignmentRecordsByFlag]
    from sh_maha_aln_metrics

  * [ResetSamplename]
    from sh_ingest_singlerg

  * [MergeBamsWithSamtools]
    from sh_ont_fc.Utils.wdl

  * [BamToFastq]
    from sh_more_bam_qcs
    and optimize it with
    sh_ingest_singlerg.Utils.wdl

delete
  * GetSortOrder as that's now implemented in GatherBamMetadata
  * Drop2304Alignments as that's no longer used

update dockers to the latest
@shadizaheri shadizaheri marked this pull request as ready for review July 24, 2024 12:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants