Methods
- Import / Export
- Statistics
- Genetics
VEPConfig
VEPConfigGRCh37Version85
VEPConfigGRCh38Version95
balding_nichols_model()
concordance()
filter_intervals()
filter_alleles()
filter_alleles_hts()
hwe_normalized_pca()
genetic_relatedness_matrix()
realized_relationship_matrix()
impute_sex()
ld_matrix()
ld_prune()
compute_charr()
mendel_errors()
de_novo()
nirvana()
sample_qc()
_logistic_skat()
skat()
lambda_gc()
split_multi()
split_multi_hts()
summarize_variants()
transmission_disequilibrium_test()
trio_matrix()
variant_qc()
vep()
- Relatedness
- Miscellaneous
Import / Export
|
Export a |
|
Export a |
|
Export MatrixTable as |
|
Export a |
|
Export a |
|
Extract metadata from VCF header. |
|
Import a UCSC BED file as a |
|
Import BGEN file(s) as a |
|
Import a PLINK FAM file into a |
|
Import GEN file(s) as a |
|
Import a locus interval list as a |
|
Import tab-delimited file(s) as a |
|
Import a PLINK dataset (BED, BIM, FAM) as a |
|
Import delimited text file (text table) as |
|
Import VCF file(s) as a |
|
Index BGEN files as required by |
|
Read in a |
|
Read in a |
Statistics
|
Initialize a linear mixed model from a matrix table. |
|
For each row, test an input variable for association using a linear mixed model. |
|
For each row, test an input variable for association with response variables using linear regression. |
|
For each row, test an input variable for association with a binary response variable using logistic regression. |
|
For each row, test an input variable for association with a count response variable using Poisson regression. |
|
Run principal component analysis (PCA) on numeric columns derived from a matrix table. |
|
Computes the correlation matrix between row vectors. |
Genetics
|
Generate a matrix table of variants, samples, and genotypes using the Balding-Nichols or Pritchard-Stephens-Donnelly model. |
|
Calculate call concordance with another dataset. |
|
Filter rows with a list of intervals. |
|
Filter alternate alleles. |
|
Filter alternate alleles and update standard GATK entry fields. |
|
Compute the genetic relatedness matrix (GRM). |
|
Run principal component analysis (PCA) on the Hardy-Weinberg-normalized genotype call matrix. |
|
Impute sex of samples by calculating inbreeding coefficient on the X chromosome. |
|
Computes the windowed correlation (linkage disequilibrium) matrix between variants. |
|
Returns a maximal subset of variants that are nearly uncorrelated within each window. |
|
Compute CHARR, the DNA sample contamination estimator. |
|
Find Mendel errors; count per variant, individual and nuclear family. |
|
Call putative de novo events from trio data. |
|
Annotate variants using Nirvana. |
|
Computes the realized relationship matrix (RRM). |
|
Compute per-sample metrics useful for quality control. |
|
Test each keyed group of rows for association by linear or logistic SKAT test. |
|
Compute genomic inflation factor (lambda GC) from an Expression of p-values. |
|
Split multiallelic variants. |
|
Split multiallelic variants for datasets that contain one or more fields from a standard high-throughput sequencing entry schema. |
|
Performs the transmission disequilibrium test on trios. |
|
Builds and returns a matrix where columns correspond to trios and entries contain genotypes for the trio. |
|
Compute common variant statistics (quality control metrics). |
|
Annotate variants with VEP. |
Relatedness
Hail provides three methods for the inference of relatedness: PLINK-style identity by descent [1], KING [2], and PC-Relate [3].
identity_by_descent()
is appropriate for datasets containing one homogeneous population.king()
is appropriate for datasets containing multiple homogeneous populations and no admixture. It is also used to prune close relatives before usingpc_relate()
.pc_relate()
is appropriate for datasets containing multiple homogeneous populations and admixture.
|
Compute matrix of identity-by-descent estimates. |
|
Compute relatedness estimates between individuals using a KING variant. |
|
Compute relatedness estimates between individuals using a variant of the PC-Relate method. |
Miscellaneous
|
Searches given paths for all lines containing regex matches. |
|
Return a table containing the vertices in a near maximal independent set of an undirected graph whose edges are given by a two-column table. |
|
Rename duplicate column keys. |
|
Segment the interval keys of ht at a given set of points. |