Method for grouping samples which is understood by library functions

Population structure analysis depends on us being able to assign labels to samples. In this thread we discussed the possibility of using individual datasets, one for each group, but grouping by a categorical variable is much more flexible and idiomatic.

We need to develop conventions that let the user apply standard groupings to samples. Functions that use these groupings (Fst or divergence, for example) should understand the conventions and update the output dataset accordingly.
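As a rough illustration of what such a convention could look like, the sketch below stores one integer label per sample and lets a function aggregate by that label. The names `sample_cohort` and `count_cohort_alleles`, the use of `-1` for unassigned samples, and the array layout are all assumptions made for this example, not an agreed API.

```python
import numpy as np


def count_cohort_alleles(allele_counts, sample_cohort, n_cohorts):
    """Sum per-sample allele counts within each cohort.

    allele_counts: array of shape (variants, samples, alleles)
    sample_cohort: array of shape (samples,), integer cohort label per
        sample; -1 marks a sample that belongs to no cohort (assumption)
    Returns an array of shape (variants, cohorts, alleles).
    """
    allele_counts = np.asarray(allele_counts)
    sample_cohort = np.asarray(sample_cohort)
    n_variants, _, n_alleles = allele_counts.shape
    out = np.zeros((n_variants, n_cohorts, n_alleles), dtype=allele_counts.dtype)
    for cohort in range(n_cohorts):
        # Boolean mask over the samples dimension selects one cohort.
        in_cohort = sample_cohort == cohort
        out[:, cohort, :] = allele_counts[:, in_cohort, :].sum(axis=1)
    return out


# 2 variants, 4 diploid samples, 2 alleles (made-up toy data).
allele_counts = [
    [[2, 0], [1, 1], [0, 2], [1, 1]],
    [[0, 2], [0, 2], [2, 0], [1, 1]],
]
sample_cohort = [0, 0, 1, -1]  # the last sample is unassigned
cohort_counts = count_cohort_alleles(allele_counts, sample_cohort, n_cohorts=2)
```

Keeping the label as a single variable aligned with the samples dimension means any downstream function can find it by name instead of being handed one dataset per group.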

It may also be worth thinking about how we might group the variants dimension while we’re thinking about it. For example, we might want to get the pairwise Fst values for all pairs of populations, for all chromosomes in a dataset. It would be super-nice if we had an idiomatic way of running these calculations.
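To make the "all pairs of populations, for all chromosomes" case concrete, here is a minimal sketch that assumes per-cohort allele counts of shape (variants, cohorts, alleles) and a per-variant contig index are already available. The function names are hypothetical, and `allele_freq_diff` is a deliberately simple stand-in statistic, not Fst; a real estimator would be passed in through the same `stat` argument.

```python
from itertools import combinations

import numpy as np


def allele_freq_diff(ac_a, ac_b):
    """Stand-in statistic (NOT Fst): mean absolute difference in the
    frequency of allele 0 between two cohorts.

    Assumes every variant has at least one allele call in each cohort.
    """
    freq_a = ac_a[:, 0] / ac_a.sum(axis=1)
    freq_b = ac_b[:, 0] / ac_b.sum(axis=1)
    return float(np.mean(np.abs(freq_a - freq_b)))


def pairwise_by_contig(cohort_ac, variant_contig, stat):
    """Apply `stat` to every pair of cohorts, separately for each contig.

    cohort_ac: array of shape (variants, cohorts, alleles)
    variant_contig: array of shape (variants,), integer contig index
    Returns a dict keyed by (contig, cohort_a, cohort_b).
    """
    cohort_ac = np.asarray(cohort_ac)
    variant_contig = np.asarray(variant_contig)
    n_cohorts = cohort_ac.shape[1]
    results = {}
    for contig in np.unique(variant_contig):
        on_contig = variant_contig == contig  # groups the variants dimension
        for a, b in combinations(range(n_cohorts), 2):
            results[(int(contig), a, b)] = stat(
                cohort_ac[on_contig, a, :], cohort_ac[on_contig, b, :]
            )
    return results


# 3 variants on 2 contigs, 2 cohorts, 2 alleles (made-up toy data).
cohort_ac = [
    [[3, 1], [0, 2]],
    [[0, 4], [2, 0]],
    [[2, 2], [1, 1]],
]
variant_contig = [0, 0, 1]
pairwise = pairwise_by_contig(cohort_ac, variant_contig, allele_freq_diff)
```

Both groupings are driven by plain label arrays, one along samples (already folded into `cohort_ac`) and one along variants, so the same pattern extends to other per-variant groupings such as windows.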

@eric-czech - you had some concrete ideas on this in the earlier thread, what are your thoughts?

Top GitHub Comments

tomwhite commented, Oct 15, 2020

We still need to implement cohort subsets (and add some docs), so I’d like to leave this open until that’s done.

jeromekelleher commented, Sep 2, 2020

Thanks @eric-czech - cohort is an excellent choice. It’s clearly about samples, but also quite flexible in that it just means “samples with some shared characteristic”.
