Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

`Groupby` behaves differently depending on the order of the columns

See original GitHub issue

Describe the bug When creating a DataFrame, depending on the order of the columns the groupby() function works properly or returns an error.

To Reproduce This column order works perfectly:

let data = {
    worker: ["david", "david", "john", "alice", "john", "david"],
    hours: [5, 6, 2, 8, 4, 3],
    day: ["monday", "tuesday", "wednesday", "thursday", "friday", "friday"],
};
let df = new dfd.DataFrame(data);

df.groupby(["day"]).col(["hours"]).sum().print()

// ╔════════════╤═══════════════════╤═══════════════════╗
// ║            │ day               │ hours_sum         ║
// ╟────────────┼───────────────────┼───────────────────╢
// ║ 0          │ monday            │ 5                 ║
// ╟────────────┼───────────────────┼───────────────────╢
// ║ 1          │ tuesday           │ 6                 ║
// ╟────────────┼───────────────────┼───────────────────╢
// ║ 2          │ wednesday         │ 2                 ║
// ╟────────────┼───────────────────┼───────────────────╢
// ║ 3          │ thursday          │ 8                 ║
// ╟────────────┼───────────────────┼───────────────────╢
// ║ 4          │ friday            │ 7                 ║
// ╚════════════╧═══════════════════╧═══════════════════╝

df.groupby(["worker"]).count().print()
// ╔════════════╤═══════════════════╤═══════════════════╤═══════════════════╗
// ║            │ worker            │ hours_count       │ day_count         ║
// ╟────────────┼───────────────────┼───────────────────┼───────────────────╢
// ║ 0          │ david             │ 3                 │ 3                 ║
// ╟────────────┼───────────────────┼───────────────────┼───────────────────╢
// ║ 1          │ john              │ 2                 │ 2                 ║
// ╟────────────┼───────────────────┼───────────────────┼───────────────────╢
// ║ 2          │ alice             │ 1                 │ 1                 ║
// ╚════════════╧═══════════════════╧═══════════════════╧═══════════════════╝

But when I change the column order to the following it doesn’t work:

let data = {
    hours: [5, 6, 2, 8, 4, 3],
    worker: ["david", "david", "john", "alice", "john", "david"],
    day: ["monday", "tuesday", "wednesday", "thursday", "friday", "friday"],
};
let df = new dfd.DataFrame(data);

df.groupby(["day"]).col(["hours"]).sum().print()
// Uncaught Error: Can't perform math operation on column hours
//    arithemetic groupby.ts:266
//    operations groupby.ts:417
//    count groupby.ts:431

df.groupby(["worker"]).count().print()
// Uncaught Error: Can't perform math operation on column hours
//    arithemetic groupby.ts:266
//    operations groupby.ts:417
//    count groupby.ts:431

Expected behavior I would expect that changing the order of the columns wouldn’t make any change on the result.

Desktop (please complete the following information):

OS: Windows 11
Browser: Firefox v97.0.1, Chrome v98.0.4758.102, Edge v98.0.1108.56
Version: -

Additional context I’m using the browser version, not the node.js one.

Issue Analytics

State:
Created 2 years ago
Comments:10 (9 by maintainers)

Top GitHub Comments

1reaction

igonrocommented, Feb 22, 2022

Now I’m having a problem with ChromeHeadless in WSL2.

No binary for ChromeHeadless browser on your platform

I saw that @sponsfreixes had a similar issue in #173, I will try to fix it myself 😅

1reaction

igonrocommented, Feb 22, 2022

Thanks @risenW, I will try it!

Top Results From Across the Web

group by pandas dataframe and select latest in each group

In my tests, last() behaves a bit differently than nth(), when there are None values in the same column. For example, if first...

pandas GroupBy: Your Guide to Grouping Data in Python

groupby () can accept several different arguments: A column or list of columns; A dict or pandas Series; A NumPy array or pandas...

Group by: split-apply-combine - Pandas

Splitting the data into groups based on some criteria. ... If we also have a MultiIndex on columns A and B , we...

Groupby behaves differently when using levels and list of ...

When grouping by several levels of a MultiIndex, groupby evaltuates all possible combinations of the groupby keys. When grouping by column ...

Pandas Groupby Sort within Groups - Spark by {Examples}

You can sort values in descending order by using ascending=False param to sort_values() method. The head() function is used to get the first...

Troubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.

Start Free

Top Related Reddit Thread

No results found

Top Related Tweet

No results found

Top Related Dev.to Post

No results found

`Groupby` behaves differently depending on the order of the columns

Issue Analytics

Top GitHub Comments

Top Results From Across the Web

Top Related Medium Post

Top Related StackOverflow Question

Troubleshoot Live Code

Top Related Reddit Thread

Top Related Hackernoon Post

Top Related Tweet

Top Related Dev.to Post

Top Related Hashnode Post

Can't create a dataframe from a horizontal array?

New methods `iat` and `at` to access single value in DataFrame/Series