question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

docs: Create and Populate How-To Guides

See original GitHub issue

Create+Populate How-To Guides

There is a lack of how to guides on the website. Let’s make a list of them and start populating a useful section for code schematics.

I hope to keep this issue up as a rolling list of How to Guides, so if you stumble on this and your need isn’t included in the docs or this list, then comment your request here!

v3.2

  • How to compute the top k records
  • How to register tables in a DuckDB backend #4386
  • How to ffill and bfill using window functions (source) #4402

v4.0

  • How to join a table expression to a pandas dataframe (merged) (docs PR)
  • (WIP) How to chain ibis expressions (_ API) (#4914)
  • How to sessionize a log of events based on the elapsed time between successive transactions for a give user identifier (#4843).
  • How to convert column structures into columns (unpack)

v4.1+

  • How to create a table with a user-defined schema and efficiently load a large CSV file into it (handling common annoying things like the choice of the separator and custom date / time formats).
  • How to efficiently sample random rows from a large table (or expr) to quickly compute approximate descriptive statistics such as quantiles.
  • How to perform an asof_join (source)
  • How to interpolate dates (and/or other sequences)
  • [maybe?] How to use Ibis to generate and save Substrait/SQL
  • How to create a backend (incl. a section on SQLAlchemy backends specifically, but should focus on min requirements to get one up and running w/o SQLAlch)
  • How to chain joins (unknown if this should even be a how-to guide, but if #4309 is a fact of life then we should add this)

Let me know if you have any ideas and I’ll add them to the list.

Issue Analytics

  • State:open
  • Created a year ago
  • Comments:7 (6 by maintainers)

github_iconTop GitHub Comments

3reactions
ogriselcommented, Oct 7, 2022

Here are some more ideas:

  • How to create a table with a user-defined schema and efficiently load a large CSV file into it (handling common annoying things like the choice of the separator (and maybe quotes) and custom date / time formats).

  • How to sessionize a log of events based on the elapsed time between successive transactions for a give user identifier.

  • How to efficiently sample random rows from a large table (or expr) to quickly compute approximate descriptive statistics such as quantiles.

For the first item about CSV loading, I don’t know if there is a standard way to use the COPY statements of popular databases from ibis. Personally I used a raw_sql command with duckdb to be able to specify a custom date format in a recent experiment.

The second is about event-based data and should probably cross-reference #4402 (about ffil).

2reactions
jcmkk3commented, Oct 7, 2022

Below is a list of some resources that I think are good examples to better learn tools in the data ecosystem. Maybe some parts of these could be translated to use Ibis. Sometimes one of the hardest parts is to just figure out a good narrative flow or interesting example use case when writing tutorials.

Many of these examples have other elements outside of just data manipulation like visualization or maybe some stats/machine learning. It could get a little bit tricky to choose which other libraries to feature, but I do think that it is useful to see how different tools interact.

Read more comments on GitHub >

github_iconTop Results From Across the Web

How to Create and Autopopulate a Google Docs Template
How to automatically create a document from a template · Step 1: Create a new Zap. · Step 2: Set up your trigger....
Read more >
Create and manage documents | Google Docs
This page describes how to perform certain high-level tasks involving documents, such as: Create a document; Copy an existing document.
Read more >
How to Build the Best User Documentation (New Guide) | Blog
Have a product or service that needs user documentation? Here are some simple tips on how to build the best documentation for consumers....
Read more >
How to use Google Forms - Google Docs Editors Help
Step 1: Set up a new form or quiz · On a computer, open a spreadsheet at sheets.google.com. · Click Tools and then...
Read more >
Making a Fillable Google Doc - YouTube
In this video I go over how to make a fill in the blank setup in a google doc. Hope this helps! ......
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found