docs: Create and Populate How-To Guides
See original GitHub issueCreate+Populate How-To Guides
There is a lack of how to guides on the website. Let’s make a list of them and start populating a useful section for code schematics.
I hope to keep this issue up as a rolling list of How to Guides, so if you stumble on this and your need isn’t included in the docs or this list, then comment your request here!
v3.2
- How to compute the top k records
- How to register tables in a DuckDB backend #4386
- How to ffill and bfill using window functions (source) #4402
v4.0
- How to join a table expression to a pandas dataframe (merged) (docs PR)
- (WIP) How to chain ibis expressions (
_
API) (#4914) - How to sessionize a log of events based on the elapsed time between successive transactions for a give user identifier (#4843).
- How to convert column structures into columns (
unpack
)
v4.1+
- How to create a table with a user-defined schema and efficiently load a large CSV file into it (handling common annoying things like the choice of the separator and custom date / time formats).
- How to efficiently sample random rows from a large table (or expr) to quickly compute approximate descriptive statistics such as quantiles.
- How to perform an
asof_join
(source) - How to interpolate dates (and/or other sequences)
- [maybe?] How to use Ibis to generate and save Substrait/SQL
- How to create a backend (incl. a section on SQLAlchemy backends specifically, but should focus on min requirements to get one up and running w/o SQLAlch)
- How to chain joins (unknown if this should even be a how-to guide, but if #4309 is a fact of life then we should add this)
Let me know if you have any ideas and I’ll add them to the list.
Issue Analytics
- State:
- Created a year ago
- Comments:7 (6 by maintainers)
Top Results From Across the Web
How to Create and Autopopulate a Google Docs Template
How to automatically create a document from a template · Step 1: Create a new Zap. · Step 2: Set up your trigger....
Read more >Create and manage documents | Google Docs
This page describes how to perform certain high-level tasks involving documents, such as: Create a document; Copy an existing document.
Read more >How to Build the Best User Documentation (New Guide) | Blog
Have a product or service that needs user documentation? Here are some simple tips on how to build the best documentation for consumers....
Read more >How to use Google Forms - Google Docs Editors Help
Step 1: Set up a new form or quiz · On a computer, open a spreadsheet at sheets.google.com. · Click Tools and then...
Read more >Making a Fillable Google Doc - YouTube
In this video I go over how to make a fill in the blank setup in a google doc. Hope this helps! ......
Read more >Top Related Medium Post
No results found
Top Related StackOverflow Question
No results found
Troubleshoot Live Code
Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start FreeTop Related Reddit Thread
No results found
Top Related Hackernoon Post
No results found
Top Related Tweet
No results found
Top Related Dev.to Post
No results found
Top Related Hashnode Post
No results found
Top GitHub Comments
Here are some more ideas:
How to create a table with a user-defined schema and efficiently load a large CSV file into it (handling common annoying things like the choice of the separator (and maybe quotes) and custom date / time formats).
How to sessionize a log of events based on the elapsed time between successive transactions for a give user identifier.
How to efficiently sample random rows from a large table (or
expr
) to quickly compute approximate descriptive statistics such as quantiles.For the first item about CSV loading, I don’t know if there is a standard way to use the
COPY
statements of popular databases from ibis. Personally I used araw_sql
command with duckdb to be able to specify a custom date format in a recent experiment.The second is about event-based data and should probably cross-reference #4402 (about
ffil
).Below is a list of some resources that I think are good examples to better learn tools in the data ecosystem. Maybe some parts of these could be translated to use Ibis. Sometimes one of the hardest parts is to just figure out a good narrative flow or interesting example use case when writing tutorials.
Many of these examples have other elements outside of just data manipulation like visualization or maybe some stats/machine learning. It could get a little bit tricky to choose which other libraries to feature, but I do think that it is useful to see how different tools interact.