Thursday, March 27, 2025

Show HN: Xorq – open-source Python-first Pandas-style pipelines https://ift.tt/iFAE6sO

Show HN: Xorq – open-source Python-first Pandas-style pipelines Hi HN, Dan, Hussain and Daniel here… After years of struggling with data pipelines that worked in notebooks but failed in production, we decided to do something about it. We created xorq to eliminate the constant headaches of SQL/pandas impedance mismatch, runtime debugging, wasteful recomputations and unreliable research-to-production deployments that plague traditional pandas-style pipeline workflows. xorq is built on Ibis and DataFusion. We’d love your feedback and contributions. xorq is [Apache 2.0 licensed]( https://ift.tt/1VZT7tH ) to encourage open collaboration. Repo : https://ift.tt/rHbinCh Docs : https://docs.xorq.dev Roadmap Issues : https://ift.tt/rHbinCh You can get started `pip install xorq`. Or, if you use nix, you can simply run `nix run github:xorq-labs/xorq` and drop into an IPython shell. Demo video: https://youtu.be/jUk8vrR6bCw Here are some vignettes to look into next: 1. MCP Server + Flight + XGBoost: https://ift.tt/KtmrFo6 2. 1 DuckDB + 2 Writers + 1 Reader: https://ift.tt/1ScLgEO 3. OpenAI UDF: https://ift.tt/DBM5gHV Some features to note: - Ibis-based multi-engine expression system: effortless engine-to-engine streaming - Cache expressions with `.cache` operator - Portable DataFusion-backed UDF engine with first class support for pandas dataframes - Serialize Expressions to and from YAML - Easily build Flight end-points by composing UDFs thanks for checking this out, and we’re here to answer any questions! https://ift.tt/e0cQOWC March 27, 2025 at 10:57PM

No comments:

Post a Comment

Show HN: Infrabase – Prompt-Ops for AWS https://ift.tt/M8z7y4K

Show HN: Infrabase – Prompt-Ops for AWS We too at first thought that letting an LLM manage your AWS account must be a bad idea. What could p...