A mechanical typewriter

Open text files in legacy Python

At a first glance, opening files in Python is easy. All you have to do is call the built-in function open() and then you start reading from the file. However, often the content to be read is text, not just … Read More

Featured Post

Blue Yonder at PyCon.DE 2017 and PyData Karlsruhe

  Blue Yonder uses Python throughout its technology stack. We are aware that we profit a lot from open source and it has been always our policy to give back to the community. We have open sourced libraries like turbodbc, sqlalchemy_exasol, Mesos … Read More

Featured Post

Turbodbc and Apache Arrow

With the release of turbodbc 2.0.0, the popular ODBC module for Python provides built-in support for Apache Arrow, a universal data layer for fast column-oriented analytics. Head over to our guest post at the Apache Arrow blog to learn more … Read More

Featured Post

Public cloud cost control with Prometheus

Over the past decade, the entire IT industry has been pushing toward cloud computing. A recent article by Gartner argues the IT industry will be entering a second wave of cloud computing growth fueled by enterprise adoption. At Blue Yonder, we use the … Read More

Featured Post
A logo illustrating the construction on turbodbc with a crane and a hardhat

Making of turbodbc (part 2): C++ to Python

In the first part of this series, we learned how turbodbc builds a few layers of C++ code on top of the open database connectivity (ODBC) C API. We ended up with a reduced API that improves on its foundation … Read More

Featured Post

Efficient DataFrame Storage with Apache Parquet

In our daily work as data scientists, we deal with a lot with tabular data, also called DataFrames. At Blue Yonder, we use Pandas DataFrames to analyze and transform the data we need. One of our main challenges when we integrate new systems … Read More

Featured Post
A logo illustrating the construction on turbodbc with a crane and a hardhat

Making of turbodbc (part 1): Wrestling with the side effects of a C API

Databases are the core of modern information technology. Databases are used by web servers to store content, by businesses to place orders, and by data scientists as their data sources. To give Python users easy database access, the Python community … Read More

Featured Post

Data Science and DevOps: A Success Story

What is “Data Science”? There are many definitions of what data science is though some are not really serious: “Data science is statistics on a Mac. ” @bigdataborat Or this one, which is surprisingly to the point: “Data scientist (n.): Person … Read More

Featured Post
Apache Big Data

Blue Yonder at Apache Big Data Europe

At Blue Yonder, we make use of several Apache projects and actively contribute to three projects at the moment (Apache Aurora, Apache Arrow, and Apache Parquet) as project management committee (PMC) members and committers. The Apache Big Data Europe conference … Read More

Featured Post

It’s fresh. It’s scientific. It’s free.

Blue Yonder’s python library tsfresh automatically extracts features from time series or sequences and it has just been released as an open source project. Time series: They’re everywhere The computerization and automation of manufacturing operations as part of Industry 4.0 … Read More

Featured Post