Product Archives

Product 13 August, 2018

Wild Data Part 2: Transfer Style

Transfer Style uses the internal representations of an already-trained Convolutional Neural Network to transfer the style of one picture to another.

Product 1 August, 2018

Wild Data Part 1: Augmentation

Data augmentation is a basic technique for enlarging a dataset without collecting new data. Although the technique can be applied in a variety of domains, it is most commonly used in Computer Vision, which will be the focus of this post.

Product 18 July, 2018

Swarm Intelligence Metaheuristics, part 1: Ant Colony Optimization

This post will focus on a class of metaheuristics known as Swarm Intelligence. The most amazing feature of these algorithms is their ability to solve complex problems using a set of cooperating agents that each possess only very simple intelligence.

Product 5 July, 2018

Mesos multi Data Center architecture for Disaster Recovery

This post aims to show how to build an on-premise Mesos architecture that can handle a disaster scenario in which an entire Data Center is unavailable, and also covers some framework strategies for zero data loss.

Product 19 June, 2018

Correlation does not imply… sluggishness

Correlation is very often used within the initial exploratory stage when given a dataset, because of its ability to comb through pairs of variables and swiftly summarize whether they appear to be related or not.

Product 5 June, 2018

Statistical Comparison of Machine Learning Algorithms (Part 2)

This is the second (and last) part of the series dealing with the formal comparison of Machine Learning (ML) algorithms from a statistical point of view. In this post, we examine how statistical tests are applied to performance data of ML algorithms.

Product 16 May, 2018

Apache Ignite: More than a simple cache

Apache Ignite is a distributed in-memory cache, query and processing platform for working with large-scale data sets in real time (leaving aside streaming processing, Spark integration, Machine learning grid, Ignite FileSystem, persistence, transactions…)

Product 8 May, 2018

Cooking ML Models

Have you ever watched a cooking show? You have probably noticed that the chefs usually have all the ingredients already separated and chopped. Likewise, a data scientist will be more useful and creative building models than spending time on data preprocessing…

Product 20 April, 2018

Statistical Comparison of Machine Learning Algorithms (Part 1)

In industry, when a practitioner (often a Data Scientist) uses a machine learning algorithm to build a predictive model to solve a real-world problem, they are interested in the performance when the model is deployed into a production environment…

Product 5 April, 2018

The definitive visual build tool for Apache Spark: Sparta 2.0

Spark Streaming is one of the most widely used frameworks for real-time processing in the world, along with Apache Flink, Apache Storm and Kafka Streams. However, compared to the others, Spark Streaming has more performance problems, and it processes data in time windows instead of event by event, which results in delay.

