Using Spark SQLContext, HiveContext & Spark Dataframes API with ElasticSearch, MongoDB & Cassandra
In this post we will show how to use the different SQL contexts to query data on Spark. We will begin with Spark SQL and follow up with HiveContext. In additio... Read more.
How to aggregate Data in Real-Time with Stratio Sparta
When working with Big Data, you often need to aggregate data in real time, whether it comes from a specific service, such as social networks (Tw... Read more.
Variance in Scala (“Luke, he is your father too”)
When working with Big Data, sometimes it’s useful to remember that powerful products wouldn't work properly without the tools that build them.... Read more.
Stratio’s Lucene-based index for Cassandra is now a plugin
Thanks to the changes proposed in CASSANDRA-8717, CASSANDRA-7575 and CASSANDRA-6480, Stratio is glad to present its Lucene-based implementation of Cassandr... Read more.
A Spark-based analytics solution for Online Advertisers
This post contains the winning solution for the Stratio challenge 2015 developed by Marco Piva, Leonardo Biagioli, Fabio Fantoni and Andrea De Marco (BitBang).... Read more.
We write code in Scala and… We love it!
If you really want to learn and soak up every bit of Scala’s powerful functional features, try not to learn them all at once; pick one and try to think of part... Read more.
Supporting service-based multi realm authentication and authorization
Security is often a forgotten concern in Big Data environments. However, as these technologies are being embraced by companies with sensitive data (think, for e... Read more.
Top-k queries in Cassandra: An embedded mapreduce approach
Stratio has just added support for top-k queries to its Lucene-based implementation of Cassandra’s secondary indexes. This implementation was originally desig... Read more.
Spark-MongoDB library
With the Data Sources API now released, we wanted to take advantage of its new features, so we have developed a Spark-MongoDB libr... Read more.
Big Data Spain 2014 summary