When surfing the internet, it is quite easy to find sites comparing the most popular machine learning toolkits. These sites give you a lot of information about the strengths and weaknesses of the libraries and how they work, along with examples that show how easy each tool is to use.
In this post we will show how to use the different SQL contexts to query data on Spark. We will begin with Spark SQL and follow up with HiveContext. We will also run queries against various NoSQL databases and analyze the advantages and disadvantages of using them.
When working with Big Data, you often need to aggregate data in real time, whether it comes from a specific service, such as a social network (Twitter, Facebook…), or from more diverse sources, like a weather station.
We’re just a couple of days away from the Spanish general elections and Twitter is boiling over with campaign-related messages. People want to have a say in what goes on in their country and they turn to Twitter to express their opinions and feelings.
Proud to share the press release announcing Stratio as Huawei’s technological partner and looking forward to working together.
When working with Big Data, sometimes it’s useful to remember that powerful products wouldn’t work properly without the tools that build them.
Friday, July 31st is SysAdmin Appreciation Day and at Stratio we really want to go all out and celebrate it. Throughout the day we will be giving sysadmin-related talks and workshops.