Hadoop is a big data technology suite that provides a range of tools for storing, interacting with and manipulating large data sets, or to help solve big data problems which might relate to the structure and distribution of the data just as much as the size of it. With many companies approaching points where they need to be able to handle large amounts of data, either right now or looking forward in order to scale, Hadoop is one of the core technologies that can help aid you there. In this talk I'll provide an outline of HDFS, Hive and Spark, how you can use them; the differences between Hadoop and other technologies such as Elasticsearch; about Presto, a distributed SQL query engine that will allow you to query your data with simple SQL queries. And finally, I'll talk about how you can utilise these resources from within your Symfony application, allowing your platform to interact with huge volumes of data easily and quickly.

Comments

Comments are closed.