Setting up Hadoop, YARN, MR2, Hbase, Hive, Spark2, Kafka, Zookeeper cluster
Student at Faculty of organization and informatics
- Croatia, HR
- [email protected]
Innovative IT novice offering his full knowledge and experience to deliver highly effective and creative solutions to technology challenges. Highly organized with strong capacity to prioritize workload and always trying to preform within deadlines.
Successfully implemented and delivered application written in Swift.
Final work about distributed databases in Apache Cassandra.
Prototype of Android Application TKDtracer
In order to enroll, course "Database and Knowledge Bases"
Setting up cluster which will handle fault-tolerance, availability, NRT and Real-Time data processing.
HDP is giving ability to not only setting up cluster with key components but also ability to monitor whole cluster much easily. Confluent upon Kafka which gives kafka-connect another dimension with avro schema but also with huge amount of source and sink connectors. Configured Landoop UIs allowing easily tracking Kafka topics and it data, schema registries and also ability to manipulate the connectors.Cassandra brings availability, fault tolerance + brilliant performance under large sets of data.
With HBase we accomplished the main benefits over NoSQL data store solutions. His ability to quickly store and access sparse data is allowing in architecture to provide NRT random R/W upon Hadoop.
Testing throughput and latency of Apache Cassandra on Multi-node Cluster using Apache Spark as engine for big data processing. Also trying to compare the same results with Microsoft SQL Server performance.
The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure make it the perfect platform for mission-critical data.
Also was testing performance of Hadoop multi-node cluster configuration with Apache Drill and Zookeeper along with.
Created kafka-connect-hbase-sink for Hortonworks Data Platform v2.6 which has HBase v1.1.2 (by then, was available sinker for HBase v1.2)