Data ingestion is an art and science in itself. Ingesting data effectively into a Hadoop cluster or any other data store, requires a good understanding of the source and sink with an ability to configure data pipelines. Ingestion becomes a complex task, as we source events from multiple sources in parallel and need to deliver them to various destinations in real-time. High-speed data ingestion is especially critical when implementing real-time analytics.