Talend and MapR share a common vision around enabling businesses to utilize data as a strategic asset in their armory, so they can be more agile and remain competitive in the big data era. A recent survey of CIOs found that up to 80 percent are planning to deploy at least one (if not more) big data projects in 2017. It’s clear that big data has moved past what industry analyst firm Gartner would term ‘the hype cycle’ and into the mainstream. That said there are a few things still holding enterprises back from realizing the full business benefits big data technologies such as Spark and Hadoop can deliver.
Let’s ‘double click’ into some issues facing IT organizations that can hinder them from making progress towards experiencing real-time value from strategic investments in Hadoop and Spark.
Most IT teams are constantly being asked to do more with less, so it’s tempting to look for quick and easy fixes. Yet, ‘quick and easy’ is not always the best solution for the long term. One example of where most enterprises still tend to think ‘short term’ instead of ‘long term’ is when it comes to hand coding. Oftentimes, hand coding can offer simple fix to an integration challenge. It may even save 20% of your deployment costs in the beginning. However, the maintenance costs will increase by 200% down the road, according to research by Gartner.1 As big data projects scale, more systems and more coding are involved and maintaining the complex manual coding would be even more challenging with different people supporting the projects.
In today’s quickly evolving and competitive marketplace, big data and cloud technologies are moving at a breakneck pace. While you may be leveraging the right tools today, those may very likely be replaced/outdated within the next 18 months. Thus, IT decision makers need to invest wisely in solutions that are open, adaptable and easy to deploy/update within their existing infrastructures as business and market demands change.
Complexity of Data Movement
According to a survey by IDC, 82% of organizations are in some phase of adopting real-time analytics.2 With more big data use cases requiring real-time and streaming capabilities, Hadoop is no longer the “panacea” to everything. Oftentimes enterprises need to set up different clusters for streaming data such as IoT device data, clickstream data, network data, in addition to what they have as Hadoop clusters, enterprise storage and operational databases. The movement of data between these systems becomes a new challenge—creating a siloed architecture (as depicted in the figure below) that not only adds to overhead but also complicates the management of your technology stack.
Image Courtesy of MapR
Talend + MapR Unified Platform
So, how can organizations simplify this complexity? First and foremost, companies need solutions that streamline their existing data infrastructure and break down silos. Enterprises also need to ensure that the flow of information can scale to meet big data volumes and the need for real-time business insight. Recently Talend announced certification for Talend Data Fabric on the MapR Converged Data Platform to help enterprises get value out of their big data deployments in no time.
The MapR Converged Data Platform is the industry’s only enterprise-grade software solution that unifies big data and open source technologies with fast, native access to global event streaming, real-time database capabilities, and web-scale storage. Talend Data Fabric is now the industry’s first integration platform to natively support MapR Streams—among other MapR technologies—which helps customers continuously synchronize event processing across databases as information is updated, delivering accurate insight in real-time.
Together MapR and Talend help increase customer’s data agility with a unified platform approach that is open source-based, making it interoperable with multiple standards. As illustrated in the diagram below, Talend enables a smart data pipeline with MapR to design, deploy, ingest, integrate, prepare, cleanse, and optimize as you prepare your data for big data applications.
Talend also natively supports the entire MapR Converged Data Platform so you can tackle multiple big data workloads with this unified approach, whether it’s operational or analytical, batch or real-time. Project time-to-market can be greatly improved because customers can leverage the real-time/streaming capabilities from MapR Streams and Spark Streaming (natively supported by Talend) with no hand coding required, for automated integration at scale, with broad connectivity from Talend.
To learn more about the Talend and MapR joint solution, watch this webinar on-demand and download the Big Data Sandbox now with real-world scenarios that will allow you to trial all the capabilities of the Talend Data Fabric platform on MapR.
 “Does Custom Coding Stack Up to Data Integration Tools,” Mark A. Beyer, Ehtisham Zaidi, Eric Thoo, Gartner Research, September 2016.
 IDC, CloudView Survey 2016: Real-Time Analytics Adoption to Grow Rapidly, Especially for IoT, March 2016.f
About the Author - Shiyi Gu:
Shiyi Gu is the Product Marketing Manager for Big Data at Talend. Shiyi brings her expertise in Data Integration, Big Data and NoSQL, and is passionate about open source technologies. She loves helping customers connect the dots between technology and business value.