
Fig 02 - The two Snaplexes
The key elements of SnapLogic are Snaps and Snaplexes. Snaps are connectors, of which the company provides more than 500 out of the box, and which may be directed towards either structured or unstructured data, including B2B interchange formats such as HL7 or EDIFACT. There is also an SDK for developing your own snaps, you can leverage SOAP or REST, and you can even write scripts using Java, Python or JRuby in the Script Snap, or develop your own Snaps using the provided Java SDK. As far as Snaplex is concerned this is the data processing engine, of which there are two versions: Cloudplex, which runs in the cloud; and Groundplex, which is available in the customer domain, on-premises, or in your choice of cloud. These are shown in Figure 2. Note that data is never within the SnapLogic environment per se, only metadata. The Control Plane serves as the web browser-based user interface, as well as storing the metadata, providing authentication, authorisation, scheduling and relevant management functions to the platform as a whole.
SnapLogic refers to its solution as a “streaming data platform” and, while it supports streaming software such as Kafka, this is not what the company means. It is in fact referring to the fact that one step in a data pipeline will not have to complete before the next step begins. This is important for performance reasons. While on the subject of pipelines there are several other comments to make. To begin with, pipelines can call other pipelines. In addition, the Iris AI engine built into the Intelligent Integration Platform can make suggestions with respect to pipelines (and Snaps) that you might want to use. The software also has the ability to check the validity of any pipeline you define, and it can show you a preview of the results.
Going back to Iris we should say that SnapLogic was an early adopter of machine learning (and natural language processing) and it is significantly ahead of many of its competitors in exploiting this, adding productivity to the SnapLogic development environment. Iris is powered by the SnapLogic Data Science capability.
Finally, there is a Fast Data Loader, providing both schema and data replication, offered within the context of the platform. Unfortunately, it is currently relatively limited in terms of the sources (Oracle, SQL Server, Salesforce, Coupa and Servicenow) and targets (Snowflake, Amazon Redshift and SAP DW Cloud) it supports at present, though the company will be building these out over time. As an alternative or in addition, there is SnapLogic eXtreme which is a managed service for big data environments, which supports the population of NoSQL-based data lakes and lakehouses. It is based on a serverless environment and supports elastic scaling.