Our blog

Resources and insights

The latest industry news, interviews, technologies, and resources.

Data Integration Platform
5 min read

Introducing Prophecy Data Copilot

Unlock the power of generative AI in data engineering with Prophecy’s Data Copilot.
Mitesh Shah
June 22, 2023
February 14, 2025
Events + Announcements
10 min read

Prophecy at Data + AI Summit

Top 10 reasons to visit Prophecy at Data + AI Summit 2023
Emily Lewis
June 16, 2023
June 17, 2023
Data Integration Platform
6 min read

Getting started with low-code SQL

Empowering business data users to quickly and easily build scalable data pipelines on the lakehouse
Anya Bida
February 14, 2025
Data Integration Platform
6 min read

How to implement ETL on Apache Spark

Learn key steps to follow, how to define the ETL pipeline using Spark APIs and dataframes, and best practices for testing and optimizing pipelines for maximum efficiency
Mei Long
July 1, 2025
Events + Announcements
3 min read

Event spotlight: Prophecy at Data + AI Summit

Bringing the power of low-code data engineering to the masses
Emily Lewis
May 16, 2024
Data Integration Platform
5 min read

Hitting data-driven home runs: How the Texas Rangers win by harnessing Prophecy in their data mesh architecture

Learn how the Texas Rangers use Prophecy and Databricks Lakehouse as the foundation of their data mesh architecture to gain a competitive advantage with low-code data engineering.
Alexander Booth
February 14, 2025
Data Integration Platform
11 min read

Empower all business data users with interactive SQL development

Leverage Prophecy’s low-code approach and integration with dbt Core to build SQL data pipelines without compromising on engineering best practices
Anya Bida
February 14, 2025
Events + Announcements
5 min read

Announcing Prophecy 3.0: low-code SQL transformations

Discover how Prophecy 3.0 arms all data users with low-code SQL, so they can quickly and easily build scalable data pipelines, resulting in highly impactful data products.
Maciej Szpakowski
April 26, 2023
February 14, 2025
ETL modernization
7 min read

From novice to expert: A blueprint for data analysts to build robust data transformation pipelines without relying on engineering

Learn essential skills and techniques to empower non-technical data practitioners to build ETL pipelines without relying on data engineering.
Mei Long
May 30, 2023
Prophecy for Databricks
3 min read

Data engineering for the data lakehouse: Four guiding principles

A rising number of enterprises are adopting the lakehouse to unite their analytics projects and foster innovation on a shared, cloud-based platform.
Kevin Petrie, VP of Research at the Eckerson Group
May 29, 2023
June 2, 2023
Events + Announcements
8 min read

Webinar spotlight — Moneyball: How the Texas Rangers use low-code data engineering and analytics to identify MVPs

Join this webinar to learn how the Texas Rangers are creating a competitive advantage in professional baseball by using data analytics and AI to improve player performance and evaluate new talent.
Emily Lewis
May 28, 2023
June 8, 2023
Data Integration Platform
2 min read

Getting started with low-code

Let's create a pipeline using Prophecy's visual, low-code interface for Spark.
Anya Bida
June 8, 2023
Data Integration Platform
15 min read

PySpark hands-on tutorial using a visual IDE

Let's get started with PySpark using a visual interface to generate open-source Spark code.
Anya Bida
July 1, 2025
Data Integration Platform
5 min read

Use Spark interims to troubleshoot and polish low-code Spark pipelines: Part 2

In Part 1, we learned an easy way to troubleshoot a data pipeline using historical, read-only metadata. Now, I want to dig in and polish my individual Spark DataFrames.
Anya Bida
June 8, 2023
Data Integration Platform
4 min read

Use Spark interims to troubleshoot and polish low-code Spark pipelines: Part 1

Let’s take advantage of Spark’s interim metadata to understand our Spark job behavior with low-code tooling.
Anya Bida
July 1, 2025
ETL modernization
4 min read

Cloud data engineering on Spark — factors for transitioning to the stack

For enterprises, navigating the cloud transition can be tricky. This post describes the two paradigms for ETL in the cloud and the factors to consider when choosing one.
Raj Bains
November 8, 2022
May 16, 2024
ETL modernization
6 min read

Ab Initio to Spark: Modernize your ETL and lower costs

Prophecy provides a highly automated replacement for legacy ETL products to accelerate the journey to open source and cloud computing.
Raj Bains
July 12, 2022
January 31, 2025
Prophecy for Databricks
10 min read

Deep dive into Prophecy for Databricks

Just a few days ago, we announced Prophecy for Databricks. In this blog post, Part 2 of that announcement, we dig into how Prophecy makes data engineering simple for any data practitioner on Databricks.
Maciej Szpakowski
June 23, 2022
July 1, 2025
Data Integration Platform
10 min read

Financial reporting simplified: Working capital forecasting

In this blog, we cover what financial reporting is, why it matters, the challenges of producing an efficient and reliable financial report, and how Prophecy simplifies the whole process.
Anshuman Agrawal
May 31, 2021
February 14, 2025
Prophecy for Databricks
15 min read

Prophecy with Delta — making data lakehouses easier

Learn more about the evolution of data architectures, from traditional ETL through two-tier and modern data lakehouses. Explore how Prophecy can enable you to leverage the lakehouse by making common use cases like advanced merges and slowly changing dimensions significantly easier.
Maciej Szpakowski
May 15, 2022
April 27, 2023
Events + Announcements
5 min read

Prophecy raises Series A to industrialize data refining

Prophecy is delighted to announce we raised a $25M Series A led by Insight Partners, with participation from SignalFire, Dig Ventures and Berkeley SkyDeck.
Raj Bains
January 20, 2022
February 14, 2025
Prophecy for Databricks
5 min read

Prophecy: Low-code data engineering on Databricks lakehouses via Partner Connect

We're excited to be a part of Databricks Partner Connect! Read this blog to learn how Databricks Partner Connect makes getting started with Prophecy simple via single-click signup and sign-in.
Shagun Bains
November 18, 2021
April 22, 2023
Data Engineering
3 min read

Data Engineering Battle: Python vs SQL++ vs Visual=Code

In a Data Engineering Battle session that hundreds attended, Prophecy presented a complete, low-code data engineering product covering low-code Spark, low-code Airflow, and column-level lineage. Here are the poll results.
Raj Bains
July 13, 2021
February 14, 2025
Events + Announcements
5 min read

DATA+AI Summit 2021: Poll results and takeaways for low-code products

Prophecy presented a complete, low-code data engineering product at the Data + AI Summit 2021. Hundreds attended our sessions on low-code Spark, low-code Airflow and column-level lineage. Here are the results from the polls we conducted there.
Prophecy Team
May 1, 2021
May 16, 2024
Data Integration Platform
6 min read

Be more productive on Spark with low-code

Learn about the four main pillars of productive, powerful, low-code data engineering on Spark. See all the pieces in action!
Prophecy Team
July 1, 2025
Data Engineering
3 min read

Prophecy SaaS: Low-code data engineering for your Spark

Announcing the launch of Prophecy SaaS!
Prophecy Team
February 23, 2021
July 1, 2025
Data Engineering
4 min read

ELT is not the disruption — data engineering is!

The disruption is agile software practices in data engineering made usable for the many. Allow me to explain.
Raj Bains
October 20, 2020
February 14, 2025
6 min read

Apache Spark™ vs Snowflake: The cloud data engineering (ETL) debate!

We discuss the two cloud data engineering architectures and advise which one to pick as you move your ETL to the cloud.
Raj Bains
July 13, 2020
May 16, 2024
Events + Announcements
2 min read

Announcing Prophecy's public beta!

We're excited to share what we have built with you. Sign up and let us know what you think!
Prophecy Team
May 22, 2020
February 14, 2025
6 min read

ProphecyHub: Metadata re-invented with Git and GraphQL for data engineering

Prophecy has innovated in metadata management by merging code on Git with metadata, making it code-first.
Raj Bains
April 22, 2020
April 22, 2023
6 min read

Spark: Column-level lineage

Prophecy computes column-level lineage directly from Spark code stored in Git.
Raj Bains
March 17, 2020
April 22, 2023
3 min read

Spark deserves a better IDE

Prophecy IDE enables visual and code developers to produce high-quality Spark code.
Raj Bains
January 28, 2020
April 22, 2023
5 min read

Scala packrat parser combinators for DSLs

Parser combinators let you develop sophisticated parsers quickly. See how to use them.
Raj Bains
October 11, 2019
May 16, 2024
4 min read

Startup superpower: customer discovery machine

Customer discovery is essential to figuring out product-market fit. Learn how to do it well and when to use it.
Raj Bains
September 5, 2019
April 22, 2023
5 min read

Introducing Prophecy.io — cloud-native data engineering

Introducing Prophecy.io — a high-performance, zero-compromise, cloud-native data engineering product powered by Spark for enterprise data engineering teams.
Raj Bains
September 15, 2019
April 22, 2023