Our blog

Resources and insights

The latest industry news, interviews, technologies, and resources.

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
We care about your data in our Privacy Policy.
Prophecy for Databricks
3
min read

Data engineering for the data lakehouse: Four guiding principles

A rising number of enterprises are adopting the lakehouse to unite their analytics projects and foster innovation on a shared, cloud-based platform.
Kevin Petrie, VP of Research at the Eckerson Group
June 1, 2023
August 4, 2025
June 1, 2023
August 4, 2025
June 1, 2023
August 4, 2025
ETL Modernization
5
min read

3 ways Prophecy modernizes ETL on Databricks

By modernizing the ETL process on Databricks, Prophecy can provide organizations with a more agile, scalable, and cost-effective solution for managing their data.
Maciej Szpakowski
May 29, 2023
July 15, 2025
May 29, 2023
July 15, 2025
May 29, 2023
July 15, 2025
Data Integration Platform
6
min read

How to implement ETL on Apache Spark

Learn key steps to follow, how to define the ETL pipeline using Spark APIs and dataframes, and best practices for testing and optimizing pipelines for maximum efficiency
Mei Long
May 29, 2023
July 15, 2025
May 29, 2023
July 15, 2025
May 29, 2023
July 15, 2025
ETL Modernization
7
min read

From novice to expert: A blueprint for data analysts to build robust data transformation pipelines without relying on engineering

Learn essential skills and techniques to empower non-technical data practitioners to build ETL pipelines without relying on data engineering.
Mei Long
May 29, 2023
July 15, 2025
May 29, 2023
July 15, 2025
May 29, 2023
July 15, 2025
Events + Announcements
3
min read

Event spotlight: Prophecy at Data + AI Summit

Bringing the power of low-code data engineering to the masses
Emily Lewis
May 26, 2023
July 15, 2025
May 26, 2023
July 15, 2025
May 26, 2023
July 15, 2025
Data Integration Platform
5
min read

Hitting data driven home runs: How the Texas Rangers win by harnessing Prophecy in their data mesh architecture

Learn how the Texas Rangers use Prophecy and Databricks Lakehouse as the foundation of their data mesh architecture to gain a competitive advantage with low-code data engineering.
Alexander Booth
May 24, 2023
August 4, 2025
May 24, 2023
August 4, 2025
May 24, 2023
August 4, 2025
Data Integration Platform
11
min read

Empower all business data users with interactive SQL development

Leverage Prophecy’s low-code approach and integration with dbt Core to build SQL data pipelines without compromising on engineering best practices
Anya Bida
May 10, 2023
August 4, 2025
May 10, 2023
August 4, 2025
May 10, 2023
August 4, 2025
Prophecy for Databricks
15
min read

Prophecy with Delta — making data lakehouses easier

Learn more about the evolution of data architectures, from traditional ETL through two-tier and modern data lakehouses. Explore how Prophecy can enable you to leverage the lakehouse by making common use cases like advanced merges and slowly changing dimensions significantly easier.
Maciej Szpakowski
April 27, 2023
July 15, 2025
April 27, 2023
July 15, 2025
April 27, 2023
July 15, 2025
Events + Announcements
5
min read

Announcing Prophecy 3.0: low-code SQL transformations

Discover how Prophecy 3.0 arms all data users with low-code SQL, so they can quickly and easily build scalable data pipelines, resulting in highly impactful data products.
Maciej Szpakowski
April 24, 2023
July 15, 2025
April 24, 2023
July 15, 2025
April 24, 2023
July 15, 2025
Prophecy for Databricks
5
min read

Prophecy: Low-code data engineering on Databricks lakehouses via Partner Connect

We're excited to be a part of Databricks Partner connect! Read this blog to learn how Databricks Partner Connect makes getting started with Prophecy simple via single-click signup and sign-in.
Shagun Bains
April 21, 2023
August 4, 2025
April 21, 2023
August 4, 2025
April 21, 2023
August 4, 2025
6
min read

ProphecyHub: Metadata re-invented with Git and GraphQL for data engineering

Prophecy has innovated in metadata management by merging code on git with metadata, making it code-first.
Raj Bains
April 21, 2023
August 4, 2025
April 21, 2023
August 4, 2025
April 21, 2023
August 4, 2025
6
min read

Spark: Column-level lineage

Prophecy computes column-level lineage from code for Spark code on Git.
Raj Bains
April 21, 2023
July 15, 2025
April 21, 2023
July 15, 2025
April 21, 2023
July 15, 2025
3
min read

Spark deserves a better IDE

Prophecy IDE enables visual and code developers to produce high-quality Spark code.
Raj Bains
April 21, 2023
July 15, 2025
April 21, 2023
July 15, 2025
April 21, 2023
July 15, 2025
5
min read

Introducing Prophecy.io — cloud-native data engineering

Introducing Prophecy.io — a high-performance, zero-compromise, cloud-native data engineering product powered by Spark for enterprise data engineering teams.
Raj Bains
April 21, 2023
July 15, 2025
April 21, 2023
July 15, 2025
April 21, 2023
July 15, 2025
Data Integration Platform
15
min read

PySpark hands-on tutorial using a visual IDE

Let's get started with PySpark using a visual interface to generate open-source Spark code.
Anya Bida
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
Data Integration Platform
4
min read

Use Spark interims to troubleshoot and polish low-code Spark pipelines: Part 1

Let’s take advantage of Spark’s interim metadata to understand our Spark job behavior with low-code tooling.
Anya Bida
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
ETL Modernization
4
min read

Cloud data engineering on Spark — factors for transitioning to the stack

For enterprises, navigating the cloud transition can be tricky. This describes the two paradigms for ETL in the cloud, and the factors to consider when choosing one.
Raj Bains
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
ETL Modernization
6
min read

Ab Initio to Spark: Modernize your ETL and lower costs

Prophecy also provides a highly automated replacement for legacy ETL products to accelerate the journey to open source and cloud computing.
Raj Bains
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
Prophecy for Databricks
10
min read

Deep dive into Prophecy for Databricks

Just a few days ago, we announced a Prophecy for Databricks. In this blog post — Part 2 of that announcement — we dig into how Prophecy makes data engineering simple for any data practitioner on Databricks.‍
Maciej Szpakowski
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
Data Integration Platform
10
min read

Financial reporting simplified: Working capital forecasting

In this blog, we learn about the concept of financial reporting, its importance, the various challenges faced while generating an efficient and reliable financial report and how Prophecy makes the complete process easy sailing.
Anshuman Agrawal
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
Events + Announcements
5
min read

Prophecy raises Series A to industrialize data refining

Prophecy is delighted to announce we raised a $25M Series A led by Insight Partners, with participation from SignalFire, Dig Ventures and Berkeley SkyDeck.
Raj Bains
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
Data Engineering
3
min read

Data Engineering Battle: Python vs SQL++ vs Visual=Code

In a Data Engineering Battle, Prophecy presented a complete, low-code data engineering product with a session for low-code Spark, low-code Airflow and column-level lineage that hundreds attended. Here are the poll results.
Raj Bains
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
Events + Announcements
5
min read

DATA+AI Summit 2021: Poll results and takeaways for low-code products

Prophecy presented a complete, low-code data engineering product at the Data + AI Summit 2021. Hundreds attended our sessions on low-code Spark, low-code Airflow and column-level lineage. Here are the results from the polls we conducted there.
Prophecy Team
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
Data Integration Platform
6
min read

Be more productive on Spark with low-code

Learn about the four main pillars of productive, powerful, low-code data engineering on Spark. See all the pieces in action!
Prophecy Team
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
Data Engineering
3
min read

Prophecy SaaS: Low-code data engineering for your Spark

Announcing the launch of Prophecy SaaS!
Prophecy Team
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
Data Engineering
4
min read

ELT is not the disruption — data engineering is!

The disruption is agile software practices in data engineering made usable for the many. Allow me to explain.
Raj Bains
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
6
min read

Apache Spark™ vs Snowflake: The cloud data engineering (ETL) debate!

We discuss the two cloud data engineering architectures and advise which one to pick as you move your ETL to the cloud.
Raj Bains
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
April 9, 2023
August 4, 2025
Events + Announcements
2
min read

Announcing Prophecy's public beta!

We're excited to share what we have built with you. Sign up and let us know what you think!
Prophecy Team
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
April 9, 2023
July 15, 2025
5
min read

Scala packrat parser combinators for DSLs

Parser combinators allow you to develop sophisticated parsers quickly and see how to use them.
Raj Bains
October 11, 2019
May 16, 2024
October 11, 2019
May 16, 2024
October 11, 2019
May 16, 2024
4
min read

Startup superpower: customer discovery machine

Customer discovery is essential to figuring out product market fit. Learn how to do it well and when to use it.
Raj Bains
September 5, 2019
April 22, 2023
September 5, 2019
April 22, 2023
September 5, 2019
April 22, 2023
ETL Modernization
4
min read

Weigh Your Options As You Move Off Alteryx

Explore alternatives to Alteryx for modern data prep. Explore options and how Prophecy delivers scalability and openness to modernize data preparation.
Raj Bains
November 18, 2024
November 18, 2024
November 18, 2024
Data Integration Platform
8
min read

How to build generative AI applications on enterprise data

Discover how to build generative AI applications leveraging enterprise data. Learn to identify use cases, choose effective models, and navigate challenges with our comprehensive guide.
Mei Long
February 14, 2025
February 14, 2025
February 14, 2025