Skip survey header

2025 Data Engineering Survey

Welcome!

Daniel Stori comic about SQL
This year, we are seeking data engineering experts and gurus to help drive the research for our upcoming DZone Trend Report. Covering everything from data-centric cultures, streaming and real-time data, and DataOps to AI strategies and more, your insights will inform the key takeaways for our global developer and IT audience.

This survey will guide you through a variety of questions, requiring about 12 minutes to complete.

Thank you being a part of our research!
1. Which DataOps practices have you implemented in your data pipelines or workflows? Select all that apply.
2. Where does your organization currently store its data and/or database solutions?
3. How do you integrate or orchestrate data flows between systems in your current stack? Select all that apply.
4. What factors most influence your choice of data processing architecture? Select all that apply.
5. What components of a data mesh architecture do you or your team support? Select all that apply.
6. Which open-source data engineering tools or frameworks have you used in the past 12 months? Select all that apply.
7. What strategies and/or architectural considerations do you regularly examine for ensuring data security in the software you currently contribute to? Select all that apply.
8. Which of the following security concerns from the OWASP Data Security Top 10 do you believe to be significant potential threats to software you currently contribute to? Select all that apply.
9. How often do you use SQL and Python together in the same workflow?
What are your primary use cases for combining Python and SQL? Select all that apply.
What is your biggest challenge when working with both Python and SQL in a single workflow?
10. Why is your organization migrating, or considering migrating, its databases to the cloud? Select all that apply.
11. In your opinion, what was, or would be, most challenging with migrating to a cloud DBMS? Select all that apply.
12. For database management and maintenance, what best describes the arrangement your organization has, or is considering establishing, with its cloud database provider(s)? Select all that apply.
This question requires a valid percent format.
14. When are you planning to use an ETL and/or reverse ETL solution?
15. How often do you extract, transform, and load data using each of the following approaches? Select all that apply.
16. Which components make up your current BI data stack? Select all that apply.
17. How do you validate and monitor data quality for BI systems?
18. What strategies or tools do you use to manage data duplication, fragmentation, or sprawl in your data architecture?
Which of the following challenges related to data sprawl have you encountered in your current or past projects? Select all that apply
19. How important are real-time analytics for your organization's operations?
20. Where is your organization in its adoption of real-time analytics?
21. Does your organization keep data in any of the following locations?
Space Cell YesNoI don't know
Data warehouse
Data lake
Data lakehouse
22. Which of the following languages does your team currently use to analyze and/or visualize data? Select all that apply.
23. Which of the following tools does your organization use for data observability? Select all that apply.
24. How do you currently supply data or context to support generative or agentic AI applications? Select all that apply.
25. How confident are you in your understanding of best practices for preparing clean, high-quality data for AI/ML systems?
26. Which of the following practices do you use to ensure data quality when preparing datasets for AI or ML models? Select all that apply.
27. Have you ever learned a new technology (language, library, platform) in order to implement data engineering strategies?
28. Which of the following technical challenges has your team experienced when implementing data engineering strategies? Select all that apply.
29. In your opinion, which capabilities would be ideal for democratizing data engineering within your organization? Select all that apply.

Background and Experience

Information from the questions below will enable us to provide a more granular analysis of survey responses based on factors such as technical role, programming language, and years' experience.

Note: Answers remain anonymous unless you choose to enter the raffle. Emails will be used for raffle purposes only.
 
30. What types of software are you currently developing? Select all that apply.
31. What programming language ecosystems does your company use? Select all that apply.
32. What is your primary programming language at work?
Response should be between 0-60 This question requires a valid number format.
34. What best describes your primary role in your company? *This question is required.
35. What is the size of your organization in terms of employees? *This question is required.
36. Do you wish to enter the survey raffle?
Note: You must be a DZone member to enter (you can join here).