About

Data Analytics Engineer & Data Science Enthusiast
Leveraging data-driven strategies to optimize business decisions and architect scalable, high-impact solutions.
- Email: raj.ar@northeastern.edu
- Phone:+91-9065704710
- City: Bengaluru, KA - India
- Roles: Data Engineer, Data Engineer Researcher, Software Engineer - Data/Backend, Freelance Full Stack Data Engineer
- Companies: Abecedarian LLC, IBM, Vittude, 7bi
- Latest Degree: Masters of Science
- Alma Mater: Northeastern University (MS)
Iowa State University (BS)
- Specialization: Data Engineering & Analytics, Management Information Systems
- Interest: Big Data Pipeline Creation and Management, Data Analytics and Visualization, Database Management, Cloud Platforms
Every dataset tells a story—I make sure it's the right one. At the intersection of technology and analytics, I engineer scalable data pipelines, automate workflows, and extract meaningful insights. That’s my world. At the intersection of technology and analytics, I’ve worked with leading organizations like IBM, Vittude, and 7bi, building scalable data solutions that turn complexity into clarity.
My journey through Northeastern University and Iowa State University wasn’t just about degrees— it was a battleground for mastering data. From engineering robust pipelines to crafting compelling analytics, I’ve honed the skills that make data work for businesses, not the other way around.
Beyond the code and queries, big data is my playground, visualization is my canvas, and cloud solutions are my launchpad. Whether it's architecting real-time data pipelines or uncovering trends that drive decisions, I don’t just analyze data— I bring it to life. 🚀
Facts
Immersed in the vast expanse of the data universe, I've honed a unique expertise.
Through unwavering dedication to projects, client assistance, and ceaseless learning, I've achieved the following milestones:
ETL/ELT Tools Used
RDBMS and NoSQL Experienced
Cloud Platforms
Projects
Skills
With a foundation in data engineering, I bring a blend of technical proficiency and analytical acumen to transform raw data into actionable insights.
Resume
SUMMARY
Data Engineer with 4+ years of experience building batch and real-time pipelines—boosting efficiency by 30% and system uptime by 40%. Proficient in Python, SQL, Kafka, PySpark, dbt, AWS (S3, Glue, Lambda, Redshift) for cloud-native architectures and microservices- based frameworks. Skilled at ETL, workflow automation (Airflow, Prefect), and streaming analytics—driving initiatives to reduce costs, enhance customer experience, and deliver ROI. Committed to delivering scalable, data-driven solutions fuelling innovation
EDUCATION
Master of Science in Data Analytics Engineering
NORTHEASTERN UNIVERSITY, Boston, MA, USA
Sept 2021 - Jul 2023
- Data Engineering: Data pipeline development, dataset management, and SQL optimization.
- Big Data & Cloud: Cloud infrastructure solutions, PySpark, Map Reduce, and streaming data.
- Visualization: Insights with Tableau, Looker, and PowerBI.
- Machine Learning: Model creation, fit testing, training, and regression analysis.
- Project Management: Projects end-to-end with effective team collaboration.
Bachelor of Science in Management Information Systems
IOWA STATE UNIVERSITY, Ames, IA, USA
Aug 2016 - May 2020
- Business Systems Analysis: Mastered techniques to analyze, design, and implement information systems in a business context.
- Database Management: RDBMS concepts, SQL querying, and database design.
- IT Infrastructure: Gained insights into networking, cybersecurity, and IT management practices.
- Application Development: Acquired skills in programming and software development for business solutions.
- Project Management: Learned methodologies like Agile and Waterfall for effective IT project execution.
PROFESSIONAL EXPERIENCE
Senior Data Engineer
Abecedarian LLC, Boston, MA - USA
Sep 2023 - Present
- Developed real-time customer analytics pipelines in Databricks using PySpark, SQL, and DBT, reducing data processing time by 40% and improving AI-driven engagement by 15%.
- Tech Stack: Databricks, PySpark, SQL, DBT, Airflow, AWS (S3, Glue, Lambda), Tableau
Senior Data Engineer Intern/Co-op - Data & AI
IBM, San Jose, CA
Jun 2022 - Dec 2022
- Led migration of core IBM Db2 components from PL/SQL to REST API, enhancing system interoperability, increasing data processing efficiency by 30% and access speed by 50%.
- Tech Stack: IBM Db2, SQL, REST APIs, CI/CD, Geospatial Data, AWS (Redshift, Lambda), Power BI
Data Engineer
Vittude, Sao Paulo, Brazil
Jan 2020 - Aug 2021
- Built a highly available live data ingestion pipeline using Kafka (MSK), AWS S3, and Lambda, optimizing ETL efficiency to 99.4% and refining real-time feature tracking.
- Tech Stack: PostgreSQL, Python (NumPy, Pandas, PySpark), Django, DBT, AWS (S3, Redshift, Lambda, MSK Kafka), Tableau
Data Engineer Intern
7bi, Sao Paulo, Brazil
May 2018 - Jan 2019
- Automated ETL pipelines with AWS Glue and Lambda, improving data ingestion by 20% and reducing KPI reporting lag by 9 hours per week.
- Tech Stack: AWS (Glue, Glue Crawler, Lambda, Redshift, S3), Power BI
Portfolio Projects
Contact
Location:
Bengaluru, KA - India - 560048
Email:
raj.ar@northeastern.edu
Call:
+91(90657)-04710