Archit Raj

I'm a Data E

I turn coffee into scalable data pipelines. (And sometimes, insights too.)

About

Data Analytics Engineer & Data Science Enthusiast

Leveraging data-driven strategies to optimize business decisions and architect scalable, high-impact solutions.

  • Email: raj.ar@northeastern.edu
  • Phone:+91-9065704710
  • City: Bengaluru, KA - India
  • Roles: Data Engineer, Data Engineer Researcher, Software Engineer - Data/Backend, Freelance Full Stack Data Engineer
  • Companies: Abecedarian LLC, IBM, Vittude, 7bi
  • Latest Degree: Masters of Science
  • Alma Mater: Northeastern University (MS)

    Iowa State University (BS)

  • Specialization: Data Engineering & Analytics, Management Information Systems
  • Interest: Big Data Pipeline Creation and Management, Data Analytics and Visualization, Database Management, Cloud Platforms

Every dataset tells a story—I make sure it's the right one. At the intersection of technology and analytics, I engineer scalable data pipelines, automate workflows, and extract meaningful insights. That’s my world. At the intersection of technology and analytics, I’ve worked with leading organizations like IBM, Vittude, and 7bi, building scalable data solutions that turn complexity into clarity.

My journey through Northeastern University and Iowa State University wasn’t just about degrees— it was a battleground for mastering data. From engineering robust pipelines to crafting compelling analytics, I’ve honed the skills that make data work for businesses, not the other way around.

Beyond the code and queries, big data is my playground, visualization is my canvas, and cloud solutions are my launchpad. Whether it's architecting real-time data pipelines or uncovering trends that drive decisions, I don’t just analyze data— I bring it to life. 🚀

Facts

Immersed in the vast expanse of the data universe, I've honed a unique expertise.

Through unwavering dedication to projects, client assistance, and ceaseless learning, I've achieved the following milestones:

0

ETL/ELT Tools Used

0

RDBMS and NoSQL Experienced

0

Cloud Platforms

0

Projects

Skills

With a foundation in data engineering, I bring a blend of technical proficiency and analytical acumen to transform raw data into actionable insights.

Python (Pandas, NumPy, PySpark, Requests, SQLAlchemy) 95%
SQL & NoSQL95%
Data Warehousing, Data Processing & other ETL/ELT processes 90%
End-to-end Data Pipeline Production & Management Profess 85%
Cloud Tools (AWS, GCP, Azure, IBM Cloud)90%
Big Data Stack: Airflow, Hadoop, Kafka, Spark & Others 88%
Statistical Analysis 80%
Machine Learning 75%

Resume

SUMMARY

Data Engineer with 4+ years of experience building batch and real-time pipelines—boosting efficiency by 30% and system uptime by 40%. Proficient in Python, SQL, Kafka, PySpark, dbt, AWS (S3, Glue, Lambda, Redshift) for cloud-native architectures and microservices- based frameworks. Skilled at ETL, workflow automation (Airflow, Prefect), and streaming analytics—driving initiatives to reduce costs, enhance customer experience, and deliver ROI. Committed to delivering scalable, data-driven solutions fuelling innovation

EDUCATION

Master of Science in Data Analytics Engineering

NORTHEASTERN UNIVERSITY, Boston, MA, USA
Sept 2021 - Jul 2023
  • Data Engineering: Data pipeline development, dataset management, and SQL optimization.
  • Big Data & Cloud: Cloud infrastructure solutions, PySpark, Map Reduce, and streaming data.
  • Visualization: Insights with Tableau, Looker, and PowerBI.
  • Machine Learning: Model creation, fit testing, training, and regression analysis.
  • Project Management: Projects end-to-end with effective team collaboration.

Bachelor of Science in Management Information Systems

IOWA STATE UNIVERSITY, Ames, IA, USA
Aug 2016 - May 2020
  • Business Systems Analysis: Mastered techniques to analyze, design, and implement information systems in a business context.
  • Database Management: RDBMS concepts, SQL querying, and database design.
  • IT Infrastructure: Gained insights into networking, cybersecurity, and IT management practices.
  • Application Development: Acquired skills in programming and software development for business solutions.
  • Project Management: Learned methodologies like Agile and Waterfall for effective IT project execution.

PROFESSIONAL EXPERIENCE

Senior Data Engineer

Abecedarian LLC, Boston, MA - USA
Sep 2023 - Present
  • Developed real-time customer analytics pipelines in Databricks using PySpark, SQL, and DBT, reducing data processing time by 40% and improving AI-driven engagement by 15%.
  • Tech Stack: Databricks, PySpark, SQL, DBT, Airflow, AWS (S3, Glue, Lambda), Tableau

Senior Data Engineer Intern/Co-op - Data & AI

IBM, San Jose, CA
Jun 2022 - Dec 2022
  • Led migration of core IBM Db2 components from PL/SQL to REST API, enhancing system interoperability, increasing data processing efficiency by 30% and access speed by 50%.
  • Tech Stack: IBM Db2, SQL, REST APIs, CI/CD, Geospatial Data, AWS (Redshift, Lambda), Power BI

Data Engineer

Vittude, Sao Paulo, Brazil
Jan 2020 - Aug 2021
  • Built a highly available live data ingestion pipeline using Kafka (MSK), AWS S3, and Lambda, optimizing ETL efficiency to 99.4% and refining real-time feature tracking.
  • Tech Stack: PostgreSQL, Python (NumPy, Pandas, PySpark), Django, DBT, AWS (S3, Redshift, Lambda, MSK Kafka), Tableau

Data Engineer Intern

7bi, Sao Paulo, Brazil
May 2018 - Jan 2019
  • Automated ETL pipelines with AWS Glue and Lambda, improving data ingestion by 20% and reducing KPI reporting lag by 9 hours per week.
  • Tech Stack: AWS (Glue, Glue Crawler, Lambda, Redshift, S3), Power BI

Portfolio Projects

TrendWatch: YouTube Trending Video Analytics

YouTube Trending Video Analytics using AWS Infrastructure

PDF Malware Detection using Machine Learning

Supervised ML with SVM Model

Quality Assessment for Wine Production

ML Regression Model for Wine Quality

Renter Management Database & Visualizations

Using MySQL, MongoDB, and Python

Obesity Estimation using Health Data

Using Python, NumPy, Pandas, and Tableau

CO2 Emissions & Fuel Type Analysis

Using Python, Pandas, Plotly, Folium

Contact

Location:

Bengaluru, KA - India - 560048

Call:

+91(90657)-04710

Loading
Your message has been sent. Thank you!