Aspiring Data Engineer · Data Science Undergraduate

Janushiya Rajakumar

I’m an aspiring Data Engineer with a strong foundation in data science, specialising in building ETL pipelines, cloud-based data platforms, and analytics-ready datasets with SQL, Python, Azure, and modern BI tools.

📍 Colombo, Sri Lanka 🎓 BSc IT (Data Science) · SLIIT 🔍 Open to Data Engineering & Analytics roles

About

I am a fourth-year Information Technology undergraduate specializing in Data Science at the Sri Lanka Institute of Information Technology (SLIIT), with a CGPA of 3.46. My academic and industry experience has given me a solid foundation in data engineering, analytics, and machine learning.

I focus on building scalable data solutions—from ETL pipelines and data warehousing to interactive dashboards that support decision-making. I enjoy working with complex datasets, transforming raw information into clean, structured, and analytics-ready formats.

As a curious and fast learner, I am always looking for opportunities to apply my skills in real-world projects, collaborate with cross-functional teams, and contribute to impactful data-driven products.

Portrait of Janushiya Rajakumar

Skills

Programming & Data

  • Python
  • Java
  • C / C++
  • SQL
  • Pandas
  • NumPy

Data Platforms & BI

  • Power BI
  • Tableau
  • Excel
  • ETL Workflows
  • Data Warehousing
  • SSIS
  • SSAS
  • Spark

Databases & Cloud

  • T-SQL
  • SQL
  • SQL Server
  • Oracle
  • Azure (ADF, Databricks, Synapse)
  • Azure Fabric
  • Snowflake

Machine Learning & NLP

  • Scikit-learn
  • TensorFlow
  • CNN / RNN
  • NLP (tokenization, stemming)
  • Recommendation Systems

Web & Apps

  • React
  • Flask
  • Streamlit
  • HTML
  • CSS
  • JavaScript

Professional Skills

  • Critical Thinking
  • Problem-Solving
  • Communication
  • Teamwork
  • Time Management
  • Adaptability
  • Detail-Oriented
  • Analytical Thinking

Experience

Mar 2025 – Present Creative Software

Data Engineer Intern

  • Contributed to the development of an on-premise data warehousing solution to consolidate data from multiple sources for enterprise-wide reporting and analytics.
  • Supported scalable data integration and ensured delivery of high quality, reliable datasets for business stakeholders through interactive dashboards.
  • Designed and maintained ETL pipelines using on-premise data tools.
  • Extracted, transformed, and integrated data from multiple database systems.
  • Developed report logic and prepared Power BI dashboards to support data-driven insights.
  • Prepared accurate, reliable datasets to support reporting and dashboard development.

Jun 2024 – Dec 2024 HSBC Bank

Project Coordinator

  • Analyzed customer profiles to assess the need for customer exit decisions.
  • Prepared detailed reports and maintained project documentation to support operational decision-making and track project progress.
  • Ensured compliance with internal and external standards throughout project execution.
  • Managed customer exit processes with a focus on accuracy and attention to detail.

May 2022 – May 2024 HSBC Bank

Customer Solution Representative

  • Provided customer support, resolved queries, and delivered tailored solutions that contributed to customer satisfaction and business growth.
  • Built strong relationships with customers and internal teams to ensure smooth service delivery and issue resolution.
  • Managed customer feedback and used insights to improve service offerings and operational processes.
  • Actively identified and escalated issues to the appropriate teams, ensuring quick resolution and minimal disruption.

Education

BSc (Hons) in Information Technology Specialization in Data Science

Sri Lanka Institute of Information Technology (SLIIT)
Malabe, Sri Lanka
Oct 2022 – Sept 2026

CGPA: 3.46

GCE Advanced Level Examination

Ramanathan Hindu Ladies’ College
Bambalapitiya, Colombo, Sri Lanka
Feb 2022 (2021)

Combined Maths: A · Information Technology: B · Physics: C

Z-Score: 1.3758

GCE Ordinary Level Examination

Ramanathan Hindu Ladies’ College
Bambalapitiya, Colombo, Sri Lanka
Dec 2018

Overall Results: 6A · 2B · C

Key Subjects: Mathematics: A · English: A · ICT: B

Projects

A selection of academic and personal projects that showcase my experience in data engineering, analytics, machine learning, and NLP, and how I approach real-world problem solving with data.

Azure Data Engineer – End-to-End Data Pipeline

Designed and deployed an end-to-end data platform using Azure Data Factory for orchestration, Databricks (PySpark) for transformations, and Synapse Analytics for querying. Implemented Medallion Architecture (Bronze–Silver–Gold) to improve data quality and deliver analytics-ready datasets for Power BI.

Azure Data Factory Azure Databricks Azure Data Lake Storage Azure Synapse Analytics Power BI SQL

Data Warehousing & Business Intelligence: Meal Demand Analysis

Built an end-to-end ETL pipeline using SSIS to load data into a data warehouse and designed OLAP cubes with SSAS. Developed interactive dashboards in Power BI and Excel to analyze meal demand trends, supporting various business intelligence use cases.

SQL SSIS SSAS Power BI Excel

Earthquake Fabric Artifacts

Built an end-to-end pipeline using Microsoft Fabric to process and analyze global earthquake data from the USGS Earthquake API, applying Medallion Architecture (Bronze → Silver → Gold) for analytics-ready datasets visualized in Power BI.

Microsoft Fabric USGS Earthquake API Medallion Architecture Data Lakehouse Spark Pools Data Factory Power BI

Risk Management System: Data Analytics Tool

Building an on-premise data warehouse to support risk analytics and real-time monitoring of key risk indicators. Developing data pipelines and interactive dashboards to provide stakeholders with timely, actionable insights.

SQL Python Power BI SSIS SSAS Power BI Report Server

Curriculum Optimization Using NLP

Analyzed past papers and lecture content using Natural Language Processing (NLP) to identify key themes and patterns in the curriculum. Automated study material recommendations and assessment guidance based on the extracted insights.

Python NLP Scikit-learn Vectorization

Laptop Price Prediction Model

Built a price prediction model using classification and regression techniques to estimate laptop prices based on product features. Cleaned, processed, and evaluated data to ensure robust and accurate predictions, presented through an interactive interface.

Streamlit Python Pandas NumPy Scikit-learn Matplotlib Seaborn

Song Recommendation System – Content-Based

Developed a content-based music recommendation system using feature extraction and similarity metrics. Implemented the recommendation algorithm in Python to deliver personalized song suggestions based on user preferences.

Flask HTML NLP Cosine Similarity Pandas Scikit-learn Vectorization

Let’s build something with data

I am open to junior roles and freelance opportunities in data engineering, analytics, and related fields. If my skills align with your team’s needs or project, I would be excited to connect and discuss how I can contribute.