Rajrup Ghosh

Ph.D. Candidate, USC@NSL

(2019-2026) Ph.D. Candidate in Computer Science, University of Southern California (USC).
(2017) M.Tech in Computational Science, Indian Institute of Science (IISc), Bangalore.
(2015) B.E. in Computer Science, Indian Institute of Engineering Science and Technology (IIEST), Shibpur.

Jump to Biography, Publications, Experiences.

Biography

I am a Ph.D. student in Networked System Lab (NSL) at University of Southern California. I am fortunate to be advised by Prof. Ramesh Govindan. I frequently collaborate with Prof. Harsha Madhyastha, and Prof. Antonio Ortega. My primary research interests are in areas of Volumetric Video, 3D Compression, AR/VR Streaming, Edge+Cloud Computing and Systems for ML.

Prior to joining USC, I completed my Masters in Computational Science at the Department of Computational and Data Sciences (CDS), Indian Institute of Science (IISc), Bangalore. I was advised by Prof. Yogesh Simmhan at DREAM:Lab.

Looking for a Postdoctoral/Industry position starting Summer 2026. Please reach out if I seem fit!

Work Experience

Graduate Research Assistant (Aug 2019 - May 2026 (Exp.))
Department of Computer Science, University of Southern California.

Research Intern (May 2022 - Aug 2022)
Microsoft Research, Redmond, Washington.
Mentors: Krishna Chintalapudi, Nikunj Raghuvanshi, Ranveer Chandra

Research Intern (June 2020 - Aug 2020)
Microsoft Research, Redmond, Washington.
Mentor: Krishna Chintalapudi

Lead Engineer (Research) (July 2017 - July 2019)
Samsung R&D Institute India, Bangalore.

Teaching Experience

Teaching Assistant at USC
Course: CSCI 353 - Internetworking, Fall 2025, Instructor: Prof. Ramesh Govindan

Teaching Assistant at USC
Course: CS 551/651: Advanced Computer Networks, Spring 2022, Instructor: Prof. Ramesh Govindan

Guest Lecture at Princeton University, Topic: Volumetric Video Streaming [PPT]
Course: COS 598a: Machine Learning-Driven Video Systems, Spring 2022, Instructor: Prof. Ravi Netravali

Awards

CoNEXT 2025 Travel Grant Award.

Graduate Student Annenberg Fellowship, 2019 - 2023.

Received IISc Motorola Medal for Best CDS M.Tech. Thesis, 2015 - 2017.

Publications

In Progress

GS-NFS++: Fast inter-frame compression of Dynamic Gaussian Splats

Rajrup Ghosh, Haoran Wang, Haoran Hong, Eduardo Pavez, Weiwu Pang, Harsha V. Madhyastha, Antonio Ortega, and Ramesh Govindan
ArXiv

GS-NFS: Bandwidth-adaptive Streaming of Dynamic Gaussian Splats and Point Clouds

Rajrup Ghosh, Haodong Wang, Haoran Hong, Eduardo Pavez, Amartya Chaudhuri, Weiwu Pang, Harsha V. Madhyastha, Antonio Ortega, and Ramesh Govindan

In ArXiv Preprint 2026

Abs Link

Dynamic 3D Gaussian Splatting (3DGS) holds great promise as a 3D video streaming technology since it can represent complex 3D scenes with high fidelity. In this approach, every frame in a 3D video represents the environment as a collection of Gaussians with position and other attributes such as scale, rotation, opacity, and color. Frames capture fine details, permit views from any arbitrary perspective, but are an order of magnitude, or more, larger than 2D video frames. A line of recent work has explored how to compress dynamic 3DGS frames, but these approaches are often slow, in part because their compression techniques are not amenable to efficient acceleration. GS-NFS accelerates dynamic 3DGS compression and decompression on a GPU, to the point where it can encode and decode at full frame rate. It achieves this by developing novel GPU-based parallelizations of existing algorithms for encoding both positions and attributes of Gaussians. As a result, it is 1-2 orders of magnitude faster than the state-of-the-art in encoding and decoding a frame, while offering competitive compression performance and rendering quality.
CoNEXT

LiVo: Toward Bandwidth-adaptive Fully-Immersive Volumetric Video Conferencing

Rajrup Ghosh, Christina Suyong Shin, Lei Zhang, Muyang Ye, Tao Jin, Harsha V. Madhyastha, Ravi Netravali, Antonio Ortega, Sanjay Rao, Anthony Rowe, and Ramesh Govindan

Proc. ACM Netw. (CoNEXT/PACMNET) Nov 2025

Abs Link Code

Volumetric video allows users 6 degrees of freedom (6-DoF) in viewing continuously evolving scenes in 3D. Given broadband speeds today, volumetric video conferencing will soon be feasible. Even so, these scenes will need to be compressed, and compression will need to adapt to variations in bandwidth availability. Existing 3D compression techniques cannot adapt to bandwidth availability, are slow, and utilize bandwidth inefficiently, so they don’t scale well to large scene descriptions. LiVo achieves low-latency and large-scene two-way conferencing by maximally leveraging existing 2D video infrastructure, including compression standards, rate-adaptive codecs, and real-time transport protocols. To achieve high quality, LiVo must carefully compose scenes from multiple cameras into multiple streams, encode scene geometry in a novel way, adapt to and apportion available bandwidth dynamically between streams to ensure high reconstruction quality, and cull content outside the receiver’s field of view to reduce information sent into the network. These novel contributions enable LiVo to outperform the state-of-the-art by over 20% in objective quality. In a user study, LiVo achieves a mean opinion score of 4.1, while other approaches achieve significantly lower values.
ACM MM

SplatPose: On-Device Outdoor AR Pose Estimation Using Gaussian Splatting

Weiwu Pang, Rajrup Ghosh, Jiawei Yang, Ziyu Wei, Branden Leong, Yue Wang, and Ramesh Govindan

In Proceedings of the 33rd ACM International Conference on Multimedia (ACM MM) Nov 2025

Abs Link

Outdoor AR applications on mobile devices need accurate estimates for the pose of the device. In this paper, we develop SplatPose, a novel pose estimation technique that uses a data-driven 3D modeling technique called Gaussian Splatting. SplatPose uses a trained Gaussian Splatting model to render an image at an estimated device location, then matches features with the camera image to estimate pose. % Because this matching can be fast, SplatPose can, in theory, estimate pose entirely on a mobile device, while existing approaches cannot. To this end, SplatPose trains Gaussian Splatting models to be robust to appearance changes, thereby improving accuracy. It also incorporates a novel fast renderer to improve rendering speed. Using an AR pose estimation benchmark dataset, we show that SplatPose outperforms the state-of-the-art in terms of accuracy, and is up to an order of magnitude faster on a mobile device.
Ubicomp/IMWUT

AeroTraj: Trajectory Planning for Fast, and Accurate 3D Reconstruction Using a Drone-based LiDAR

Fawad Ahmad, Christina Suyong Shin, Rajrup Ghosh, John D’Ambrosio, Eugene Chai, Karthikeyan Sundaresan, and Ramesh Govindan

Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. (Ubicomp/IMWUT) Sep 2023

Abs Link

This paper presents AeroTraj, a system that enables fast, accurate, and automated reconstruction of 3D models of large buildings using a drone-mounted LiDAR. LiDAR point clouds can be used directly to assemble 3D models if their positions are accurately determined. AeroTraj uses SLAM for this, but must ensure complete and accurate reconstruction while minimizing drone battery usage. Doing this requires balancing competing constraints: drone speed, height, and orientation. AeroTraj exploits building geometry in designing an optimal trajectory that incorporates these constraints. Even with an optimal trajectory, SLAM’s position error can drift over time, so AeroTraj tracks drift in-flight by offloading computations to the cloud and invokes a re-calibration procedure to minimize error. AeroTraj can reconstruct large structures with centimeter-level accuracy and with an average end-to-end latency below 250 ms, significantly outperforming the state of the art.
SoCC

Scrooge: A Cost-Effective Deep Learning Inference System

Yitao Hu, Rajrup Ghosh, and Ramesh Govindan

In Proceedings of the ACM Symposium on Cloud Computing (SoCC) Sep 2021

Abs Link

Advances in deep learning (DL) have prompted the development of cloud-hosted DL-based media applications that process video and audio streams in real-time. Such applications must satisfy throughput and latency objectives and adapt to novel types of dynamics, while incurring minimal cost. Scrooge, a system that provides media applications as a service, achieves these objectives by packing computations efficiently into GPU-equipped cloud VMs, using an optimization formulation to find the lowest cost VM allocations that meet the performance objectives, and rapidly reacting to variations in input complexity (e.g., changes in participants in a video). Experiments show that Scrooge can save serving cost by 16-32% (which translate to tens of thousands of dollars per year) relative to the state-of-the-art while achieving latency objectives for over 98% under dynamic workloads.
IoTDI

Rim: Offloading Inference to the Edge

Yitao Hu, Weiwu Pang, Xiaochen Liu, Rajrup Ghosh, Bongjun Ko, Wei-Han Lee, and Ramesh Govindan

In Proceedings of the International Conference on Internet-of-Things Design and Implementation (IoTDI) Sep 2021

Abs Link

Video cameras are among the most ubiquitous sensors in the Internet-of-Things. Video and audio applications, such as cross-camera activity detection, avatar extraction or language translation will, in the future, offload processing to an edge cluster of GPUs. Rim is a management system for such clusters that satisfies throughput and latency requirements of these applications, while enabling high cluster utilization. It uses coarse-grained knowledge of application structure to profile throughput of applications on resources, then uses these profiles to place applications on cluster nodes to achieve these goals. It dynamically adapts placement to load and failures. Experiments show that on maximal workloads on a testbed, Rim can satisfy requirements of all applications, but competing approaches designed for low-latency GPU execution cannot.
TCPS

Distributed Scheduling of Event Analytics across Edge and Cloud

Rajrup Ghosh, and Yogesh Simmhan

ACM Trans. Cyber-Phys. Syst. Jul 2018

Abs Link

Internet of Things (IoT) domains generate large volumes of high-velocity event streams from sensors, which need to be analyzed with low latency to drive decisions. Complex Event Processing (CEP) is a Big Data technique to enable such analytics and is traditionally performed on Cloud Virtual Machines (VM). Leveraging captive IoT edge resources in combination with Cloud VMs can offer better performance, flexibility, and monetary costs for CEP. Here, we formulate an optimization problem for energy-aware placement of CEP queries, composed as an analytics dataflow, across a collection of edge and Cloud resources, with the goal of minimizing the end-to-end latency for the dataflow. We propose a Genetic Algorithm (GA) meta-heuristic to solve this problem and compare it against a brute-force optimal algorithm (BF). We perform detailed real-world benchmarks on the compute, network, and energy capacity of edge and Cloud resources. These results are used to define a realistic and comprehensive simulation study that validates the BF and GA solutions for 45 diverse CEP dataflows, LAN and WAN setup, and different edge resource availability. We compare the GA and BF solutions against random and Cloud-only baselines for different configurations for a total of 1,764 simulation runs. Our study shows that GA is within 97% of the optimal BF solution that takes hours, maps dataflows with 4–50 queries in 1–26s, and only fails to offer a feasible solution ≤20% of the time.
CCGRID

Adaptive Energy-Aware Scheduling of Dynamic Event Analytics Across Edge and Cloud Resources

Rajrup Ghosh, Siva Prakash Reddy Komma, and Yogesh Simmhan

In 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) May 2018

Abs Link

The growing deployment of sensors as part of Internet of Things (IoT) is generating thousands of event streams. Complex Event Processing (CEP) queries offer a useful paradigm for rapid decision-making over such data sources. While often centralized in the Cloud, the deployment of capable edge devices on the field motivates the need for cooperative event analytics that span Edge and Cloud computing. Here, we identify a novel problem of query placement on edge and Cloud resources for dynamically arriving and departing analytic dataflows. We define this as an optimization problem to minimize the total makespan for all event analytics, while meeting energy and compute constraints of the resources. We propose 4 adaptive heuristics and 3 rebalancing strategies for such dynamic dataflows, and validate them using detailed simulations for 100 - 1000 edge devices and VMs. The results show that our heuristics offer O(seconds) planning time, give a valid and high quality solution in all cases, and reduce the number of query migrations. Furthermore, rebalance strategies when applied in these heuristics have significantly reduced the makespan by around 20 - 25%.
ICAPR

Exploring the self similar properties for monitoring of air quality information

Rajrup Ghosh, Dipanjan Ghosh, Sreemoyee Roy, and Abhik Mukherjee

In 2015 Eighth International Conference on Advances in Pattern Recognition (ICAPR) Jan 2015

Abs Link

Air quality information has assumed much importance over the years due to the increase in air pollution. One major hindrance in monitoring of air pollutants is the dearth of spatial availability of aerosol concentration measurements due to the cost involved in deployment of sensors. In this respect, self similarity analysis of data can be very useful. This work is based on standard grid based pollutant dispersion models in a simulated environment over different scales of grid size. The fractal dimension is considered as a scale invariant metric which gives an idea about the variation in pollutant concentration across different scales. A method is detailed for measuring the fractal dimension properties. Results indicate that it is possible to apply the dispersion models across different scales and also the air quality monitored in one region can be compared with other regions.