Fiodar Kazhamiaka

Fiodar Kazhamiaka

Fiodar
Kazhamiaka

Postdoc, Computer Science, Stanford

<my first name>@stanford.edu

Gates Building 422

I work at the Future Data Systems lab at Stanford as a postdoc, advised by Matei Zaharia and Peter Bailis. My research interests are in Systems for Sustainability, Data Science, and AI. My current research spans a range of topics: scalable and efficient systems for machine learning, query systems for data collected by autonomous vehicles, and PV+battery system design and control. I'm also a co-host of the weekly Stanford MLSys Seminar series; catch us live Thursdays at 1:30 pm Pacific!

I completed my PhD in Computer Science at the University of Waterloo under the guidance of Srinivasan Keshav and Catherine Rosenberg at the ISS4E lab. I modelled and optimized energy storage systems, with a focus on renewable energy sources and behind-the-meter applications. Results include new algorithms for operating and sizing PV+storage systems, a framework for solar farm operators to allocate budget between PV panels and batteries to maximize revenue, and tractable models for Lithium-ion batteries for optimization and simulation. My work was recognized through the 2020 SCS Cheriton Dissertation award and featured on ACM TechNews.

I'm keen on interdisciplinary work, and have had the pleasure of working with academics from power systems, economics, optimization, and electrochemistry disciplines. In my free time, I train and compete internationally as a member of Canada's national beach volleyball team.

Latest News

Aug 10

Career move: joining the newly formed Azure Systems Research Group in September 2023!

July 1

Milestone! 9000 subscribers to the Stanford MLSys Seminars YouTube channel

Dec 10

Our paper on scaling query serving systems was accepted to NSDI '22! Congratulations Peter!

Aug 25

Our work on solving large resource allocation problems was accepted to SOSP '21! [link]

Aug 12

The Stanford MLSys seminars are now in conjunction with a credited seminar course (CS 528).

May 3

My PhD thesis has been selected for the 2020 UWaterloo SCS Cheriton Dissertation award!

April 20

Our work on sizing multi-roof PV+storage systems was accepted to ACM eEnergy '21! [latest draft]

April 1

Check out our preprint of an algorithm for solving hyper-scale resource allocation problems in seconds!

Recent Projects

Resource Allocation in Computer Systems

Many problems in computer systems can be formulated as optimization problems, from job scheduling in clusters, to traffic engineering in wide area networks, to load balancing in distributed databases. Computing exact soluions to these problems is canonically considered to be intractable for large systems (eg. datacenters), so it's common to deploy fast but non-optimal heuristics. In our recent work on POP (Partitioned Optimization Problems), we show how near-optimal resource allocations can be computed several orders of magnitude faster than the exact solution. This work motivates the use of opimization in real systems. For more, check out our paper at SOSP 2021, as well as our pre-print featuring a similar algorithm for resource allocation problems.

Autonomous Vehicle Query Systems

Modern vehicles with various levels of autonomy (AV) are equipped with high-resolution sensors and processors that measure the state of the world as they drive through it. This data can answer fine-grained queries on the state of the physical world. How many people are in line at my favourite coffee shop? How many cyclists have crossed a specific intersection this year? Where is the nearest open parking spot? To be realized, AV query systems must address challenges around data volume, bias, and privacy. This is an ongoing project; for more, check out our position paper at CIDR 2021.

Heterogeneity-Aware Cluster Scheduling

The end of Moore's Law has brought about an era of specialized accelerators, such as GPUs, TPUs, FPGAs, and other accelerators. How do we extend common notions of fairness and throughput in job scheduling policies to a compute cluster with heterogeneous hardware?
We approach this question with Gavel, a scheduler for DNN training jobs that systematically generalizes a wide range of existing scheduling policies, such a max-min fairness, finish-time fairness, and minimum makespan. Gavel expresses these policies as convex mathematical optimization problems, and extends them to consider the setting with heterogeneous hardware. With Gavel, we can sustain higher job load, and improve end objectives such as makespan and job completion time by over 40% compared to heterogeneity-agnostic policies. For more details, check out our paper at OSDI 2020.

Future-Proof Solar PV and Storage Sizing

Suppose you want to purchase a system with solar panels and battery to power your home. How many panels do you buy? How big of a battery do you need? These questions are coupled, and depend on how often you're willing to go without power. Maybe you have some data to help understand what kind of system would have worked for you in the past, but what can this reliably tell you about the future?
To address this problem, we use of a recent advancement in empirical multi-variate probability concentration bounds to compute a robust least-cost system size for a given load target. This work was presented at ACM eEnergy 2018, and was voted as runner-up for the best paper (audience choice).
A refined version of our method was published in a journal in 2019. [link]
We recently extended this work to cover muti-roof settings! Find it at eEnergy 2021 [link]

Papers

Beobench: a Toolkit for Unified Access to Building Simulations for RL

Arduin Findeis, Fiodar Kazhamiaka, Scott Jeen, Srinivasan Keshav

ACM eEnergy 2022

Reinforcement learning (RL) is often considered a promising approach for controlling complex building operations. In this context, RL algorithms are typically evaluated using a testing framework that simulates building operations. To make general claims and avoid overfitting, an RL method should be evaluated on a large and diverse set of buildings. Unfortunately, due to the complexity of creating building simulations, none of the existing frameworks provide more than a handful of simulated buildings. Moreover, each framework has its own particularities, which makes it difficult to evaluate the same algorithm on multiple frameworks. To address this, we present Beobench: a Python toolkit that provides unified access to building simulations from multiple frameworks using a container-based approach. We demonstrate the power of our approach with an example showing how Beobench can launch RL experiments in any supported framework with a single command.

+ View Abstract

Allocation of Fungible Resourcews via a Fast, Scalable Price Discovery Method

Akshay Agrawal, Stephen Boyd, Deepak Narayanan, Fiodar Kazhamiaka, Matei Zaharia

Mathematical Programming Computation

We consider the problem of assigning or allocating resources to a set of jobs. We consider the case when the resources are fungible, that is, the job can be done with any mix of the resources, but with different efficiencies. In our formulation we maximize a total utility subject to a given limit on the resource usage, which is a convex optimization problem and so is tractable. In this paper we develop a custom, parallelizable algorithm for solving the resource allocation problem that scales to large problems, with millions of jobs. Our algorithm is based on the dual problem, in which the dual variables associated with the resource usage limit can be interpreted as resource prices. Our method updates the resource prices in each iteration, ultimately discovering the optimal resource prices, from which an optimal allocation is obtained. We provide an open-source implementation of our method, which can solve problems with millions of jobs in a few seconds on CPU, and under a second on a GPU; our software can solve smaller problems in milliseconds. On large problems, our implementation is up to three orders of magnitude faster than a commercial solver for convex optimization.

+ View Abstract

Carbon Explorer: A Holistic Approach for Designing Carbon Aware Datacenters

Bilge Acun, Benjamin Lee, Fiodar Kazhamiaka, Kiwan Maeng, Manoj Chakkaravarthy, Udit Gupta, David Brooks, and Carole-Jean Wu

Technology companies have been leading the way to a renewable energy transformation, by investing in renewable energy sources to reduce the carbon footprint of their datacenters. In addition to helping build new solar and wind farms, companies make power purchase agreements or purchase carbon offsets, rather than relying on renewable energy every hour of the day, every day of the week (24/7). Relying on renewable energy 24/7 is challenging due to the intermittent nature of wind and solar energy. Inherent variations in solar and wind energy production causes excess or lack of supply at different times. To cope with the fluctuations of renewable energy generation, multiple solutions must be applied. These include: capacity sizing with a mix of solar and wind power, energy storage options, and carbon aware workload scheduling. However, depending on the region and datacenter workload characteristics, the carbon-optimal solution varies. Existing work in this space does not give a holistic view of the trade-offs of each solution and often ignore the embodied carbon cost of the solutions. In this work, we provide a framework, Carbon Explorer, to analyze the multi-dimensional solution space by taking into account operational and embodided footprint of the solutions to help make datacenters operate on renewable energy 24/7. The solutions we analyze include capacity sizing with a mix of solar and wind power, battery storage, and carbon aware workload scheduling, which entails shifting the workloads from times when there is lack of renewable supply to times with abundant supply. Carbon Explorer will be open-sourced soon.

+ View Abstract

Data-Parallel Actors: A Programming Model for Scalable Query Serving Systems

Peter Kraft, Fiodar Kazhamiaka, Peter Bailis, and Matei Zaharia

We present data-parallel actors (DPA), a programming model for building distributed query serving systems. Query serving systems are an important class of applications characterized by low-latency data-parallel queries and frequent bulk data updates; they include data analytics systems like Apache Druid, full-text search engines like ElasticSearch, and time series databases like InfluxDB. They are challenging to build because they run at scale and need complex distributed functionality like data replication, fault tolerance, and update consistency. DPA makes building these systems easier by allowing developers to construct them from purely single-node components while automatically providing these critical properties. In DPA, we view a query serving system as a collection of stateful actors, each encapsulating a partition of data. DPA provides parallel operators that enable consistent, atomic, and fault-tolerant parallel updates and queries over data stored in actors. We have used DPA to build a new query serving system, a simplified data warehouse based on the single-node database MonetDB, and enhance existing ones, such as Druid, Solr, and MongoDB, adding missing user-requested features such as load balancing and elasticity. We show that DPA can distribute a system in < 1K lines of code (> 10× less than typical implementations in current systems) while achieving state-of-the-art performance and adding rich functionality.

+ View Abstract

Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP

Deepak Narayanan, Fiodar Kazhamiaka, Firas Abuzaid, Peter Kraft, Akshay Agrawal, Srikanth Kandula, Stephen Boyd, and Matei Zaharia

Resource allocation problems in many computer systems can be formulated as mathematical optimization problems. However, finding exact solutions to these problems using off-the-shelf solvers is often intractable for large problem sizes with tight SLAs, leading system designers to rely on cheap, heuristic algorithms. We observe, however, that many allocation problems are granular: they consist of a large number of clients and resources, each client requests a small fraction of the total number of resources, and clients can interchangeably use different resources. For these problems, we propose an alternative approach that reuses the original optimization problem formulation and leads to better allocations than domain-specific heuristics. Our technique, Partitioned Optimization Problems (POP), randomly splits the problem into smaller problems (with a subset of the clients and resources in the system) and coalesces the resulting sub-allocations into a global allocation for all clients. We provide theoretical and empirical evidence as to why random partitioning works well. In our experiments, POP achieves allocations within 1.5% of the optimal with orders-of-magnitude improvements in runtime compared to existing systems for cluster scheduling, traffic engineering, and load balancing.

+ View Abstract

Sizing Solar Panels and Storage for Multiple Roofs

Brad Huang, Fiodar Kazhamiaka, and Srinivasan Keshav

ACM eEnergy (2021)

Choosing the number of solar panels and the amount of storage needed to meet a certain fraction of the load in a microgrid setting is a difficult problem that needs to balance the competing objectives of efficiency, robustness, and cost. Prior work in this area makes the unrealistic assumption that solar panels are to be installed on a single roof that is capable of supporting all the panels required. In reality, we may need to deploy solar panels on several roof segments, each of limited size, and each with its own tilt, orientation and installation cost. This paper presents an algorithm for sizing solar panels and storage in this context. We evaluate the robustness of our approach using traces derived from the Pecan Street Dataport dataset and demonstrate the value of our approach by using it to size a hypothetical installation on the British Antarctic Survey’s research base in Antarctica.

+ View Abstract

Don't Give Up on Large Optimization Problems; POP Them!

Deepak Narayanan, Fiodar Kazhamiaka, Firas Abuzaid, Peter Kraft, Matei Zaharia

Resource allocation problems in many computer systems can be formulated as mathematical optimization problems. However, finding exact solutions to these problems using off-the-shelf solvers in an online setting is often intractable for "hyper-scale" system sizes with tight SLAs, leading system designers to rely on cheap, heuristic algorithms. In this work, we explore an alternative approach that reuses the original optimization problem formulation. By splitting the original problem into smaller, more tractable problems for subsets of the system and then coalescing resulting sub-allocations into a global solution, we achieve empirically quasi-optimal (within 1.5%) performance for multiple domains with several orders-of-magnitude improvement in runtime. Deciding how to split a large problem into smaller sub-problems, and how to coalesce split allocations into a unified allocation, needs to be performed carefully in a domain-aware way. We show common principles for splitting problems effectively across a variety of tasks, including cluster scheduling, traffic engineering, and load balancing.

+ View Abstract

Allocation of Fungible Resources via a Fast, Scalable Price Discovery Method

Akshay Agrawal, Stephen Boyd, Deepak Narayanan, Fiodar Kazhamiaka, Matei Zaharia

We consider the problem of assigning or allocating resources to a set of jobs. We consider the case when the resources are fungible, that is, the job can be done with any mix of the resources, but with different efficiencies. In our formulation we maximize a total utility subject to a given limit on the resource usage, which is a convex optimization problem and so is tractable. In this paper we develop a custom, parallelizable algorithm for solving the resource allocation problem that scales to large problems, with millions of jobs. Our algorithm is based on the dual problem, in which the dual variables associated with the resource usage limit can be interpreted as resource prices. Our method updates the resource prices in each iteration, ultimately discovering the optimal resource prices, from which an optimal allocation is obtained. We provide an open-source implementation of our method, which can solve problems with millions of jobs in a few seconds on CPU, and under a second on a GPU; our software can solve smaller problems in milliseconds. On large problems, our implementation is up to three orders of magnitude faster than a commerical solver for convex optimization.

+ View Abstract

Challenges and Opportunities for Autonomous Vehicle Query Systems

Fiodar Kazhamiaka, Matei Zaharia, and Peter Bailis

Autonomous vehicles (AVs) collect and process rich visual and dimensional data as they operate. Across a fleet of AVs, the collective data forms an up-to-date partial snapshot of the physical world. The ability to query this data has many use-cases, from finding open parking spots and checking the line outside a coffee shop, to monitoring the usage of urban spaces and road conditions. Managing AV data has unique challenges with respect to data volume, bias, and privacy issues. In this paper, we take steps to describe the design and research opportunities for a new class of data management systems: AV query systems.

+ View Abstract

Heterogeneity-Aware Cluster Scheduling Policies for Deep Learning Workloads

Deepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka, Amar Phanishayee, and Matei Zaharia

Specialized accelerators such as GPUs, TPUs, FPGAs, and custom ASICs have been increasingly deployed to train deep learning models. These accelerators exhibit heterogeneous performance behavior across model architectures. Existing schedulers for clusters of accelerators, which are used to arbitrate these expensive training resources across many users, have shown how to optimize for various multi-job, multi-user objectives, like fairness and makespan. Unfortunately, existing schedulers largely do not consider performance heterogeneity. In this paper, we propose Gavel, a heterogeneity-aware scheduler that systematically generalizes a wide range of existing scheduling policies. Gavel expresses these policies as optimization problems and then systematically transforms these problems into heterogeneity-aware versions using an abstraction we call effective throughput. Gavel then uses a round-based scheduling mechanism to ensure jobs receive their ideal allocation given the target scheduling policy. Gavel’s heterogeneity-aware policies allow a heterogeneous cluster to sustain higher input load, and improve end objectives such as makespan and average job completion time by 1.4× and 3.5× compared to heterogeneity-agnostic policies.

+ View Abstract

Analysis and Exploitation of Dynamic Pricing in the Public Cloud for ML Training

Deepak Narayanan, Keshav Santhanam, Fiodar Kazhamiaka, Amar Phanishayee, and Matei Zaharia

DISPA workshop (2020)

Cloud providers offer instances with similar compute capabilities (for example, instances with different generations of GPUs like K80s, P100s, V100s) across many regions, availability zones, and on-demand and spot markets, with prices governed independently by individual supplies and demands. In this paper, using machine learning model training as an example application, we explore the potential cost reductions possible by leveraging this cross-cloud instance market. We present quantitative results on how the prices of cloud instances change with time, and how total costs can be decreased by considering this dynamic pricing market. Our preliminary experiments show that a) the optimal instance choice for a model is dependent on both the objective (e.g., cost, time, or combination) and the model’s performance characteristics, b) the cost of moving training jobs between instances is cheap, c) jobs do not need to be preempted more frequently than once a day to leverage the benefits from spot instance price variations, and d) the cost of training a model can be decreased by as much as 3.5× compared to a static policy. We also look at contexts where users specify higher-level objectives over collections of jobs, show examples of policies for these contexts, and discuss additional challenges involved in making these cost reductions viable.

+ View Abstract

Comparison of Different Approaches for Solar PV and Storage Sizing

Fiodar Kazhamiaka, Yashar Ghiassi-Farrokhfal, Catherine Rosenberg, and Srinivasan Keshav

IEEE Transactions on Sustainable Computing (2019)

We study the problem of optimally and simultaneously sizing solar photovoltaic (PV) and storage capacity in order to partly or completely offset grid usage. While prior work offers some insights, researchers typically consider only a single sizing approach. In contrast, we use a firm theoretical foundation to compare and contrast sizing approaches based on robust simulation, robust optimization, and stochastic network calculus. We evaluate the robustness and computational complexity of these approaches in a realistic setting to provide practical, robust advice on system sizing.

+ View Abstract

Tractable Lithium-Ion Storage Models for Optimizing Energy Systems

Fiodar Kazhamiaka, Catherine Rosenberg, and Srinivasan Keshav

Energy Informatics (2019)

Model-based optimization of energy systems with batteries requires a battery model that is accurate, tractable, and easy to calibrate. Developing such a model is challenging because electrochemical batteries exhibit complex behaviours. In this paper, we propose and evaluate a family of battery models that have different trade-offs between accuracy and complexity. We derive our models from a recently developed battery model, which is accurate and easy to calibrate but is not tractable. We evaluate our models against the commonly-used benchmark tractable model using a set of experiments that characterize the cycling behaviour of two Lithium-ion battery chemistries, as well as dynamic charge/discharge experiments. We further compare the models for two typical energy system applications, solar farm firming and grid regulation, to show the impact of the choice of battery model on the results. Finally, we evaluate the increase in accuracy when battery models are calibrated with the proper operating range.

+ View Abstract

Synthetic Trace Generation for Robust PV-Storage System Sizing

Sun Sun, Fiodar Kazhamiaka, Srinivasan Keshav, and Catherine Rosenberg

ACM eEnergy (2019)

Due to the inherent randomness of both solar power generation and residential electrical load, jointly sizing solar panel and storage capacity to meet a given quality-of-service (QoS) constraint is challenging. The challenge is greater when there is limited representative historical data. We therefore propose generating synthetic solar and load traces, corresponding to different realizations of the underlying stochastic processes. Specifically, we compare the effectiveness of three generative models: autoregressive moving-average (ARMA) models, Gaussian mixture models (GMMs), and generative adversar- ial networks (GANs) – as well as two direct sampling methods – for synthetic trace generation. These traces are then used for robust joint sizing by a technique described in recent work. Extensive experiments based on real data show that our approach finds robust sizing with only one year’s worth of hourly trace data. Moreover, assuming that solar data are available, given a database of load traces, we demonstrate how to perform robust sizing with access to only twelve data points of load, one for each month of one year.

+ View Abstract

Adaptive Battery Control with Neural Networks

Fiodar Kazhamiaka, Srinivasan Keshav, and Catherine Rosenberg

Applied Machine Learning in Energy Systems (AMLIES), an ACM eEnergy workshop (2019)

The return on investment of a battery system is maximized if the battery control strategy is appropriately matched to the operating environment (e.g., pricing scheme, electrical load). For residential battery systems, the current practice is to statically determine the control policy prior to system installation; the battery subsequently spends upwards of 10 years operating in a dynamic environment. A state-of-the-art model predictive controller (MPC) can adapt to changes in the system, but is limited by its high online computational requirements. To better extract value at a reasonable on-line computational cost, we propose an adaptive battery controller framework that learns a control strategy by encoding an MPC policy in a neural network, as data becomes available, to adapt the control to the operating environment. We evaluate our controller in the context of a solar PV-storage system deployed in Texas under a time-of-use pricing scheme. We find that our controller gets to within 5-10% of optimal performance, and outperforms a default control strategy for PV-storage systems within a few months of installation.

+ View Abstract

Robust and Practical Approaches for Solar PV and Storage Sizing

Fiodar Kazhamiaka, Yashar Ghiassi-Farrokhfal, Srinivasan Keshav, and Catherine Rosenberg

ACM e-Energy (2018) (Runner up: Best paper by audience vote)

We study the problem of optimally and simultaneously sizing solar photovoltaic (PV) and storage capacity in order to partly or completely offset grid usage. While prior work offers some insights, researchers typically consider only a single sizing approach. In contrast, we use a firm theoretical foundation to compare and contrast sizing approaches based on robust simulation, robust optimization, and stochastic network calculus. We evaluate the robustness and computational complexity of these approaches in a realistic setting to provide practical, robust advice on system sizing.

+ View Abstract

Simple Spec-Based Modelling of Lithium-Ion Batteries

Fiodar Kazhamiaka, Srinivasan Keshav, Catherine Rosenberg, and Karl-Heinz Pettinger

IEEE Transactions on Energy Conversion (2018)

Lithium-ion battery models that estimate their energy content after a series of charge and discharge operations are essential in the optimal design, analysis and operation of battery-based systems. We focus on the class of battery models that can be calibrated entirely from the battery’s manufacturer-provided specifications (spec). Such models are simple to calibrate and are therefore widely used in practice. The best-known model in this category was proposed by Tremblay et al. in 2007. This model, however, has several shortcomings, including low fidelity at high C-rates, and the fact that it does not model the battery management system. We propose an alternative, called the Power-based Integrated (PI) model that is also completely spec-based, yet has much higher fidelity. We perform two types of validation, the first one uses the voltage profiles in the spec while the other is based on laboratory experiments. Both validations confirm that our model, which we have publicly released as a Simulink system block, has a mean absolute voltage error of less than 0.1 V across a wide range of C-rates.

+ View Abstract

On the Influence of Jurisdiction on the Profitability of Photovoltaic-Storage systems: A Multi-National Case Study

Fiodar Kazhamiaka, Patrick Jochem, Catherine Rosenberg, and Srinivasan Keshav

Energy Policy, (2017)

Policy makers in many jurisdictions have implemented incentive schemes such as ‘feed-in tariffs’ (FIT) and upfront purchase subsidies to encourage consumers to self-generate parts of their power requirements by solar energy. We quantitatively study the impact of jurisdiction-specific solar radiation profile, the typical residential loads, the cost of system components, the price of grid electricity, and incentive programs on photovoltaic (PV) and storage system profitability in Germany, Ontario, and Austin, Texas. In each jurisdiction, for a range of PV and storage system sizes, we compute the optimal use of the system, and hence the best possible profitability of that system in that jurisdiction over a 20 year life span. This methodology allows us to quantitatively estimate the influence of a jurisdiction on the (best possible) profitability of PV-storage systems. We find that the choice of jurisdiction has significant impact on the profitability of PV-storage systems. We also find that policy makers can use the price of grid electricity as well as upfront subsidies to influence profitability, and therefore adoption.

+ View Abstract

Li-Ion Storage Models for Energy System Optimization: The Accuracy-Tractability Tradeoff

Fiodar Kazhamiaka, Catherine Rosenberg, Srinivasan Keshav, and Karl-Heinz Pettinger

ACM e-Energy (2016)

There is a need for accurate analytical models that describe how a Lithium-ion battery’s state of charge evolves as a result of a charging or discharging operation and that can be used in optimization problems. Although ‘white box’ models that take into account the details of electro-chemical processes can be highly accurate, they are not typically suitable for optimization problems. We propose two models that represent different trade-offs between accuracy and tractability. We validate the accuracy of these models with data traces obtained from extensive experiments using two different commercially-available cells based on two distinct Li-Ion technologies. We find that one of our models can be easily adopted for use in a mathematical optimization problem, while significantly increasing the range of C-rates over which it is accurate (<5% error) compared to the models that are currently being used.

+ View Abstract

Practical Strategies for Storage Operation in Energy Systems: Design and Evaluation

Fiodar Kazhamiaka, Catherine Rosenberg, and Srinivasan Keshav

IEEE Transactions on Sustainable Energy (2016)

Motivated by the increase in small-scale solar installations used for powering homes and small businesses, we consider the design of rule-based strategies for operating an energy storage device connected to a self-use solar generation system to minimize payments to the grid. This problem is inherently challenging, since strategies depend greatly on the choice of the tariff structure and forecasts of future generation and load. We propose an optimization framework for finding optimal operation strategies and use it to evaluate the performance of an existing operating strategy that we modified to not use forecasts, in the context of differential pricing. We also use our framework to propose a new practical operating strategy for peak-demand pricing. We simulate the two rule-based strategies using real data for solar generation and building load, and find that they are able to achieve near-optimal performance without requiring forecasts.

+ View Abstract

Mayflower: Improving Distributed Filesystem Performance Through SDN/Filesystem Co-Design

Sajjad Rizvi, Xi Li, Bernard Wong, Fiodar Kazhamiaka, and Benjamin Cassell

IEEE International Conference on Distributed Computing Systems (2016)

In this paper, we introduce Mayflower, a new distributed filesystem that is co-designed from the ground up to work together with a network control plane. In addition to the standard distributed filesystem components, Mayflower includes a flow monitor and manager running alongside a software-defined networking controller. This tight coupling with the network controller enables Mayflower to make intelligent replica selection and flow scheduling decisions based on both filesystem and network information. Mayflower can perform global optimizations that are unavailable to conventional network-aware distributed filesystems and network control planes. Our evaluation results from both simulations and a prototype implementation show that Mayflower reduces average read completion time by more than 25% compared to current state-of-the-art distributed filesystems with an independent network flow scheduler, and more than 75% compared to HDFS with ECMP.

+ View Abstract

Energy Supply Aware Power Planning for Flexible Loads

Florian Niedermeier, Fiodar Kazhamiaka, and Hermann de Meer

Energy Efficient Data Centers, an ACM eEnergy workshop (2016)

Increasing the use of renewable energy is considered a viable way of reducing carbon intensive power generation. However, a power grid running on high amounts of renewable energy has to deal with the limited controllability and higher volatility of power sources like wind or solar. In this work, we propose to use demand side management to deal with varying amounts of renewable power feed-in via the use of power plans, i.e. instructions passed to large energy consumers that specify how they should try to spread out their energy use over a day. We argue that a separation of power planning and implementation of technical measures to schedule loads to follow the plan would alleviate some of the problems faced by an integrated planning-scheduling approach, as these processes are governed by different entities who may be unwilling to disclose all required information to each other. As a proof-of-concept, we propose and analyze a quadratic programming approach to maximizing the fraction of renewable energy being used while not overburdening the consumer with a power plan that is difficult to follow.

+ View Abstract

Optimal Design of Renewable Farms with Storage

Yashar Ghiassi-Farrokhfal, Fiodar Kazhamiaka, Catherine Rosenberg, and Srinivasan Keshav

IEEE Transactions on Sustainable Energy (2015)

We consider the problem of allocating a capital budget to solar panels and storage to maximize the expected revenue in the context of a large-scale solar farm participating in an energy market. This problem is complex due to many factors. To begin with, solar energy production is stochastic, with a high peak-to-average ratio, thus the access link is typically provisioned at less than peak capacity, leading to the potential waste of energy due to curtailment. The use of storage prevents power curtailment, but the allocation of capital to storage reduces the amount of energy produced. Moreover, energy storage devices are imperfect. A solar farm owner is thus faced with two problems: 1)deciding the level of power commitment and 2) the operation of storage to meet this commitment. We formulate two problems corresponding to two different power commitment approaches, an optimal one and a practical one, and show that the two problems are convex, allowing efficient solutions. Numerical examples show that our practical power commitment approach is close to optimal and also provide several other engineering insights.

+ View Abstract

Hey, you found me! Hope the rest of your day is this lucky!