PerOS: Personalized Self-Adapting Operating Systems in the Cloud
Abstract: Operating systems (OSes) are foundational to computer systems, managing hardware resources and ensuring secure environments for diverse applications. However, despite their enduring importance, the fundamental design objectives of OSes have seen minimal evolution over decades. Traditionally prioritizing aspects like speed, memory efficiency, security, and scalability, these objectives often overlook the crucial aspect of intelligence as well as personalized user experience. The lack of intelligence becomes increasingly critical amid technological revolutions, such as the remarkable advancements in ML. Today's personal devices, evolving into intimate companions for users, pose unique challenges for traditional OSes like Linux and iOS, especially with the emergence of specialized hardware featuring heterogeneous components. Furthermore, the rise of LLMs in ML has introduced transformative capabilities, reshaping user interactions and software development paradigms. While existing literature predominantly focuses on leveraging ML methods for system optimization or accelerating ML workloads, there is a significant gap in addressing personalized user experiences at the OS level. To tackle this challenge, this work proposes PerOS, a personalized OS ingrained with LLM capabilities. PerOS aims to provide tailored user experiences while safeguarding privacy and personal data through declarative interfaces, self-adaptive kernels, and secure data management in a scalable cloud-centric architecture; therein lies the main research question of this work: How can we develop intelligent, secure, and scalable OSes that deliver personalized experiences to thousands of users?
- G. AG. Openvas - open vulnerability assessment scanner. https://www.openvas.org/, 2023.
- Firecracker: Lightweight virtualization for serverless applications. In Symposium on Networked Systems Design and Implementation, 2020. URL https://api.semanticscholar.org/CorpusID:211567076.
- High-definition routing congestion prediction for large-scale fpgas. 2020 25th Asia and South Pacific Design Automation Conference (ASP-DAC), pages 26–31, 2020. URL https://api.semanticscholar.org/CorpusID:214692445.
- Reducing power consumption of mobile thin client devices. 2009. URL https://api.semanticscholar.org/CorpusID:37738031.
- R. S. Amant and L. Zettlemoyer. User interface softbots. In AAAI/IAAI, 2000. URL https://api.semanticscholar.org/CorpusID:5413725.
- Low-power, high-performance analog neural branch prediction. 2008 41st IEEE/ACM International Symposium on Microarchitecture, pages 447–458, 2008. URL https://api.semanticscholar.org/CorpusID:14553490.
- Amazon. Amazon mechanical turk. https://www.mturk.com, 2023.
- AMD. Adaptive soc for any application from cloud to edge. https://www.xilinx.com/products/silicon-devices/acap/versal.html, 2023.
- Apple. Apple unveils m3, m3 pro, and m3 max, the most advanced chips for a personal computer. https://www.apple.com/newsroom/2023/10/apple-unveils-m3-m3-pro-and-m3-max-the-most-advanced-chips-for-a-personal-computer/, 2023.
- Energy-aware resource scheduling for serverless edge computing. 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet Computing (CCGrid), pages 190–199, 2022. URL https://api.semanticscholar.org/CorpusID:250715298.
- J. Axboe. Flexible i/o tester. https://github.com/axboe/fio, 2023.
- N. Bansal and M. Harchol-Balter. Analysis of srpt scheduling: investigating unfairness. In Measurement and Modeling of Computer Systems, 2001. URL https://api.semanticscholar.org/CorpusID:6418437.
- Cypress: input size-sensitive container provisioning and request scheduling for serverless platforms. Proceedings of the 13th Symposium on Cloud Computing, 2022. URL https://api.semanticscholar.org/CorpusID:253385775.
- Packet routing in dynamically changing networks: A reinforcement learning approach. In Neural Information Processing Systems, 1993. URL https://api.semanticscholar.org/CorpusID:364332.
- Netherite: Efficient execution of serverless workflows. Proc. VLDB Endow., 15:1591–1604, 2022. URL https://api.semanticscholar.org/CorpusID:249916915.
- Evidence-based static branch prediction using machine learning. ACM Trans. Program. Lang. Syst., 19:188–222, 1997. URL https://api.semanticscholar.org/CorpusID:8865665.
- Quality of experience in remote virtual desktop services. 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM 2013), pages 1352–1357, 2013. URL https://api.semanticscholar.org/CorpusID:10213123.
- Intelligent interfaces through interactive planners. Interact. Comput., 12:545–564, 2000. URL https://api.semanticscholar.org/CorpusID:5527993.
- Building accountability into the internet of things: the iot databox model. Journal of Reliable Intelligent Environments, 4:39–55, 2018.
- Fast and accurate estimation of quality of results in high-level synthesis with machine learning. 2018 IEEE 26th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM), pages 129–132, 2018. URL https://api.semanticscholar.org/CorpusID:5078393.
- Thin client computing solutions in low-and high-motion scenarios. In International Conference on Networking and Services (ICNS’07), pages 38–38. IEEE, 2007.
- Qlora: Efficient finetuning of quantized llms. ArXiv, abs/2305.14314, 2023. URL https://api.semanticscholar.org/CorpusID:258841328.
- A q-learning based self-adaptive i/o communication for 2.5d integrated many-core microprocessor and memory. IEEE Transactions on Computers, 65:1185–1196, 2016. URL https://api.semanticscholar.org/CorpusID:30396137.
- A circuit-architecture co-optimization framework for exploring nonvolatile memory hierarchies. ACM Transactions on Architecture and Code Optimization (TACO), 10:1 – 22, 2013. URL https://api.semanticscholar.org/CorpusID:624945.
- O. Etzioni and D. S. Weld. A softbot-based interface to the internet. Commun. ACM, 37:72–76, 1994. URL https://api.semanticscholar.org/CorpusID:2447472.
- Os agents: Using ai techniques in the operating system environment. 1993. URL https://api.semanticscholar.org/CorpusID:11991416.
- Operating system scheduling on heterogeneous core systems. 2007. URL https://api.semanticscholar.org/CorpusID:14823905.
- Dynamic voltage and frequency scaling in nocs with supervised and reinforcement learning techniques. IEEE Transactions on Computers, 68:375–389, 2019. URL https://api.semanticscholar.org/CorpusID:52840323.
- An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems. In Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems, pages 3–18, 2019.
- Horizon: Facebook’s open source applied reinforcement learning platform. ArXiv, abs/1811.00260, 2018. URL https://api.semanticscholar.org/CorpusID:53204036.
- Google. Write your application as a modular binary. deploy it as a set of microservices. https://serviceweaver.dev, 2023.
- H. Hè. In-vitro serverless clusters.
- G. Heiser. The sel4 microkernel–an introduction. The seL4 Foundation, 1, 2020.
- F. Hublet. The databank model. Master’s thesis, ETH Zurich, 2021.
- Energy-efficient application resource scheduling using machine learning classifiers. Proceedings of the 47th International Conference on Parallel Processing, 2018. URL https://api.semanticscholar.org/CorpusID:195348382.
- A. A. T. Isstaif and R. Mortier. Towards latency-aware linux scheduling for serverless workloads. Proceedings of the 1st Workshop on SErverless Systems, Applications and MEthodologies, 2023. URL https://api.semanticscholar.org/CorpusID:258486714.
- Towards more intelligent and interactive interfaces. In The Florida AI Research Society, 1999. URL https://api.semanticscholar.org/CorpusID:13371719.
- D. A. Jiménez and C. Lin. Dynamic branch prediction with perceptrons. Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture, pages 197–206, 2001. URL https://api.semanticscholar.org/CorpusID:3184222.
- Learning-based application-agnostic 3d noc design for heterogeneous manycore systems. IEEE Transactions on Computers, 68:852–866, 2018. URL https://api.semanticscholar.org/CorpusID:53046469.
- D.-C. Juan and D. Marculescu. Power-aware performance increase via core/uncore reinforcement control for chip-multiprocessors. In International Symposium on Low Power Electronics and Design, 2012. URL https://api.semanticscholar.org/CorpusID:15136374.
- Hermod: principled and practical scheduling for serverless functions. Proceedings of the 13th Symposium on Cloud Computing, 2022. URL https://api.semanticscholar.org/CorpusID:253385680.
- Communication-efficient graph neural networks with probabilistic neighborhood expansion analysis and caching. ArXiv, abs/2305.03152, 2023. URL https://api.semanticscholar.org/CorpusID:258546707.
- Cascadexml: Rethinking transformers for end-to-end multi-resolution training in extreme multi-label classification. ArXiv, abs/2211.00640, 2022. URL https://api.semanticscholar.org/CorpusID:253255408.
- J. Kim and K. Lee. Practical cloud workloads for serverless faas. In Proceedings of the ACM Symposium on Cloud Computing, pages 477–477, 2019.
- The case for learned index structures. Proceedings of the 2018 International Conference on Management of Data, 2017. URL https://api.semanticscholar.org/CorpusID:6038777.
- Tab2know: Building a knowledge base from tables in scientific papers. In International Workshop on the Semantic Web, 2020a. URL https://api.semanticscholar.org/CorpusID:226229411.
- Tab2know: Building a knowledge base from tables in scientific papers. In The Semantic Web–ISWC 2020: 19th International Semantic Web Conference, Athens, Greece, November 2–6, 2020, Proceedings, Part I 19, pages 349–365. Springer, 2020b.
- Kubernetes. Production-grade container orchestration. https://kubernetes.io, 2023.
- Thin-client computing for supporting the qos of streaming media in mobile devices. In Communication Systems and Applications, 2006. URL https://api.semanticscholar.org/CorpusID:394966.
- Muse: Secure inference resilient to malicious clients. In IACR Cryptology ePrint Archive, 2021. URL https://api.semanticscholar.org/CorpusID:232216360.
- On learning-based methods for design-space exploration with high-level synthesis. 2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC), pages 1–7, 2013. URL https://api.semanticscholar.org/CorpusID:12059048.
- G. Lyon. Nmap: the network mapper - free security scanner. https://nmap.org/, 2023.
- Comparison of two ict solutions: desktop pc versus thin client computing. The International Journal of Life Cycle Assessment, 18:861–871, 2013.
- Remote access protocols for desktop-as-a-service solutions. PLoS ONE, 14, 2019. URL https://api.semanticscholar.org/CorpusID:58022821.
- Orion and the three rights: Sizing, bundling, and prewarming for serverless dags. In USENIX Symposium on Operating Systems Design and Implementation, 2022a. URL https://api.semanticscholar.org/CorpusID:252819711.
- Wisefuse: Workload characterization and dag transformation for serverless workflows. Abstract Proceedings of the 2022 ACM SIGMETRICS/IFIP PERFORMANCE Joint International Conference on Measurement and Modeling of Computer Systems, 2022b. URL https://api.semanticscholar.org/CorpusID:249282028.
- N. Mahmoudi and H. Khazaei. Performance modeling of serverless computing platforms. IEEE Transactions on Cloud Computing, 10:2834–2847, 2022. URL https://api.semanticscholar.org/CorpusID:226587161.
- Microsoft. Discover the power of ai with copilot in windows. https://www.microsoft.com/en-us/windows/copilot-ai-features, 2023.
- P. Mohassel and Y. Zhang. Secureml: A system for scalable privacy-preserving machine learning. 2017 IEEE Symposium on Security and Privacy (SP), pages 19–38, 2017. URL https://api.semanticscholar.org/CorpusID:11605311.
- H. Moore. Metasploit. https://www.metasploit.com/, 2023.
- Ray: A distributed framework for emerging {{\{{AI}}\}} applications. In 13th USENIX symposium on operating systems design and implementation (OSDI 18), pages 561–577, 2018.
- A comparison of thin-client computing architectures. 2000.
- OpenAI. Gpt-4 turbo. https://help.openai.com/en/articles/8555510-gpt-4-turbo, 2023.
- Late breaking results: An efficient learning-based approach for performance exploration on analog and rf circuit synthesis. 2019 56th ACM/IEEE Design Automation Conference (DAC), pages 1–2, 2019. URL https://api.semanticscholar.org/CorpusID:163164981.
- Valorising the iot databox: creating value for everyone. Transactions on Emerging Telecommunications Technologies, 28(1):e3125, 2017.
- R. P. A. Petrick. Planning for desktop services. 2007. URL https://api.semanticscholar.org/CorpusID:55963917.
- Stateful serverless computing with crucial. ACM Transactions on Software Engineering and Methodology (TOSEM), 31:1 – 38, 2022. URL https://api.semanticscholar.org/CorpusID:247300542.
- Neuro-noc: Energy optimization in heterogeneous many-core noc using neural networks in dark silicon era. 2018 IEEE International Symposium on Circuits and Systems (ISCAS), pages 1–5, 2018. URL https://api.semanticscholar.org/CorpusID:53084745.
- Serverless workflows for containerised applications in the cloud continuum. Journal of Grid Computing, 19, 2021. URL https://api.semanticscholar.org/CorpusID:235901547.
- Icebreaker: warming serverless functions better with heterogeneity. Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022. URL https://api.semanticscholar.org/CorpusID:247026551.
- Improving the qoe of citrix thin client users. 2010 IEEE International Conference on Communications, pages 1–6, 2010. URL https://api.semanticscholar.org/CorpusID:1769656.
- The interactive performance of slim: a stateless, thin-client architecture. ACM SIGOPS Operating Systems Review, 33(5):32–47, 1999.
- Serverless in the wild: Characterizing and optimizing the serverless workload at a large cloud provider. ArXiv, abs/2003.03423, 2020. URL https://api.semanticscholar.org/CorpusID:212633767.
- Learning execution through neural code fusion. ArXiv, abs/1906.07181, 2019. URL https://api.semanticscholar.org/CorpusID:189999405.
- Fireworks: a fast, efficient, and safe serverless framework using vm-level post-jit snapshot. Proceedings of the Seventeenth European Conference on Computer Systems, 2022. URL https://api.semanticscholar.org/CorpusID:247765574.
- Prebaking functions to warm the serverless cold start. Proceedings of the 21st International Middleware Conference, 2020. URL https://api.semanticscholar.org/CorpusID:228085887.
- Optimized mobile thin clients through a mpeg-4 bifs semantic remote display framework. Multimedia Tools and Applications, 61:447–470, 2011. URL https://api.semanticscholar.org/CorpusID:2181477.
- Dbos: A dbms-oriented operating system. Proc. VLDB Endow., 15:21–30, 2021. URL https://api.semanticscholar.org/CorpusID:245827586.
- H. Song. Sumpy: A fuzzy software agent. 2007. URL https://api.semanticscholar.org/CorpusID:1371386.
- SpaceX. Starlink. https://www.starlink.com/, 2023.
- Parallel virtualized memory translation with nested elastic cuckoo page tables. Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022. URL https://api.semanticscholar.org/CorpusID:246472365.
- Leaftl: A learning-based flash translation layer for solid-state drives. In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2, pages 442–456, 2023.
- Statistical user behavior detection and qoe evaluation for thin client services. Computer Science and Information Systems, 12(2):587–605, 2015.
- Minix 3: status report and current research. login: The USENIX Magazine, 2010.
- Distributed task scheduling in serverless edge computing networks for the internet of things: A learning approach. IEEE Internet of Things Journal, 9:19634–19648, 2022. URL https://api.semanticscholar.org/CorpusID:248188338.
- V. Tarasov. Filebench – a model based file system workload generator, 2018.
- I. Tenable. Nessus vulnerability scanner: Network security solution. https://www.tenable.com/products/nessus, 2023.
- Elf: An extensive, lightweight and flexible research platform for real-time strategy games. ArXiv, abs/1707.01067, 2017. URL https://api.semanticscholar.org/CorpusID:1160900.
- Benchmarking, analysis, and optimization of serverless function snapshots. Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021. URL https://api.semanticscholar.org/CorpusID:231699170.
- Accurate operation delay prediction for fpga hls using graph neural networks. 2020 IEEE/ACM International Conference On Computer Aided Design (ICCAD), pages 1–9, 2020. URL https://api.semanticscholar.org/CorpusID:221727103.
- D. Vengerov. A reinforcement learning framework for utility-based scheduling in resource-constrained systems. Future Gener. Comput. Syst., 25:728–736, 2009. URL https://api.semanticscholar.org/CorpusID:12769472.
- Flint: A platform for federated learning integration. ArXiv, abs/2302.12862, 2023. URL https://api.semanticscholar.org/CorpusID:257220077.
- Self-consistency improves chain of thought reasoning in language models. ArXiv, abs/2203.11171, 2022. URL https://api.semanticscholar.org/CorpusID:247595263.
- Piranha: A gpu platform for secure computation. In IACR Cryptology ePrint Archive, 2022. URL https://api.semanticscholar.org/CorpusID:250361679.
- T. Weibel. Smart-umass trace repository. https://traces.cs.umass.edu/index.php/Storage/Storage, 2013.
- Sol: safe on-node learning in cloud platforms. Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2022. URL https://api.semanticscholar.org/CorpusID:246275770.
- S. Whiteson and P. Stone. Corrected version : See errata section adaptive job routing and scheduling. 2005. URL https://api.semanticscholar.org/CorpusID:9636537.
- Providing occupancy as a service with databox. In Proceedings of the 1st ACM International Workshop on Smart Cities and Fog Computing, pages 29–34, 2018.
- The performance of remote display mechanisms for thin-client computing. In USENIX Annual Technical Conference, General Track, pages 131–146, 2002.
- Characterizing serverless platforms with serverlessbench. In Proceedings of the ACM Symposium on Cloud Computing, SoCC ’20. Association for Computing Machinery, 2020. doi: 10.1145/3419111.3421280. URL https://doi.org/10.1145/3419111.3421280.
- Learning driven parallelization for large-scale video workload in hybrid cpu-gpu cluster. Proceedings of the 47th International Conference on Parallel Processing, 2018. URL https://api.semanticscholar.org/CorpusID:195348830.
- Sinan: Ml-based and qos-aware resource management for cloud microservices. Proceedings of the 26th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, 2021. URL https://api.semanticscholar.org/CorpusID:232118940.
- Aquatope: Qos-and-uncertainty-aware resource management for multi-stage serverless workflows. Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, 2022. URL https://api.semanticscholar.org/CorpusID:254927697.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.