Francisco Perez-Sorrosal
I’m a trustful, goal-oriented, and detail-focused professional with experience in both research and software engineering, particularly in the fields of machine learning (ML), artificial intelligence (AI), and distributed systems. As a Research Engineer, I love exploring and implementing cutting-edge technologies, particularly in AI/ML, to tackle complex challenges and foster innovation. As a Software Engineer, I can design, develop, and ensure the maintainability of robust and scalable big data applications, whether on-premises or cloud platforms, always prioritizing high availability, scalability, performance, and reliability. I am adaptable, capable of taking initiative and leading efforts with a decisive and strategic approach when needed, or collaborating as a communicative, supportive, and empathetic team player to achieve shared goals. I thrive in open, dynamic, multicultural, and multidisciplinary environments, where I integrate visionary and practical insights to deliver impactful results while empowering others and fostering a spirit of collaboration.
Currently, I am deeply interested in agentic technologies and decision making, which I believe will play a transformative role in shaping the next generation of applications set to revolutionize the (software) industry in the coming years. In my opinion, these technologies, driven by advancements in AI research and software/hardware engineering, have the potential to enable systems that are capable of proactive decision-making, continuous learning, and adaptive behavior. My focus is on exploring how these capabilities can be harnessed to build intelligent, interactive, and dynamic applications that redefine user experiences and operational efficiency.
Since the popularization of Deep Learning around 2017, I have had the opportunity to expand my expertise into the fields of machine learning/artificial intelligence, focusing primarily on natural language processing (NLP). I began by utilizing classical algorithms such as Support Vector Machines (SVMs) and Random Forests for binary classification tasks, including detecting clickbait news articles. In those projects, I had the opportunity to explore feature engineering and model optimization to achieve high accuracy and robust performance.
Building on this foundation, I transitioned to deep learning, where I designed and trained neural networks for more complex classification tasks, including multiclass and multilabel problems. My experience includes experimenting with custom architectures tailored to specific datasets, leveraging state-of-the-art frameworks such as TensorFlow and later PyTorch, and fine-tuning models to optimize metrics such as precision, recall, and computational efficiency.
Around 2019, I also ventured into generative AI, working on early proofs of concept for binary classification tasks using T5 and exploring its capabilities in NLP. Later, with the generalization of large language models (LLMs), I successfully fine-tuned an LLM for language style adaptation.
Through my involvement in these diverse range of ML/AI projects, I aimed to deepen my understanding and enhance my ability to tackle new complex and challenging problem domains using both traditional and generative AI approaches. Additionally, in collaboration with my colleagues, we achieved research contributions, including publications in well-know conferences/journals in the field, such as ACL, SIGKDD, and TKDE.
I was a core contributor to Apache Omid, an open-source transactional system for the HBASE DB, where I played a pivotal role in enhancing its scalability, reliability, and overall functionality. My work included designing and implementing high-availability mechanisms, ensuring fault tolerance and uninterrupted service in distributed environments. I contributed to the optimization of Omid’s commit protocol, significantly improving its latency and throughput while preserving transactional consistency and isolation guarantees. Apart of its critical role in [Yahoo]](https://www.yahoo.com/)’s content ingestion platform, Omid’s success and robustness in the open-source community ultimately led to its integration into Apache Phoenix, where it continues to enable transactional guarantees for large-scale, cloud-native applications. At Yahoo, I’ve developed also an incremental processing framework and other scalable services (e.g. a Twitter Firehose.)
In the past, I contributed to several European and Spanish national research projects, including CumuloNimbo, ADAPT, S4All, AUTONOMIC, and NEXOF. Across these initiatives, I designed and developed highly available and scalable architectures and software systems. Notable achievements include implementing a transaction-aware replicated cache for application servers, open-sourcing implementations of the JEE Activity Service and WS-CAF specifications, and building replication frameworks to ensure high availability for stateful services. Additionally, while working with Atos/BULL SAS France, I implemented replication and recovery protocols for their JEE application server, JOnAS, enhancing its reliability and robustness.
I earned my Ph.D. from Universidad Politecnica de Madrid (UPM) while working at the Distributed Systems Lab (LSD). My Ph.D. Thesis focused on designing protocols for consistent replication in transaction-aware middleware. During this time, I published research in prestigious conferences and journals, including WWW, ICDCS, Middleware, and the VLDB Journal. Additionally, I completed an internship at INRIA (France), where I contributed to integrating autonomic computing features into my projects, further enhancing their adaptability and resilience.
Prior to my time at LSD, I spent two academic years as a lecturer at the Madrid Campus of the School of CS, Universidad Pontificia de Salamanca. There, I taught undergraduate students foundational courses in programming and operating systems. Earlier in my career, I served as a UNIX System Administrator at the same university and worked as a Quality Analyst (QA), focusing on the database modules of Meta4’s ERP suite, ensuring their reliability and performance.
In addition to the projects above, I also gained hands-on experience as a freelance developer, where I successfully designed and built a Ruby on Rails web application to streamline operations and management for a family-owned greenhouse.
Beyond my core professional focus, but some way related to it, I have a strong interest in the intersection of fields such as neuroscience, psychology, decision-making, cognitive sciences, learning techniques, behavioral economics, and philosophy, as they offer valuable insights into human behavior, intelligence, and how the brain works (and why!) I am deeply fascinated by the mechanisms of human thought, learning, and behavior, and how these insights can inform and inspire advancements in AI and our daily lives and wellbeing in general. In my personal life, I enjoy staying active through activities like running, mountain biking, yoga, cold-plunging, and playing tennis, which not only keep me physically fit but also provide a sense of balance and focus.
… Some of those moments will be lost in time, like tears in rain; It is always time to move forward and create new things!!! :-)