Herdianto, Bachtiar Billot, Romain Lucas, Flavien Sevaux, Marc
We propose a metaheuristic algorithm enhanced with feature-based guidance that is designed to solve the Capacitated Vehicle Routing Problem (CVRP). To formulate the proposed guidance, we developed and explained a supervised Machine Learning (ML) model, that is used to formulate the guidance and control the diversity of the solution during the optim...
Zou, Xinying Esnaola, Iñaki Altman, Eitan
In this paper, the worst-case probability measure over the data is introduced as a tool for characterizing the generalization capabilities of machine learning algorithms. More specifically, the worst-case probability measure is a Gibbs probability measure and the unique solution to the maximization of the expected loss under a relative entropy cons...
Barış-Tüzemen, Özge Tüzemen, Samet Çelik, Ali Kemal
Published in
European Journal of Tourism, Hospitality and Recreation
The Cappadocia region is one of the most popular tourist destinations in Turkey, and its tourism sector has a significant share in the Turkish economy. In this study, we scraped TripAdvisor reviews of visitors of the Cappadocia region with the Python programming language and used them to analyse public sentiment using various supervised machine lea...
Möller, A.M. Vermeer, Susan A.M. Baumgartner, Susanne E.
Social scientists often study comments on YouTube to learn about people’s attitudes towards and experiences of online videos. However, not all YouTube comments are relevant in the sense that they reflect individuals’ thoughts about, or experiences of the content of a video or its artist/maker. Therefore, the present paper employs Supervised Machine...
Westerholm, Ludvig
In this project, we explored the usage of machine learning in classifying portable electronic devices. The primary objective was to identify devices such as laptops, smartphones, and tablets, based on their physical and technical specification. These specifications, sourced from the Pricerunner price comparison website, contain height, Wi-Fi standa...
Zou, Xinying Esnaola, Iñaki Altman, Eitan Poor, H. Vincent
The worst-case data-generating (WCDG) probability measure is introduced as a tool for characterizing the generalization capabilities of machine learning algorithms. Such a WCDG probability measure is shown to be the unique solution to two different optimization problems: $(a)$ The maximization of the expected loss over the set of probability measur...
McTeer, Matthew Applegate, Douglas Mesenbrink, Peter Ratziu, Vlad Schattenberg, Jörn M Bugianesi, Elisabetta Geier, Andreas Romero Gomez, Manuel Dufour, Jean-Francois Ekstedt, Mattias
...
AIMS: Metabolic dysfunction Associated Steatotic Liver Disease (MASLD) outcomes such as MASH (metabolic dysfunction associated steatohepatitis), fibrosis and cirrhosis are ordinarily determined by resource-intensive and invasive biopsies. We aim to show that routine clinical tests offer sufficient information to predict these endpoints. METHODS: Us...
Al Ghadban, Yasmina Du, Yuheng Charnock-Jones, D Stephen Garmire, Lana X Smith, Gordon CS Sovio, Ulla
OBJECTIVES: To identify and internally validate metabolites predictive of spontaneous preterm birth (sPTB) using multiple machine learning methods and sequential maternal serum samples, and to predict spontaneous early term birth (sETB) using these metabolites. DESIGN: Case-cohort design within a prospective cohort study. SETTING: Cambridge, UK. PO...
Ding, Ruiwen Yadav, Anil Rodriguez, Erika Araujo Lemos da Silva, Ana Hsu, William
Lung adenocarcinoma (LUAD) is a morphologically heterogeneous disease with five predominant histologic subtypes. Fully supervised convolutional neural networks can improve the accuracy and reduce the subjectivity of LUAD histologic subtyping using hematoxylin and eosin (H&E)-stained whole slide images (WSIs). However, developing supervised models w...
Durango, María C Torres-Silva, Ever A Orozco-Duque, Andrés
Published in
Healthcare informatics research
A substantial portion of the data contained in Electronic Health Records (EHR) is unstructured, often appearing as free text. This format restricts its potential utility in clinical decision-making. Named entity recognition (NER) methods address the challenge of extracting pertinent information from unstructured text. The aim of this study was to o...