Resumen de: WO2024200467A1
The recommender system uses as input a new use case description (UCD), which is a free-form text description of a new industrial use case for machine learning. A set of available use cases is initialized by filling it with previous industrial use cases for machine learning. A graph database provides a knowledge graph (KG) containing for each available use case at least one attribute, and a machine learning pipeline that has been suitable. A language model (LM) computes a vector space embedding for the new use case description and for available use case descriptions. Relevance scores are computed for each available use case in the embedding space. A questionnaire component (QC) iteratively selects questions, which are binary questions corresponding to an attribute in the knowledge graph that splits the set of available use cases into two sets, wherein the sums of the relevance scores of the available use cases in the two sets are approximately the same, and incrementally filters the available use cases in the knowledge graph by removing use cases from the set of available use cases based on the answer to the question. If there is only one available use case remaining or if a user interface (UI) detects a selection of one of the available use cases, the machine learning pipeline (MLP) that is linked in the knowledge graph to that use case is recommended to a user (U).
Resumen de: US2024330781A1
Provided are systems for ensemble learning with machine learning models that include a processor to receive a training dataset of a plurality of data instances, wherein each data instance comprises a time series of data points, add an amount of time delay to one or more data instances to provide an augmented training dataset, select a first plurality of supervised machine learning models, select a second plurality of unsupervised machine learning models, train the first plurality of supervised machine learning models and the second plurality of unsupervised machine learning models based on the augmented training dataset, generate an ensemble machine learning model based on outputs of the supervised machine learning models and unsupervised machine learning models, and generate a runtime output of the ensemble machine learning model based on a runtime input to the ensemble machine learning model. Methods and computer program products are also provided.
Resumen de: WO2024201122A1
A computer-implemented method for generating and/or adjusting a heuristic function for a machine learning prediction. A machine learning prediction task including a target entity and attribute is received. Semantic relations are explored to generate relevant entities and attributes related to the target entity and attribute. The heuristic function is generated and/or adjusted based on the relevant entities and attributes.
Resumen de: WO2024206001A1
An online concierge system receives, from a client device associated with a user of the online concierge system, order data associated with an order placed with the online concierge system, in which the order data describes a delivery location for the order. The online concierge system receives information describing a set of attributes associated with the delivery location and accesses a machine learning model trained to predict a difference between an arrival time and a delivery time for the delivery location. The online concierge system applies the model to the set of attributes associated with the delivery location to predict the difference between the arrival time and the delivery time for the delivery location and determines an estimated delivery time for the order based at least in part on the predicted difference. The online concierge system sends the estimated delivery time for the order for display to the client device.
Resumen de: US2024330497A1
A computer-implemented method for performing automated sharing of data and analytics across a data space platform includes receiving a request for a data analytics service from a first data stakeholder and providing an initial analysis to the first data stakeholder based on determining a portion of semantic data of the data space platform that is accessible to the first data stakeholder. The initial analysis is updated based on comparing the portion of semantic data with another portion of semantic data of the data space platform that is accessible to a second data stakeholder. The updated analysis is provided to the first data stakeholder. The method can be applied to machine learning and regression problems (continuous values) including, but not limited to, providing improvements to various technical fields such as medical diagnosis and treatment, operation system design and optimization, material design and optimization, telecommunication network design and optimization.
Resumen de: US2024333763A1
An AI adversary red team configured to pentest email and/or network defenses implemented by a cyber threat defense system used to protect an organization and all its entities. AI model(s) trained with machine learning on contextual knowledge of the organization and configured to identify data points from the contextual knowledge including language-based data, email/network connectivity and behavior pattern data, and historic knowledgebase data. The trained AI models cooperate with an AI classifier in producing specific organization-based classifiers for the AI classifier. A phishing email generator generates automated phishing emails to pentest the defense systems, where the phishing email generator cooperates with the AI models to customize the automated phishing emails based on the identified data points of the organization and its entities. The customized phishing emails are then used to initiate one or more specific attacks on one or more specific users associated with the organization and its entities.
Resumen de: US2024330130A1
Herein is machine learning for anomalous graph detection based on graph embedding, shuffling, comparison, and unsupervised training techniques that can characterize an unfamiliar graph. In an embodiment, a computer obtains many known vectors that respectively represent known graphs. A new vector is generated that represents a new graph that contains multiple vertices. The new vector may contain an arithmetic aggregation of vertex vectors that respectively represent multiple vertices and/or a vector that represents a virtual vertex that is connected to the multiple vertices by respective virtual edges. In the many known vectors, some similar vectors that are similar to the new vector are identified. The new graph is automatically characterized based on a subset of the known graphs that the similar vectors represent.
Resumen de: US2024330102A1
The present disclosure enables signal error correction using a first processor and a memory on a first substrate, where the first processor is operationally connected to a second processor on a second substrate and the memory stores computer code having a machine learning model. The first processor executes computer code to: automatically receive from the second processor, a first output signal intended to be received by a target recipient device. The first processor automatically inputs the first output signal into the machine learning model, where the machine learning model determines that the first output signal includes an error signal that would cause a malfunction in the target recipient device, and output an instruction to cause the first processor to generate a second output signal that corrects the error signal. The first processor automatically generates the second output signal and transmits the second output signal to the target recipient device.
Resumen de: US2024330281A1
Systems and methods for determining a query for a data store are described. A natural language text may be analyzed using heuristic processing and one or more machine learning models. Query parameters may be determined from the heuristic processing and machine learning and combined to form a query in a query language. In the heuristic processing, parsing rules may be used to remove conditional terms to generate a base question. The base question may be input to the one or more machine learning model to generate a base query which may be combined with query parameters related to the conditional terms.
Resumen de: US2024329751A1
This document relates to employing tongue gestures to control a computing device, and training machine learning models to detect tongue gestures. One example relates to a method or technique that can include receiving one or more motion signals from an inertial sensor. The method or technique can also include detecting a tongue gesture based at least on the one or more motion signals, and outputting the tongue gesture.
Resumen de: US2024331866A1
Techniques for real-time intradialytic hypotension (IDH) prediction are disclosed. A system obtains historical hemodialysis treatment data that is segmented into sets of machine learning training data based on temporal proximities to IDH events and trains a machine learning model to predict IDH events based on the sets of machine learning training data.
Resumen de: US2024330756A1
A computer-implemented method for developing a hierarchical machine-learning pipeline can include receiving a hierarchy specification, a set of estimators for the root node, and one or more transformer options for each of the transformer nodes. The hierarchy specification provides a configuration of the root node, transformer nodes, and edges interconnecting the root and transformer nodes. A rank can be obtained for each estimator in the root node. A hierarchy pipeline traverser can then traverse a first child layer of the transformer nodes connected to the root node via one of the edges. A first ranked list of pathways can be determined with respect to the one or more transformer options selected for the first child layer and at least one selected estimator of the root node.
Resumen de: CN118284896A
Architecture and techniques for providing promotable machine learning recommendations in conjunction with a medical protocol, such as a radiology protocol. In response to receiving a medical examination order request in a standardized input format, the system may output a recommended protocol in accordance with a standardized output format based on machine learning techniques. The system may then execute a mapping procedure that maps site-specific data to the standardized input format and the standardized output format. The site-specific data may include information specific to an entity providing the medical examination order request.
Resumen de: AU2022397403A1
The present disclosure relates to computer-implemented methods and systems for identifying biosynthetic gene clusters (BGCs) that encode pathways for the production of secondary metabolites. Secondary metabolites that target genes or gene products that are homologous to, e.g., human genes or gene products may have utility as potential drug compounds.
Resumen de: EP4439403A2
Systems and methods are provided for generating training data for a machine-learning classifier. A knowledge representation synthesized based on an object of interest is used to assign labels to content items. The labeled content items can be used as training data for training a machine learning classifier. The labeled content items can also be used as validation data for the classifier.
Resumen de: EP4439401A1
The recommender system uses as input a new use case description (UCD), which is a free-form text description of a new industrial use case for machine learning. A set of available use cases is initialized by filling it with previous industrial use cases for machine learning. A graph database provides a knowledge graph (KG) containing for each available use case at least one attribute, and a machine learning pipeline that has been suitable. A language model (LM) computes a vector space embedding for the new use case description and for available use case descriptions. Relevance scores are computed for each available use case in the embedding space. A questionnaire component (QC) iteratively selects questions, which are binary questions corresponding to an attribute in the knowledge graph that splits the set of available use cases into two sets, wherein the sums of the relevance scores of the available use cases in the two sets are approximately the same, and incrementally filters the available use cases in the knowledge graph by removing use cases from the set of available use cases based on the answer to the question. If there is only one available use case remaining or if a user interface (UI) detects a selection of one of the available use cases, the machine learning pipeline (MLP) that is linked in the knowledge graph to that use case is recommended to a user (U).
Resumen de: US2024320036A1
A data processing method and system for automated construction, resource provisioning, data processing, feature generation, architecture selection, pipeline configuration, hyperparameter optimization, evaluation, execution, production, and deployment of machine learning models in an artificial intelligence solution development lifecycle. In accordance with various embodiments, a graphical user interface of an end user application is configured to provide a pre-configured template comprises an automated ML framework for data import, data preparation, data transformation, feature generation, algorithms selection, hyperparameters tuning, models training, evaluation, interpretation, and deployment to an end user. A configurable workflow is configured 10 to enable a user to assemble one or more transmissible AI build/products containing one or more pipelines and/or ML models for executing one or more AI solutions. Embodiments of the present disclosure may enable full serialization and versioning of all entities relating to an AI build/product for deployment within an enterprise architecture.
Resumen de: WO2024194871A1
The described invention is directed to systems and methods capable of identifying Machine Learning (ML) models that are potentially biased. The system obtains: (a) a list of potentially problematic labels, and (b) at least one code segment, including a plurality of code lines containing one or more commands associated with generating at least one machine learning model from a given data structure. The system extracts the actual labels of the given data structure and compares them to the list of potentially problematic labels. Upon a match between at least one of the extracted actual labels and at least one of the potentially problematic labels, the system performs an action associated with the knowledge that the ML model is potentially biased.
Resumen de: US2024320526A1
A computerized system and method for health care facilities to reduce manual handling of at least some open account issues. The system provides healthcare facilities with the ability to resolve current open patient account issues by utilizing the data patterns from a facility's historical patient account transaction activity, to create a machine learning model that can predict resolutions to the open accounts. These patterns are then applied to a facility's current transaction data providing next step resolution to each patient account.
Resumen de: US2024320507A1
Aspects relate to a privacy preserving public machine learning model that achieves high performance while maintaining data privacy. Further aspects relate to a weighted knowledge transfer device including a feature determination unit to generate a public knowledge transfer dataset and a private knowledge transfer dataset; a data selection unit to generate, based on a similarity calculation of the public knowledge transfer dataset and the private knowledge transfer dataset, a public training dataset and a similarity weight vector; a machine learning model management unit to generate, by processing the public training dataset with a set of machine learning models trained based on the private knowledge transfer dataset, a public label vector that indicates labels for the set of public features; and a knowledge transfer unit to generate a public machine learning model based on the weight vector, the public training dataset, and the public label vector.
Resumen de: US2024320552A1
An evaluation result of training data to be used to train a machine learning model is presented.An information processing method that performs processing related to the training data to be used to train the machine learning model includes a determination step of determining a characteristic of each piece of the training data on the basis of an inference result of the machine learning model for the training data, and a presentation step of presenting the evaluation result of the training data based on the determined characteristic. In the determination step, a physical characteristic such as mass, a size, or acting force including attractive force and repulsive force of an object corresponding to the training data is determined on the basis of an expected value for each label output by the machine learning model for the training data.
Resumen de: US2024320252A1
A computer-based system may engineer features based on semantic types. The computer-based system may implement deep learning algorithms and derive a domain-specific feature engineering strategy from semantic type predictions and data profiling. The computer-based system may utilize embedded domain (e.g., financial industry, etc.) knowledge to generate curated features from raw data (e.g., transactional datasets, relational datasets, etc.).
Resumen de: US2024321129A1
A computer implemented method includes accessing instructional content that describes a task for completion by a user. Actions described in the instructional content are derived from the instructional content. Telemetry containing logged actions taken by users is accessed and used to identify actions taken that are associated with the task. A machine learning model is used to identify a task completion path endpoint for the instructional content based on the derived actions and actions taken associated with the task.
Resumen de: US2024323156A1
Methods and systems are provided for facilitating time zone prediction using electronic communication data. Electronic message data associated with a message recipient of electronic communications is obtained. The electronic message data includes message delivery data associated with an electronic message and message response data associated with a response, by the message recipient, to a received electronic message. Using a machine learning model and based on the message delivery data and the message response data, a time-zone score is determined for a time zone. Such a time-zone score can indicate a probability the time zone corresponds with the message recipient. Based on the time-zone score, the time zone is identified as corresponding with the message recipient.
Nº publicación: US2024323093A1 26/09/2024
Solicitante:
FORESCOUT TECH INC [US]
FORESCOUT TECHNOLOGIES, INC
Resumen de: US2024323093A1
Systems, methods, and related technologies for classifying a device on a network are described. A method includes capturing device information corresponding to a device on a network. The method inputs unstructured crowdsourced data on the network into a machine learning model to produce structured crowdsourced data. The method classifies the device based on evaluating the device information with the structured crowdsourced data.