Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

296 results about "Data mining" patented technology

Data mining is the process of discovering patterns in large data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and statistics with an overall goal to extract information (with intelligent methods) from a data set and transform the information into a comprehensible structure for further use. Data mining is the analysis step of the "knowledge discovery in databases" process or KDD. Aside from the raw analysis step, it also involves database and data management aspects, data pre-processing, model and inference considerations, interestingness metrics, complexity considerations, post-processing of discovered structures, visualization, and online updating.

Method, device and system for searching repeated data

An embodiment of the invention provides a method, a device and a system for searching repeated data. After a repeated-deleting file is partitioned into blocks, fingerprint information of block data is utilized to determine a metadata server responsible for search. When the fingerprint information of the block data is found to be stored in a database by search, the block data corresponding to the fingerprint information of the block data are transmitted to a shared-file system and are not required to be transmitted to the metadata server for storage. Compared with the prior art, the method, the device and the system for searching the repeated data reduce link cost to a large extent and improve system performance.
Owner:HUAWEI DIGITAL TECH (CHENGDU) CO LTD

Method and system for updating statistical data

ActiveCN101557316AAvoid inaccuraciesGuaranteed real-timeData switching networksSpecial data processing applicationsMessage queueData mining
The application discloses a method and a system for updating statistical data. The method comprises the following steps: a service system receives a service request from a user, conducts data processing for the current service and saves the service data processed into a service database and the message queue in a calculation server; the calculation server acquires the current service data processed from the message queue, conducts increment statistical computing to acquire increment statistical data of the current service processed and updates the statistical data into the statistical table database; a task server periodically traverses the service data in the service database, executes the statistical computation based on the service database to acquire the statistical data for the whole service database and applies the statistical data for the whole service database to overwrite all of the data in the statistical table database and update the statistical table database.
Owner:ALIBABA GRP HLDG LTD

Malware detection system with contextual analysis

A computerized method for detecting malware associated with an object. The method includes operations of analyzing an object to obtain a first set of attributes, where the first set of attributes include one or more characteristics associated with the object. Furthermore, the object is processed with a virtual machine to obtain a second set of attributes. The second set of attributes corresponds to one or more monitored behaviors of the virtual machine during processing of the object. Thereafter, a threat index is determined based, at least in part, on a combination of at least one attribute of the first set of attributes and at least one attribute of the second set of attributes. The threat index represents a probability of maliciousness associated with the object.
Owner:FIREEYE SECURITY HLDG US LLC

Electronic tag data filtering method used for radio frequency identification middleware

InactiveCN101515334AAdd filter functionRich sourcesCo-operative working arrangementsData miningMiddleware
The invention provides an electronic tag data filtering method used for a radio frequency identification middleware. The middleware sequentially filters according to the source, type, coding range, behavior type and entity attribute of an electronic tag. The method comprehensively considers the attribute, behavior information and EPC network information of the tag, greatly reduces the redundant data and realizes the efficient screening of data.
Owner:HUAZHONG UNIV OF SCI & TECH

Web service quality of service (QOS) prediction method based on mixed collaborative filtering

ActiveCN103139310AImprove forecast accuracySolve the problem of overestimation of similarity valueData switching networksRelevant informationWeb service
The invention discloses a Web service quality of service (QOS) prediction method based on mixed collaborative filtering. An existing Web service QOS prediction method is not high in prediction accuracy, and the condition is particularly more severe under the condition of data of QOS are sparse. The method comprises the steps of according to relevant information of users and services, firstly, classifying the users and the services, then, according to various classifications which target users and target services belong to, calculating QOS predication values through various prediction methods, and during a prediction value calculation process, calculating similarity between the users and the services through an improved Euclidean distance method. Compared with a traditional method, the Web service QOS prediction method has higher prediction accuracy of the Web service QOS. Particularly under the condition that the data of the historic QOS are sparse, the prediction accuracy is remarkably improved when the method is compared with the prior art.
Owner:HANGZHOU DIANZI UNIV

Methods and equipment for generating and maintaining web content extraction template

InactiveCN102073654AAutomatic generatedImprove efficiencySpecial data processing applicationsWeb contentData mining
The invention provides methods and equipment for generating and maintaining a web content extraction template. The equipment for generating the web content extraction template comprises an input unit, a weight calculation unit, a maximum alignment relationship calculation unit, a combination unit, a determination unit and a selection unit, wherein the weight calculation unit is configured to calculate weights of nodes of each type in each input tree. The equipment for maintaining the web content extraction template comprises a similarity calculation unit, a statistic calculation unit, a statistic judgment unit and a recalculation unit, wherein the similarity calculation unit calculates a similarity sequence; the statistic calculation unit traverses the similarity sequence by utilizing a window with a predetermined size and calculates statistic in the window; and the statistic judgment unit judges whether the web content extraction template is adapted to the input of a web or not according to the calculated statistic. In the methods and the equipment, the web content extraction template can be automatically generated with high efficiency, and when the web changes to cause the invalidation of the extraction template or reduction in accuracy, the web content extraction template can be automatically rapidly regenerated.
Owner:FUJITSU LTD

Method for determining risk control strategy based on predictive model and related device

ActiveCN109034660AIncreased generation flexibilityImprove reliabilityForecastingResourcesRisk ControlDependability
The embodiment of the application discloses a method and a device for determining a risk control strategy based on a prediction model. The method comprises the following steps: acquiring first user data and determining a target service associated with the first user data; according to the target business type of the target business, the target risk decision-making rule model associated with the target business is determined. A target risk decision rule corresponding to the first user data is determine based on the target risk decision rule model, a target risk decision rule model is obtained based on the sample user data training, and the sample user data comprises a first sample user data corresponding to a first risk decision rule and a second sample user data corresponding to a second risk decision rule; the target risk decision rule model comprises a first sample user data corresponding to a first risk decision rule and a second sample user data corresponding to a second risk decision rule. According to the target risk decision rules, the target risk control strategy is determined. By adopting the embodiment of the present application, the relevance between the risk control strategy determined based on machine learning and the user service data can be enhanced, the reliability of the risk control strategy can be improved, and the applicability is stronger.
Owner:PING AN TECH (SHENZHEN) CO LTD

Method and device for processing network information

ActiveCN103425703AImprove click-through rateIncrease reachSpecial data processing applicationsComputer networkClick-through rate
The invention discloses a method and a device for processing network information. The method includes enabling a content matching module to match corresponding candidate network information for a current user; enabling a network information recommending module to select recommended network information from the candidate network information; enabling a recommendation reason generating module to generate recommendation reason information of the recommended network information; enabling a display module to display the recommended network information and the recommendation reason information for the current user. The method and the device have the advantages that a network information click rate can be increased by the aid of the method and the device, and accordingly a target web page arrival rate can be increased.
Owner:SHENZHEN TENCENT COMP SYST CO LTD

Text recognition method and device, electronic equipment and medium

PendingCN111723575ACharacter and pattern recognitionNatural language data processingText recognitionWord list
The invention discloses a text recognition method and device, electronic equipment and a medium. According to the invention, the method comprises the steps of: performing entity feature recognition onthe target text by using a pre-trained deep learning model to obtain a candidate name entity list, and matching the candidate name entity list with multiple pieces of name information in the enterprise name library one by one to obtain at least one matching result, thereby taking the candidate name entity, higher than the hit matching rate, in the at least one matching result as a name entity obtained by identifying the target text. By applying the technical scheme, name entities possibly existing in the text can be extracted by adopting the deep learning model, a part of entities with identification errors are filtered out by utilizing the filtering word list to serve as candidate company entities, and the candidate companies correspond to the specific enterprise name libraries by meansof the enterprise name libraries and the enterprise entity mapping tables. Therefore, the problem that the efficiency of extracting the effective name entity from the text is very low in the prior artis avoided.
Owner:HANGZHOU WEIMING XINKE TECH CO LTD +1

Industrial fault analysis expert system based on knowledge graph

InactiveCN110705710AEnhance expressive abilityProcessing speedKnowledge representationInference methodsFuzzy inferenceEngineering
The invention provides an industrial fault analysis expert system based on a Knowledge Graph. The industrial fault analysis expert system based on the Knowledge Graph comprises the following steps that expert experience knowledge is obtained and preprocessed to eliminate anomalies and complement missing values; knowledge data fuzzification is carried out, and a knowledge graph is constructed in the field of industrial fault analysis by applying a semantic net technology and a graph calculation engine according to the knowledge data fuzzification. The simplification and high efficiency of a knowledge graph structure are ensured through a graph dynamic self-optimization process, and the calculation efficiency is improved; industrial faults are diagnosed and analyzed based on a fuzzy reasoning method, and data or equipment abnormity is efficiently and accurately judged.
Owner:CHINA UNIV OF PETROLEUM (EAST CHINA)

Data-mining-based method of fault-collecting early warning system

InactiveCN106503439APrevent wrong dischargeImprove troubleshooting efficiencyForecastingSpecial data processing applicationsEarly warning systemData stream
The invention relates to a data-mining-based method of a fault-collecting early warning system. The method includes the steps that accumulated fault data is processed, and a classifier is built; real-time data flow is processed through the classifier, and fault types are analyzed; an expert knowledge base is built; according to comparison of the fault types and the expert knowledge base, the result is obtained, a feedback mechanism is built in cooperation with feedback of maintenance persons, and the feedback result and the fault data are applied to building of the classifier. The method is intelligent and rapid, the running fault types can be accurately found, the technical support is provided for maintaining for collected faults, and the fault solving efficiency is improved.
Owner:STATE GRID FUJIAN ELECTRIC POWER CO LTD +4

Real-time demand prediction method and device and electronic device

ActiveCN111612122APrediction is accurateForecastingNeural architecturesData packEngineering
The invention provides a real-time demand prediction method and device, and an electronic device. The method comprises the steps of receiving a demand prediction request of a client; wherein the demand prediction request carries a target time interval and a target position identifier, and the target position identifier comprises at least one sub-position identifier; reading target historical datacorresponding to the target time interval and the target position identifier from a preset offline database; wherein the target historical data comprises demanded quantities in different time intervals corresponding to the sub-position identifiers; inputting the target historical data into a demand quantity prediction model corresponding to the target position identifier to obtain a predicted demand quantity of each sub-position identifier in the target time interval; wherein the demand prediction model is generated by training a plurality of models including a graph convolutional neural network. According to the method, the prediction request of the user can be responded in real time, and an accurate demand prediction result is predicted through the demand prediction model trained by theplurality of models including the graph convolutional neural network.
Owner:BEIJING DIDI INFINITY TECH & DEV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products