-
公开(公告)号:US20170359362A1
公开(公告)日:2017-12-14
申请号:US15365008
申请日:2016-11-30
Applicant: Microsoft Technology Licensing, LLC
Inventor: Ori Kashi , Philip Newman , Daniel Alon , Elad Yom-Tov , Hani Neuvirth , Royi Ronen
Abstract: In an example embodiment, a computer-implemented method comprises obtaining labels from messages associated with an email service provider, wherein the labels indicate for each message IP how many spam and non-spam messages have been received; obtaining network data features from a cloud service provider; providing the labels and network data features to a machine learning application; generating a prediction model representing an algorithm for determining whether a particular set of network data features are spam or not; applying the prediction model to network data features for an unlabeled message; and generating an output of the prediction model indicating a likelihood that the unlabeled message is spam.
-
公开(公告)号:US09398034B2
公开(公告)日:2016-07-19
申请号:US14135247
申请日:2013-12-19
Applicant: Microsoft Technology Licensing, LLC
Inventor: Royi Ronen , Shay Kels , Elad Ziklik , Efim Hudis , Corina Feuerstein , Tomer Brand
CPC classification number: H04L63/1416 , G06F21/56
Abstract: Disclosed herein is a system and method for automatically identifying potential malware files or benign files in files that are not known to be malware. Vector distances for select features of the files are compared to vectors both known malware files and benign files. Based on the distance measures a malware score is obtained for the unknown file. If the malware score exceeds a threshold a researcher may be notified of the potential malware, or the file may be automatically classified as malware if the score is significantly high.
Abstract translation: 本文公开了一种用于自动识别不知道是恶意软件的文件中的潜在恶意软件文件或良性文件的系统和方法。 将文件的特征的矢量距离与已知恶意软件文件和良性文件的向量进行比较。 根据距离测量,为未知文件获取恶意软件得分。 如果恶意软件得分超过阈值,可能会向研究人员通知潜在的恶意软件,否则如果分数显着较高,则文件可能会自动分类为恶意软件。
-
公开(公告)号:US12165631B2
公开(公告)日:2024-12-10
申请号:US17735663
申请日:2022-05-03
Applicant: Microsoft Technology Licensing, LLC
Inventor: Abedelkader Asi , Royi Ronen , Roy Eisenstadt , Dean Geckt
Abstract: A method of generating keyword-based dialogue summaries is provided. The method includes inputting a transcript of an audio conversation and a keyword into a machine learning model trained based on encodings representing the keyword and the transcript, generating computer-generated text different from and semantically descriptive of the transcript and semantically associated with the keyword, and outputting the computer-generated text in association with a selectable item selectable for inclusion of the computer-generated text in displayed text representing the transcript, the selectable item associated with the keyword.
-
公开(公告)号:US11768961B2
公开(公告)日:2023-09-26
申请号:US17513158
申请日:2021-10-28
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yun-Cheng Ju , Ashwarya Poddar , Royi Ronen , Oron Nir , Ami Turgman , Andreas Stolcke , Edan Hauon
IPC: G06F21/62 , G06F40/295 , G10L15/26 , G10L17/00 , G10L15/22
CPC classification number: G06F21/6254 , G06F40/295 , G10L15/26 , G10L17/00 , G10L2015/228
Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.
-
35.
公开(公告)号:US11630958B2
公开(公告)日:2023-04-18
申请号:US17336881
申请日:2021-06-02
Applicant: Microsoft Technology Licensing, LLC
Inventor: Royi Ronen , Yarin Kuper , Tomer Rosenthal , Abedelkader Asi , Erez Altus , Rona Shaanan
IPC: G06F40/30 , G06F40/166 , G06F40/117 , G06F40/284 , G06N20/00 , G10L15/26 , G10L15/22 , G06F16/34 , G06F40/279
Abstract: The disclosure herein describes determining topics of communication transcripts using trained summarization models. A first communication transcript associated with a first communication is obtained and divided into a first set of communication segments. A first set of topic descriptions is generated based on the first set of communication segments by analyzing each communication segment of the first set of communication segments with a generative language model. A summarization model is trained using the first set of communication segments and associated first set of topic descriptions as training data. The trained summarization model is then applied to a second communication transcript and, based on applying the trained summarization model to the second communication transcript, a second set of topic descriptions of the second communication transcript is generated. By training the summarization model based on output of the generative language model, it enables efficient, accurate generation of topic descriptions from communication transcripts.
-
公开(公告)号:US11182504B2
公开(公告)日:2021-11-23
申请号:US16397738
申请日:2019-04-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yun-Cheng Ju , Ashwarya Poddar , Royi Ronen , Oron Nir , Ami Turgman , Andreas Stolcke , Edan Hauon
IPC: G06F21/62 , G06F40/295 , G10L15/26 , G10L17/00 , G10L15/22
Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.
-
公开(公告)号:US11126651B2
公开(公告)日:2021-09-21
申请号:US16229040
申请日:2018-12-21
Applicant: Microsoft Technology Licensing, LLC
Inventor: Neta Haiby-Weiss , Amir Pinchas , Hanan Lavy , Yitzhak Tzahi Weisfeld , Yair Snir , Royi Ronen
IPC: G06F16/53 , G06F16/50 , G06Q50/00 , G06F16/9535 , G06F16/901
Abstract: Data from social networking applications and other applications that can be used to communicate are combined for a user to generate a graph of the various relationships that the user has with other users in the social networking applications and other applications. In addition, the behaviors of each user with respect to communicating through the various social networking applications and other applications are monitored to generate task data that describes user preferences for communicating using each social networking application or other application for different tasks. At a later time, when a user is looking to connect with another user for an indicated task such as networking, the graph can be used to recommend paths to other users in the various social networking applications and other applications, and the generated task data can be used to rank the recommended paths based on the indicated task.
-
公开(公告)号:US11062706B2
公开(公告)日:2021-07-13
申请号:US16397745
申请日:2019-04-29
Applicant: Microsoft Technology Licensing, LLC
Inventor: Yun-Cheng Ju , Ashwarya Poddar , Royi Ronen , Oron Nir , Ami Turgman , Andreas Stolcke , Edan Hauon
IPC: G10L15/22 , G10L15/26 , G10L21/028 , G10L17/00
Abstract: Methods for speaker role determination and scrubbing identifying information are performed by systems and devices. In speaker role determination, data from an audio or text file is divided into respective portions related to speaking parties. Characteristics classifying the portions of the data for speaking party roles are identified in the portions to generate data sets from the portions corresponding to the speaking party roles and to assign speaking party roles for the data sets. For scrubbing identifying information in data, audio data for speaking parties is processed using speech recognition to generate a text-based representation. Text associated with identifying information is determined based on a set of key words/phrases, and a portion of the text-based representation that includes a part of the text is identified. A segment of audio data that corresponds to the identified portion is replaced with different audio data, and the portion is replaced with different text.
-
公开(公告)号:US10594711B2
公开(公告)日:2020-03-17
申请号:US15362076
申请日:2016-11-28
Applicant: Microsoft Technology Licensing, LLC.
Inventor: Roy Levin , Royi Ronen
Abstract: A method and device for detecting botnets in a cloud-computing infrastructure are provided. The method includes gathering data feeds over a predefined detection time window to produce a detection dataset, wherein the detection dataset includes at least security events and a first set of bot-labels related to the activity of each of at least one virtual machine in the cloud-computing infrastructure during the detection time window; generating, using the detection dataset, a features vector for each of a plurality of virtual machines in the cloud-computing infrastructure, wherein the features vector is based on idiosyncratic (iSync) scores related to botnet activity; transmitting each generated features vector to a supervised machine learning decision model to generate a label indicating if each of the plurality of virtual machines is a bot based on the respective features vector; and determining each virtual machine labeled as a bot as being part of a botnet.
-
公开(公告)号:US10534925B2
公开(公告)日:2020-01-14
申请号:US15286558
申请日:2016-10-05
Applicant: Microsoft Technology Licensing, LLC
Inventor: Moshe Israel , Royi Ronen , Daniel Alon , Tomer Teller , Hanan Shteingart
Abstract: Controlling device security includes obtaining a set of device activity data indicating current device activity on a device and a set of user activity data indicating a current activity state of one or more legitimate users of the device. It is determined whether the indicated current activity state of the users indicates that a legitimate user is in an active state on the device, or that none of the legitimate users is in an active state on the device. A statistical fit of the indicated current device activity on the device, with the indicated current activity state of the one or more legitimate users, is determined, by a comparison with at least one of the models that are generated via supervised learning. A security alert action may be initiated, based on a result of the determination of the statistical fit indicating a compromised state of the device.
-
-
-
-
-
-
-
-
-