U
U
Username2020-06-12 16:44:57
Python
Username, 2020-06-12 16:44:57

What section of statistics solves the problem of identification?

Good afternoon, I am engaged in the task of identifying a person as a thesis.

Task: to identify a user (anonymous, authorized) on the site
Introductory: I collect information about the user, such as: geo position, OS version, platform, GMT, hash sum of the canvas area, login if the user is authorized, etag, facebook id
Accordingly, some parameters can change, some parameters tell us exactly what kind of user came to us with a 100% guarantee.
Question: I have a database of already authenticated users with authentication parameters. How can I figure out whether a new user has entered the site, whom we have never identified yet, or a user who visited the site?

What area of ​​statistics will help me solve the problem and tell me which libraries to use in Python?

5ee386a14784c166623166.png

Answer the question

In order to leave comments, you need to log in

2 answer(s)
D
dmshar, 2020-06-12
@dalv_happy

Of course, statistics do not deal with this. The solution to your problem is a typical classification (diagnosing) problem from the field of Machine Learning. And there "according to metsu" - it is necessary to select the most adequate approach - the Nearest Neighbors Method, Decision Trees, neural networks, etc. - as long as there is enough strength, resources and enthusiasm.
Python libraries - Sklearn, Scipy, first of all. Get into the neural network - Tensorflow, Keras, Pytorch

D
Dimonchik, 2020-06-12
@dimonchik2013

simple (and primitive): fingerprint + bloom filter

Didn't find what you were looking for?

Ask your question

Ask a Question

731 491 924 answers to any question