added emails from throwawaymail.com #437

TravisLittlechilds · 2023-12-20T18:25:40Z

haumacher · 2024-02-19T18:17:08Z

Hi @TravisLittlechilds, I'm wondering how you found all these domains, since throwawaymail.com does not let the user select a domain from a list and has strong protection against automated queries. However, when I visited the site, I got an e-mail address with the domain "mybx.site", which is not in your list...

TravisLittlechilds · 2024-02-19T18:55:15Z

@haumacher it was an annoying process over a few weeks consisting of many sessions with incognito windows. I didn't detect any kind of pattern in the domains it handed out, so I'm not surprised I didn't manage to find them all.

haumacher · 2024-02-19T19:16:23Z

@TravisLittlechilds I'm still wondering (a) how to automatically decide whether a PR to this list is valid and (b) what a good heuristic would look like for deciding whether a new e-mail domain seen in some user registration form is disposable or not (given this list of "well-known" disposable domains).

There are several services that claim to decide whether an e-mail domain is disposable or not. One of them (https://check-mail.org/) seems to take the DNS MX record associated with the domain into account.

This "check" applied to the domains you entered gives the following result:

No MX record:

em4.catchservers.com
mx4.catchservers.net

Mail server 164.90.194.37

jual.me
seoph.website

All other domains have the mail servers 165.22.201.68 and 137.184.154.224

Maybe the maintainers of this list could give some more insights?

martenson · 2024-02-20T21:42:58Z

to my knowledge a valid MX record is not needed for receiving mail

https://datatracker.ietf.org/doc/html/rfc5321#section-5

we've had some discussions about this topic at #84 and #58
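
To illustrate the fallback described in the linked RFC section, here is a minimal sketch of how a sender picks the hosts to deliver to. The `mx_lookup` callable, `fake_lookup`, and the `example` names are assumptions made up for this illustration; a real check would query DNS instead.

```python
from typing import Callable, List, Tuple

# Hypothetical lookup signature: domain -> list of (preference, exchange) pairs,
# empty when the domain has no MX record.
MxLookup = Callable[[str], List[Tuple[int, str]]]


def mail_hosts(domain: str, mx_lookup: MxLookup) -> List[str]:
    """Return the hosts that accept mail for `domain`, lowest preference value first."""
    records = mx_lookup(domain)
    if not records:
        # No MX record: the domain itself is treated as the mail host.
        return [domain.lower()]
    return [host.rstrip(".").lower() for _, host in sorted(records)]


# Stand-in for a real DNS query, using made-up names.
FAKE_MX = {"shop.example": [(20, "backup.mail.example."), (10, "mx.mail.example.")]}


def fake_lookup(domain: str) -> List[Tuple[int, str]]:
    return FAKE_MX.get(domain, [])


print(mail_hosts("shop.example", fake_lookup))
print(mail_hosts("em4.catchservers.com", fake_lookup))
```

With the stand-in lookup, the second call returns the domain itself, which is the "no MX record" case from the check results above.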

haumacher · 2024-02-20T22:03:48Z

OK, not having an MX record means the domain/host is its own mail server. So the absence of an MX record gives no clue as to whether the domain could be a disposable e-mail domain or not.

But having an MX record that resolves to the same IP address as some other "well-known" disposable domain provides some evidence that this domain is also a disposable one, right?

haumacher · 2024-02-20T22:08:45Z

Or the other way around: if a domain has no MX record, but its resolved IP address points to a mail server used by some other "well-known" disposable domain, that also provides some evidence that this domain is disposable.

In the example above, em4.catchservers.com points to 137.184.154.224, which is the same as one of the mail servers of all other domains. mx4.catchservers.net points to 165.22.201.68, which is the other mail server of all other domains entered in this PR.
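
As a rough sketch of this check, using only the addresses quoted in this thread: the table below is a hard-coded snapshot rather than a live DNS lookup, and treating the two shared addresses as "known disposable" is an assumption of the example, not a verified fact. The names `KNOWN_DISPOSABLE_IPS`, `RESOLVED` and `shares_known_mail_server` are made up for illustration.

```python
# Mail server addresses shared by most domains in this PR (assumed disposable).
KNOWN_DISPOSABLE_IPS = {"165.22.201.68", "137.184.154.224"}

# Snapshot of the addresses reported in this thread, keyed by domain.
RESOLVED = {
    "em4.catchservers.com": {"137.184.154.224"},  # no MX, own address
    "mx4.catchservers.net": {"165.22.201.68"},    # no MX, own address
    "jual.me": {"164.90.194.37"},                 # mail server address
    "seoph.website": {"164.90.194.37"},           # mail server address
}


def shares_known_mail_server(domain, resolved=RESOLVED, known=KNOWN_DISPOSABLE_IPS):
    """True if any address recorded for `domain` is in the known set."""
    return bool(resolved.get(domain, set()) & known)


for name in RESOLVED:
    print(name, shares_known_mail_server(name))
```

Under these assumptions the two catchservers hosts match, while jual.me and seoph.website do not, because their mail server address differs from the two shared ones.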

martenson · 2024-02-23T23:59:46Z

@haumacher your reasoning seems plausible to me. However, I do not have deep knowledge of the intricacies that come with mailing systems.

Are you proposing to make something like a disposable_ip_blocklist?

haumacher · 2024-02-25T12:34:28Z

@martenson No, I don't think a block-list of "disposable" IP addresses would be a good solution, because this data is too volatile. However, a dynamically built classification of mail server IP addresses could help decide whether a newly discovered e-mail domain is likely to be disposable.

Look at the domains added in this PR. Nobody can manually verify that they really belong to the fake-mail service mentioned in the PR. The service does not offer a list of domains a user can select from, and repeatedly querying the service (from the same IP?) requires solving strong CAPTCHAs to get another e-mail address. Therefore, even automating the lookup process as proposed in #450 does not seem to be a feasible solution.

What I'm thinking about is a database of "well-known" fake-mail services and the "well-known" fake-mail domains associated with those services. The question is how to classify a new e-mail domain based on the MX records and IP addresses of its associated mail servers. If a reasonable heuristic can be found, it would be sufficient to manually manage a list of well-known fake-mail providers (web sites) and some examples of their fake-mail domains (as training data), which can easily be collected manually. New domains could then be classified automatically...
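
One possible shape for such a database, as a sketch only: provider names, domains and addresses below are invented (reserved `example` names and documentation IP ranges), and `resolve` stands in for whatever MX/IP lookup would actually be used. The index is rebuilt from the example domains on each run, so it follows address changes instead of freezing them in a block-list.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Set

Resolver = Callable[[str], Set[str]]  # domain -> mail server IP addresses


def build_ip_index(providers: Dict[str, Iterable[str]], resolve: Resolver) -> Dict[str, Set[str]]:
    """Map each mail server IP to the providers whose example domains use it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for provider, domains in providers.items():
        for domain in domains:
            for ip in resolve(domain):
                index[ip].add(provider)
    return dict(index)


def classify(domain: str, index: Dict[str, Set[str]], resolve: Resolver) -> Set[str]:
    """Return the providers sharing a mail server IP with `domain` (empty if none)."""
    matches: Set[str] = set()
    for ip in resolve(domain):
        matches |= index.get(ip, set())
    return matches


# Invented training data and a static resolver for demonstration.
PROVIDERS = {"fakemail-a.example": ["a1.example", "a2.example"],
             "fakemail-b.example": ["b1.example"]}
ADDRESSES = {"a1.example": {"192.0.2.10"}, "a2.example": {"192.0.2.11"},
             "b1.example": {"198.51.100.5"},
             "new-domain.example": {"192.0.2.11"},
             "ordinary.example": {"203.0.113.7"}}


def static_resolve(domain: str) -> Set[str]:
    return ADDRESSES.get(domain, set())


index = build_ip_index(PROVIDERS, static_resolve)
print(classify("new-domain.example", index, static_resolve))
print(classify("ordinary.example", index, static_resolve))
```

A match is only evidence, not proof: shared hosting or a large mail provider could put unrelated domains behind the same address, so a real heuristic would probably need a way to exclude such servers.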

martenson · 2024-02-26T19:59:16Z

@haumacher Sounds like a honking good idea to me. Would you care to outline this approach in a new issue?

haumacher · 2024-03-03T11:11:34Z

@martenson I opened issue #456 with initial thoughts for such heuristics.

added emails from throwawaymail.com

e239855
