Tell HN: YC companies scrape GitHub activity, send spam emails to users

474 points | by miki123211 9 hours ago

43 comments

  • martinwoodward 6 hours ago
    Martin from GitHub here. This type of behaviour is explicitly against the GitHub terms of service, when we catch the accounts doing this we can (and do) take action against those accounts including banning the accounts. It's a game of whack-a-mole for sure, and it's not just start-ups that take part in this sketchy behaviour to be honest. I've been plenty of examples in my time across the board.

    The fundamental nature of Git makes this pretty easy for folks to scrape data from open source repositories. It's against our terms of service and those folks might want to talk with some lawyers about doing it - but as every Git commit contains your name and email address in the commit data it's not technically difficult even if it is unethical.

    From the early days we've added features to help users anonymise their email addresses for commits posted to GitHub. Basically, you configure your local Git client to use your 'no-reply' email address in commits and that still links back to your GitHub account when you push: https://docs.github.com/en/account-and-profile/reference/ema...

    I think that's still probably the best route. We want to keep open source data as open as possible, so I don't think locking down API's etc is the right route. We do throttle API requests and scraping traffic, but then again there have been plenty of posts here over the years from people annoyed at hitting those limits so it's definitely a balancing act. Love to know what folks here think though.

    • retlehs 3 hours ago
      I’ve made over five reports for this exact spam scenario, and never once have y’all acted on them. I have a hard time believing you ban spam accounts that clearly violate your ToS.

      I even wrote about a specific example of a YC company spamming me from my GitHub email at https://benword.com/dont-tolerate-unsolicited-spam

      • eli 3 hours ago
        How would you know whether the account that did the scraping was banned?
        • retlehs 3 hours ago
          By visiting the account and noticing that it still has activity long after the report.
          • hedora 48 minutes ago
            How do you propose GH take action without risking taking down legitimate projects due to brigades of false reports?
            • shimman 33 minutes ago
              That they use some of their trillion dollar marketshare to solve it, why are you acting like this is a hard problem? It's not. They're just too cheap and greedy to do anything about it.
              • cortesoft 13 minutes ago
                Trillion dollar marketshare? How big do you think GitHub is?
                • mardef 5 minutes ago
                  GitHub is wholly owned by Microsoft, which has a 3 trillion market cap
    • koito17 5 hours ago
      I don't have any specific suggestions, but I do want to give thanks for implementing functionality to block pushes if the email field is *not* using an anonymized mail address.

      It's one thing to offer anonymous e-mail addresses, but it's also awesome that GitHub can help prevent mistakes that would otherwise leak a user's e-mail address. I am not sure how many people try to be privacy conscious on GitHub, but I assume most users don't, so it's nice seeing this little feature exist.

    • blobbers 49 minutes ago
      Scrape once, spam forever.

      I think it's pretty clear you need to use an anonymization scheme in the way commits are handled so that it links back to your github account and the email addresses are kept private.

      Privacy centric companies like Apple do this for users offering hashed emails, on a per login basis.

      I'm sure this would not work in a world of scraping, but having that kind of ability to figure out bad actors would be nice. You could require authenticated users for certain kinds of requests, and block user information from non-authenticated requests.

    • ayhanfuat 6 hours ago
      I am also getting constant spam because apparently they can see who starred a repo (i.e. I see you starred repo x and we are doing something similar). I am not starring anything anymore.
    • skwashd 4 hours ago
      I know it is against the ToS. I've reported multiple organisations doing this. Last time I reported one, support closed the ticket saying the activity is off platform so they can't do anything.
    • danesparza 4 hours ago
      I didn't realize this was against the Github TOS - I just thought it was par for the course for recruiters nowadays. This is good to know!

      How do I report that person, though? Your support page about reporting abuse assumes I know the person's Github account: https://docs.github.com/en/communities/maintaining-your-safe...

    • dent9 59 minutes ago
      Amazon did this to me. Their recruiters started hounding me at an email address that I only ever used to sign git commits on some repos used on GitHub. When I asked them how they got my email address they said "it was in [our] database"
    • AznHisoka 6 hours ago
      Maybe I am missing something, but can’t you simply not show the email address in a git commit? (Sincere question, not saying this is trivial. i am dumb and like to ask dumb questions even if might be embarassing)

      If someone wants to message someone, it goes through github notifications or github emails them

      Also banning an account doesnt seem like a heavy punishment, given they can simply move to gitlab, bitbucket etc

      • EdNutting 6 hours ago
        That would be a fundamental change to how Git works, not just GitHub. Even if the web UI didn't show it, a simple `git log` would reveal it.

        You can mask your email address in git commits but a lot of open source projects won't accept that. And some pseudo-open-source ones insist on sending you an email to authenticate before they'll give you access to the GitHub repo (looking at you Unreal Engine!)

        So, no, I don't think they could simply "not show the email address".

        • sheept 2 hours ago
          fyi, you can also see the author email by appending ".patch" to the end of a commit URL
        • AznHisoka 6 hours ago
          Makes sens! Appreciate the explanation!
      • easton 6 hours ago
        Git commits have a email address as a required field[0], although some people put something bogus in there. And then it's in the data provided when you clone the repo onto your machine even if you aren't using the GitHub APIs.

        To his point, you can set that to the no-reply email address GitHub gives you if you don't want mail but do want the commit to be linked to your GitHub account.

        [0]: https://git-scm.com/docs/git-commit#_commit_information

      • miki123211 53 minutes ago
        Git commits are identified by a hash of their entire contents[1]. The way hashes work, if you change even one bit, the hash becomes completely different. Every commit contains the email address of the committer and the hash of the parent commit. If the email address in even one commit is changed or removed, that changes its hash, which in turn requires you to update its children, changing their hashes etc. So, updating a commit from n years ago requires you to update all commits that have been made since. By default, git will refuse to pull from such an updated repository, as commits are considered immutable once pushed.

        [1] In practice, it's a bit more complicated. Merkle trees are involved, so it's hashes of hashes of hashes instead of hashing a multi-gigabyte blob on each commit, but that's a performance optimization that doesn't affect semantics much.

      • dent9 57 minutes ago
        You should be using the email address "username@no.reply.github.com" or similar

        There's never been an obligation to use a real email address for git

    • miki123211 58 minutes ago
      I've raised this as ticket ID 4114793, just in case.
    • TheSaifurRahman 4 hours ago
      Are no-reply emails associated with the accounts if the username is changed? That's one reason why I switched back to my personal email.
    • ericol 5 hours ago
      I've had more than a few instances of this over the past 2 years, and my reply is exactly the above.

      "What you are doing is against Github's TOS"

    • blibble 18 minutes ago
      > it's not technically difficult even if it is unethical.

      kettle, pot, black?

      I received the following offical spam last week from GitHub:

      > Build AI agents with the new GitHub Copilot SDK

      despite never granting consent for marketing material

      (and yes, there's a GDPR complaint now working its way through the national regulator)

    • trympet 4 hours ago
      Nice, thank you Martin. How do you punish the fraudsters? Do you send them to prison over CFAA violation terms of service?
      • martinwoodward 3 hours ago
        I kinda wish I had that much power. There would certainly be less people in the world listening to their phones without headphones..

        Usually starts with contacting them over email reminding them of the terms of service and warning them to stop. Then their account might get deactivated and they need to write and promise to not be naughty again. If they ignore that then the account gets removed.

        There are a bunch of automated checks that are running all the time as well and will take automated action that then gets later reviewed by humans. At lot of times the process is fast-tracked.

        The off-platform 'let's scrape a bunch of data and then spam nice people' is the hardest to police. Linking those mails to an offending GitHub account is hard and very manual, also anyone can send emails saying they are someone they are not and because of that anyone can deny they sent the mail and they'll usually blame a rogue agency they where working with etc.

        I probably shouldn't say it, but the public shame that comes from being mentioned on social, in hacker news etc. That stops people who want to be treated as legitimate from doing that sort of thing and helps educate the wider community around what is and isn't acceptable behaviour - that is why it's good to see this thread and see the issue getting attention.

        • trympet 1 hour ago
          Love the transparency - someone should make you VP of ..uhm dev rel or something! I was being quite hyperbolic in my original comment, however, I _do_ think you are doing the right thing, and you are definitely not the bad guy.

          Having said that, there are big corps who have been known to use the CFAA as a way to coerce the long arm of the law upon teenagers and geeks hacking away - not always a great thing either IMO.

      • nerdsniper 3 hours ago
        > CFAA violation terms of service

        This would be a gross miscarriage of justice and bringing successful action under this theory would do widespread harm by expanding the definition of the CFAA.

        Just because a company can take some nuclear action, doesn't mean they should.

      • skeptic_ai 4 hours ago
        Will send a strong email: Don’t do bad things.
    • moomoo11 2 hours ago
      Ban them. Honestly I get the same and it is beyond frustrating.

      I will pay more for GitHub if you go hard on these mfs.

    • observationist 2 hours ago
      Hey, Martin - https://github.com/lucidrains

      Mind fixing lucidrains account? Something happened without notice or recourse. He's one of, if not the most well known open source AI researchers on the planet, with implementations and explanations of papers and ideas that are wonderful. If you could bring some sanity to that situation and take it out of whatever kafkaesque account purgatory it fell into, you'd be doing the work of angels.

      Thanks!

      • davnn 2 hours ago
        What was happening with this account? I was often seeing popular but empty (only title of the paper and maybe a short readme) repositories that were created directly after a paper was published?
        • observationist 1 hour ago
          Just part of the process - he'd queue up the projects as interesting things came in, then plow through. Usually he'd have a rough framework within a day or two, and then a working proof of concept within a week, and then return to the most promising, useful, or interesting projects.
  • scottydelta 6 hours ago
    YC is a proud investor in Flock, what YC Ethics thing are you talking about?
    • wslh 1 hour ago
      And, Gecko Security.
    • cassonmars 6 hours ago
      And Cluely
      • tasn 6 hours ago
        Cluely is not YC.
        • fantasizr 1 hour ago
          he might be thinking of chadIDE "the first brainrot ide"
    • ls-a 6 hours ago
      [flagged]
      • shrubble 5 hours ago
        How would that even be legal? (Although I can't find such a startup with any kind of search engine)
        • akerl_ 5 hours ago
          Why would it be illegal?
          • john_strinlai 4 hours ago
            i am not sure of anywhere it is illegal.

            but areas i am familiar with can consider a negative reference to be defamation, thus anyone providing a negative reference should only do so if they are able to defend it (i.e. prove their statement is substantially true, or prove that the statement was honestly believed to be true and published with no malice or reckless disregard).

            seems risky, at least, to build a whole business around negative references that could potentially cross the line into defamation. but that type of thinking is probably why i am not rich.

            • nerdsniper 4 hours ago
              There are many definitions of illegal (criminal, civil, regulatory, the much much looser “license to operate” as used in chemical industry, etc).

              A blacklist seems dubious. I’d advise the founders to get counsel on their obligations under the FCRA, which they may be construed to be regulated by.

              That said, I believe "Bad News" is an AI hallucination. The most similar company I can find historical news is "Peeple"[0], which was not funded by YC. YCombinator's only known association with a blacklist that I can find was a blacklist of VC's that were accused of harassing female founders[1].

              0: https://archive.is/r9UQo

              1: https://archive.is/17Ans

              • john_strinlai 3 hours ago
                >There are many definitions of illegal (criminal, civil, regulatory, the much much looser “license to operate” as used in chemical industry, etc).

                yes, but i am not sure why this matters here. i am not aware of negative references, in general, being illegal under any of those definitions of illegal.

                no one would say regular speech is illegal just because it can be subject to a defamation lawsuit. same logic.

                but i agree, if it is a real business, it seems exceptionally risky.

                • nerdsniper 3 hours ago
                  https://www.law.cornell.edu/uscode/text/15/1681d

                  It's more than just "subject to a defamation lawsuit" (including class action lawsuits). Although for me, even if it were "just that", I'd still call it "potentially illegal". Rather, they'd potentially face FTC penalties and CFPB enforcement actions under 15 U.S.C. section 1681d(a), (b).

                  This law would likely classify such a company as falling under laws pertaining to "investigative consumer reports" under FCRA. This is any report on someone's "character, general reputation, personal characteristics, and mode of living" used for the purposes of employment, loans, housing, etc.

                  > A consumer reporting agency shall not prepare or furnish an investigative consumer report on a consumer that contains information that is adverse to the interest of the consumer and that is obtained through a personal interview with a neighbor, friend, or associate of the consumer or with another person with whom the consumer is acquainted or who has knowledge of such item of information, unless—

                  > (A) the agency has followed reasonable procedures to obtain confirmation of the information, from an additional source that has independent and direct knowledge of the information; or

                  > (B) the person interviewed is the best possible source of the information.

                  They'd find themselves subject to legal penalties under:

                  FCRA Willful Noncompliance (15 U.S. Code § 1681n) (if they did not disclose their existence/use/content of reports to employment candidates)

                  FCRA Negligent Noncompliance (15 U.S. Code § 1681o) (if they made somewhat reasonable but insufficient efforts to comply with the FCRA)

                  or

                  Administrative Enforcement (15 U.S. Code § 1681s)

                  and be subject to fines up to $4,700 per violation plus actual damages, plus punitive damages, plus legal fees. State Attorneys General can also bring FCRA lawsuits on behalf of their constituents, not just the federal government. FTC / CFPB can name the founders individually in the lawsuits, not just their corporate entity, and ban[1][2] them from operating any similar businesses in the future.

                  That all said, to some extent, YCombinator partners are on the record[3] supporting the idea of their startups sometimes doing illegal things. Generally they'll frame this as challenging outdated regulations, but they acknowledge that the founders whose strategies they fully support sometimes come into office hours and discuss how they're worried that the strategy puts them at risk of going to jail.

                  0: https://www.law.cornell.edu/uscode/text/15/1681d

                  1: FTC v MyLife.com, Inc., and Jeffrey Tinsley (CEO): https://www.ftc.gov/news-events/news/press-releases/2021/12/...

                  2: https://www.ftc.gov/legal-library/browse/cases-proceedings/b...

                  3: https://www.youtube.com/watch?v=Hm-ZIiwiN1o&t=8m46s

                  • john_strinlai 3 hours ago
                    ah, okay. so the hypothetical company may potentially be doing something illegal (the "investigative consumer report" part). good to know! that makes sense, and i was unaware of that.

                    i stand corrected in the hypothetical "bad reference aggregator company" scenario.

                    >YCombinator partners are on the record[3] supporting the idea of their startups sometimes doing illegal things.

                    interesting, thanks for surfacing that up! i wont pretend to be surprised, though.

            • drcongo 4 hours ago
              It's definitely illegal in the UK.
              • john_strinlai 4 hours ago
                i dont believe that it is illegal to provide a negative reference in the UK, as long as it is honest, factual, and provided in good faith.

                from gov.uk:

                >"If you think you’ve been given an unfair or misleading reference, you may be able to claim damages in court. Your previous employer must be able to back up the reference, such as by supplying examples of warning letters.

                You must be able to show that:

                - it’s misleading or inaccurate

                -you ‘suffered a loss’ – for example, the withdrawal of a job offer"

                which means, if the reference is not misleading and not inaccurate, a negative reference is ok. other uk-based law firms (from a quick google) agree with this interpretation.

                • laserlight 3 hours ago
                  Providing a negative reference is totally different than gathering negative references and selling them. The former could be legal while the latter could be illegal.
                  • john_strinlai 3 hours ago
                    for sure!

                    in my comment, i was speaking more generally than i should have, and that (obviously, in hindsight) caused some confusion between the specific case of the hypothetical company, and the general case of an employer providing a negative reference. my bad -- and it is too late to edit to provide clarification.

                    • drcongo 2 hours ago
                      No problem, I wasn't very clear either! I remember someone I know looking into this in the early 2000s as part of a wider collective thing. It's long enough ago that I can't remember the details but it was definitely less about a poor reference and more about the individuals' being on a list somewhere without having even applied for a job. And come to think of it, it's probably even more illegal now because of GDPR.
      • k33n 5 hours ago
        I can't find any website for it. Are you sure it's not just some posting category on Bookface, YC's internal social network?
        • vunderba 3 hours ago
          Same. While it doesn't help that their name is about as generic as it gets, I searched across Kagi, Google, etc. and couldn't find any such YC company.

          That being said, it wouldn't entirely surprise me if somebody's tried to start the tech equivalent of the casino "Black Book".

          https://en.wikipedia.org/wiki/Black_Book_(gambling)

        • ls-a 5 hours ago
          [flagged]
          • tptacek 4 hours ago
            GPT:

            There are some mentions online of a Y Combinator startup called Bad News, but nothing official or well-documented shows up in public YC lists or press — at least as of the latest searchable sources.

            The only place it’s referenced is in a Hacker News thread where someone claimed there was a YC company whose product was a blacklist of employees so other startups wouldn’t hire them, and they said the name was Bad News. But people in that thread couldn’t find any evidence of it, and there aren’t real search results tying that name to an official YC company on YC’s site, their startup directory, or mainstream reports.

          • BigTTYGothGF 4 hours ago
            > According to chatgpt

            Oh come on.

            • ls-a 4 hours ago
              [flagged]
              • weird-eye-issue 3 hours ago
                Ask it about what it might have hallucinated to help it hallucinate more?
          • hypeatei 3 hours ago
            Why are you obfuscating so much and telling people to use ChatGPT? How hard would it be to paste what they renamed to and/or the founders' names?
            • ls-a 3 hours ago
              [flagged]
  • keiferski 6 hours ago
    I've spent a lot of my career marketing to developers, and spamming their GitHub account might be top 1 or 2 worst marketing tactics you can use.

    Cold emailing rarely works by itself. Cold emailing developers via emails you pulled from their GitHub accounts? At that point, you're actively harming your brand, and may as well just send them spam diet pill ads.

    • RandallBrown 1 minute ago
      If someone took the time to look through my GitHub contributions then pitched me with a job relevant to that work I would absolutely consider them. That's exactly the kind of recruiter I would like to work with.

      If it's obviously just a bot scraping emails and sending generic job requests, that's very different.

  • elwebmaster 1 hour ago
    Just got a SPAM email from a Github scraper while reading this thread:

    From: james@techglobal.website Quick note – your GitHub profile Hi X,

    I came across your profile on GitHub. Given you're based in the US, I thought it might be relevant to reach out.

    Profile:

    I run a technical team (full-stack, cloud, DevOps) that delivers for clients. We're looking to work with an engineer based in the US on client-facing coordination—discovery, requirements, alignment—while we handle delivery. If that might be relevant, I'd be glad to set up a short call.

    Regards, James

    If I had to guess, "James" is a North Korean looking to scam US clients, based on my experience with shady actors.

    • max__dev 1 hour ago
      Checked my spam after seeing this thread and found the same sender/email. Subject and signature are slightly changed.

      From: james@techglobal.website Brief note – Following up on your GitHub work

      Hi ,

      I came across your profile on GitHub. Given you're based in the US, I thought it might be relevant to reach out.

      Profile:

      I run a technical team (full-stack, cloud, DevOps) that delivers for clients. We're looking to work with an engineer based in the US on client-facing coordination—discovery, requirements, alignment—while we handle delivery. If that might be relevant, I'd be glad to set up a short call.

      Best, James

  • unfunco 4 hours ago
    I also had unsolicited spam from Vincent Jiang of Aden, another YC company.

        Hi Daniel,
    
        I just came across your profile on social media and wondered if you'd be interested in joining our Discord community for AI agent development. Currently, we see that agents break, loop, get lost, hallucinate, and cost a fortune, and therefore built a space where developers can share challenges and insights.
    • unfunco 2 hours ago
      …and more from Backdrop.

          Hi Daniel, I found your GitHub profile while searching for anthropic projects, and got your email from your profile.
      
          I'm part of an online program for builders called Backdrop Build, and I think that program would be a great fit given what you are building. We have a track for builders in AI like you, it's fully online/remote and costs nothing to participate. It also works if you have a day job, it's light on time and perfect for side projects!
      
      And then another after I marked the first one as spam and ignored it.

          Checking in one last time to see if you have any questions about the program or the application. If it's not for you, all good - just ignore the email because I won't be pinging you again :)
      
         Joey from Backdrop
      
      Both companies have guaranteed that I won't use their services nor procure them for any organisation I work for.
      • agmater 1 hour ago
        Hey it's Joey checking in again. We noticed you mentioned our company, let me know if you have any questions about our (free!) program. I'll go ahead and email you some more info, just in case.
    • foldr 3 hours ago
      I had a similar one from that guy asking me to make open source PRs to some repo of theirs for, err, $25-50/hour. I replied explaining that senior software engineers in the UK aren’t quite as desperately poor as that, and got a canned response saying that they were looking forward to reviewing my PRs :D
  • cyann 4 hours ago
    Got this spam today on my GitHub address, YC affiliated:

    From: henry@joincactuscompute.com

    Hey,

    I hope all is well with you, just reaching out as you seem to be interested in on-device speech models.

    Cactus is a low-latency AI engine for consumer devices like phones, Macs, wearables, Raspberry Pis, etc.

    We support transcription models like Whisper & Parakeet, benchmarks available in the attached GitHub repo.

    GitHub: https://github.com/cactus-compute/cactus

    We are keen to get your feedback, and star if feeling generous.

    Thanks a million

    • ignoramous 2 hours ago
      > star if feeling generous ... Thanks a million

      A 419 scam?

  • an0malous 17 minutes ago
    Ever wonder why YC has the "Describe a time you most successfully hacked some system to your advantage" question? It's because they select for founders that are willing to take advantage of legal gray areas. Airbnb repeatedly violated Craigslist terms of service and called it "growth hacking." Reddit stole content from Digg and faked users. OpenAI trains their models on copyrighted content.
  • armchairhacker 8 hours ago
    I remember this being discussed a while ago

    https://news.ycombinator.com/item?id=9332418 (11 years ago)

    https://news.ycombinator.com/item?id=20660624 (7 years ago)

    https://news.ycombinator.com/item?id=27855152 (5 years ago)

    https://news.ycombinator.com/item?id=30900237 (4 years ago)

    Seems it’s a reoccurring issue

  • pmdr 14 minutes ago
    People here assume that YC is some kind of ethics benchmark for business. It's not.
  • callamdelaney 3 hours ago
    YC is basically advising their startups to engage in shitty business practices, like trying to hire UK staff for half the salary and expecting 7 day weeks.
  • coffeecoders 1 hour ago
    For me, its those Who's hiring or Who wants to get hired posts. I used a throwaway email once and got emails about SEO and AI projects.

    I don’t engage. I mark as spam, block the sender/domain, and move on.

  • kristoff_it 7 hours ago
    I have received over the years so much spam of this kind by multiple YC-funded companies that I now reflexively send to spam any email that mentions being YC-funded, regardless of how legitimate the email is.
    • AznHisoka 6 hours ago
      Same here, having YC attached to your name is not the flex you think it is, its even the opposite for me
      • ryandrake 1 hour ago
        Their brand has been associated with hacking-around and gaining advantage via rule breaking for a while. Didn't their founder application at one point ask "Tell us about a time where you hacked some system for your advantage?" At this point, I think everyone knows they're signing up for dark patterns and questionable practices when they get involved.
        • nerdsniper 1 hour ago
          It still does.

          > Please tell us about a time you most successfully hacked some (non-computer) system to your advantage:

          I suspect it can be an excellent barometer of someone's:

          - alignment in terms of pro-social vs. anti-social

          - decision making under desperation

          - "social filter": threading the line between 'interesting'/'compelling' vs. 'off-putting'/'concerning'

          which are important signals for evaluating potential future C-suite executives.

        • Goronmon 1 hour ago
          Their brand has been associated with hacking-around and gaining advantage via rule breaking for a while.

          Yup, this type of behavior is pretty much as I would expect and it's something I've seen since I first started posting here.

    • neya 7 hours ago
      I don't blame you, the FOMO is real to the point even basic ChatGPT wrappers are getting funded these days, I guess.
      • jvwww 5 hours ago
        I'm always interested to understand - what constitutes a basic ChatGPT wrapper? Is Legora, which is doing very well, a basic ChatGPT wrapper? Because if you don't view it as one, it certainly started as one.
  • neya 8 hours ago
    This is atleast fine as it's just spam, I got pulled into an actual scam and it never made it to the frontpage.

    https://news.ycombinator.com/item?id=45357205

    • medi8r 6 hours ago
      But that is someone pretending to be YC which is sort of less interesting than a YC company doing something bad. Because phishers imitate legit companies all the time. Easy to get roped in and I sympathise, anyone is suseptable (today I almost clicked the phishing training email as it looked urgent and pushed the right buttons)
    • ChrisMarshallNY 8 hours ago
      Looks like GH nuked it, though.

      Hope they didn’t get too many folks.

    • nubinetwork 7 hours ago
      That's a little creepier than the time I got an email from someone trying to push a new crypto coin to me because I contributed to OSS.
  • dewey 7 hours ago
    This happens all the time, not really surprised as the GitHub API makes it pretty easy to extract valuable leads with real and confirmed email addresses.
    • progbits 2 hours ago
      I don't like this way of putting it, it's good the github API makes this easy as that makes it an useful. Should not try to imply this should be restricted just because of some bad actors. It's just going to annoy legit users and the bad ones will scrape anyway.
      • dewey 2 hours ago
        I'm just stating a fact, not implying anything. It's the good old saying with the sharp knife, it can be used for good and bad.
        • progbits 1 hour ago
          Ok sorry I guess I read too much into it.
    • tommoor 7 hours ago
      Yea, been going on at least a decade
  • c16 7 hours ago
    Email address privacy is a feature offered by Github and replaces your day to day email: https://docs.github.com/en/account-and-profile/how-tos/email...
  • WhatsName 7 hours ago
    Doesn't YC have some code of conduct or legal/ethical guidelines? I would assume a legal and compliance department would have some major headache if documented cases of misconduct jeopardize later due diligence. I would not fund or aquire a company on the radar of national regulatory bodies for something as stupid as this.
    • Goronmon 1 hour ago
      Doesn't YC have some code of conduct or legal/ethical guidelines?

      Regardless of any claims of having this, I would say this behavior aligns with what I have seen over the last couple decades. I'm more surprised that other people would expect anything different?

    • stevekemp 1 hour ago
      It's not "spam", it's a "growth hack".
    • mbesto 2 hours ago
      > Doesn't YC have some code of conduct or legal/ethical guidelines?

      Sorry but lol you must be new here.

    • whalesalad 3 hours ago
      When you are a team of 3 people eating ramen there is no legal or ethical compliance department.
    • buellerbueller 5 hours ago
      Imagine thinking in 2026 that an American tech company has ethics.
      • haute_cuisine 4 hours ago
        Only free individual can have strong ethics. There are no free people in capitalism, money is debt after all. Think of applied pressure once you sign under VC money and amount of brainwashing / gaslighting. I sincerely hope my observation is wrong.
        • buellerbueller 4 hours ago
          If you are going to go down that road: life is debt, and there is no true freedom. We are bound by the needs of our meat-containers, after all.

          I don't like unfettered capitalism, but when I consider economies that have existed over time, it certainly looks like constrained capitalism affords the most freedom.

    • thinkingtoilet 5 hours ago
      Like every other VC firm, the only thing they care about is money. They can pretend to morals, but they will never sacrifice one for the other in any meaningful way.
  • ttul 3 hours ago
    Didn't AirBnB famously spam people in the Bay Area as a "guerilla tactic" to build the business in its early days? This kind of fast and loose behaviour seems standard.
  • oefrha 2 hours ago
    Yes, startups, recruiting platforms, and students/“researchers” with stupid surveys for their worthless “research” spam me all the time by scraping the email from GitHub. I immediately trash the first two categories; I send a sternly-worded reply to the third category.
  • b8 1 hour ago
    Boundaries don't exist really in tech and especially with emails. I just filter out spam and block a good bit. People just ignore stuff now a days even people saying hi passing someone in the street (which I stopped doing)? My colleges spam filter catches a lot of them. Your email is presumably already in data dumps.
  • scosman 4 hours ago
    I’m also getting “saw you on GitHub” spam from voice.ai

    And they are using a different domain for the emails so the spam markers don’t hit their primary domain.

  • 6thbit 2 hours ago
    I wish github could ammend the email of my commits to the private noreply address during push so they _never_ have any other email associated to them. May not be feasible due to the commit changing, confusing local branch and such?

    They have this other thing where they reject pushes for the 'known' emails you've told them you have, but kinda seems there should be a setting to do that for any email that is not your noreply private one. is that a feasible thing to ask for?

    • cperciva 2 hours ago
      If you change the email address, you change the commit hash. And yes, suddenly your local branches are orphaned.

      Of course, there's nothing stopping you from using a git-only email address (nospam-6thbit@yourdomain) and routing that to /dev/null. GitHub can't change email addresses, but you can.

    • arcfour 2 hours ago
      They literally have a setting to block pushes with any email other than the noreply one, lol.
  • theturtletalks 6 hours ago
    General advice would be to mark the email as spam or junk and hopefully their email platform penalizes them, but this has been working less and less. Email has truly become pay to play now.
    • suyash 6 hours ago
      That's exactly what I've been doing with solicitation emails, reporting as SPAM on gmail.
  • ttoinou 2 hours ago
    Couldn’t github replace all public commits author info email by a username@author.github.com email automagically ?
    • jonathanlydall 2 hours ago
      You can’t change anything about a commit without breaking the chain of SHA hashes in the commits, which causes pulls to break.

      GitHub hides the emails on their web UI, but nothing stops people from pulling the repository with a Git client and looking at the emails in the commit log after doing so.

      • VorpalWay 2 hours ago
        Which is why you should be careful to never use your actual email in git commits.

        When I made a patch to the Linux kernel I did have to use a real email, since you have to send to their mailing list. I used a throwaway email for it, which I have since edited on my mail server config to forward to /dev/null (yes, I'm one of the weirdos still self hosting email in 2026). The amount of spam I got was insane, and not even developer relevant spam.

        • miki123211 45 minutes ago
          This makes me wonder how the Linux kernel git system deals with GDPR data deletion requests. Are they even legally allowed to deny them?
    • arcfour 2 hours ago
      You have to configure your own Git client manually. But you can configure GitHub to block pushes from any email other than the no reply email GH generates for you.
  • EdNutting 6 hours ago
    My solution to this is to use a Github-specific email address. All emails sent to that address which do not originate from GitHub are immediately reported as spam, marked read and deleted.

    I sometimes use different git/GitHub addresses depending on who I'm working for or specific projects so I can more accurately detect where data is being scraped from.

    • EdNutting 6 hours ago
      N.B. Using service-specific emails is trivial - you don't need separate email accounts. Just use email aliases, e.g. "john.smith+github@gmail.com" -- which is an alias called "github" for "john.smith@gmail.com"
      • input_sh 6 hours ago
        A simple regex filter will get rid of that. Now, if you use your own domain and have it configured as a catch-all, then you could do github@domain.tld.
        • EdNutting 3 hours ago
          I'm not saying I do this but if I were as smart as I think I am I would have given a Gmail example rather than the example you've given to avoid bots just looking up my website and starting to bypass my setup... ;) ;) ;)

          Also, spammers generally don't seem to be going to the effort to apply regex filters to the data they've scraped...

          • VorpalWay 2 hours ago
            I self host email, and I have never gotten spam to any email "constructed" from the domain, other than random attempts to things like "accounting@domain.tld" etc.

            But the email I used to interact with the Linux kernel mailing list I had to null route after a while, it got so much spam. I used a throwaway for just that purpose of course, so no big deal.

      • gus_massa 6 hours ago
        Don't spammers have an automatic filter to cleanup that?
        • EdNutting 5 hours ago
          You'd have thought so, but no, in my experience this works very well. People doing this kind of spamming don't seem to be particularly bright, nor do they seem to spend any time/effort to clean up their scraped database.
  • malmeloo 1 hour ago
    Oh I'm getting so tired of this. Lately there appears to have been an uptick in this kind of marketing spam too, there's so many companies trying to advertise their AI products this way. At least it's a good indicator of which companies I should avoid at all costs, and it provides me with an email address I can use to direct my angry emotions towards.

    They're getting more aggressive at it too. Just yesterday I received an email from Alignerr (not YC affiliated I think) saying that my sign-up was complete and cheerfully welcoming me to their platform. I had never even heard of them. An automated "job opportunity!" email didn't arrive until 3 hours later, but by then I had already directed some angry words towards their support email.

    Other, even less respectable projects are also regularly enrolling my GitHub projects into their platforms, and I have to actively reach out to them to remove it.

    I'm so tired of this man. Can someone go and take away these organizations' ability to send emails?

  • ChrisMarshallNY 8 hours ago
    I’m not especially bothered by this [yet -AI is likely to make this worse]. It’s a fairly insignificant component of my spam catcher. At least, it’s a bit focused.

    Every day, I get deluged with hundreds of spam and scam emails, often because some knucklehead entered my email in a form (either accidentally, or as a throwaway red herring).

    • Maxious 7 hours ago
      Sure but these YC spammers are identifiable and have much more to lose https://www.ycombinator.com/ethics/

      > Some examples of ethical behavior we expect from founders are:

      > - Not spamming members of the community

      > To maintain our community, if we determine (in our sole discretion) that a founder has behaved unethically during or after YC, we will revoke their YC founder status. This includes access to all Y Combinator spaces, software, lists and events. All founders in a company may be held responsible for the unethical actions of a single co-founder or a company employee, depending on the circumstances.

      • RobotToaster 7 hours ago
        Has this ever actually been enforced?
      • ChrisMarshallNY 6 hours ago
        > > - Not spamming members of the community

        Ah... but there's the rub.

        Define "the community."

        Do random GH accounts count as "members of the YC community"?

        Sorry, but unsolicited contact, much as I hates, HATESSSS it, is a classic component of any business, and has been, for many decades. I don't think it would be appropriate for a business organization to prohibit its members from engaging in "cold calling," of which, UCE is really an example.

        Using the YC branding/name, however, is a different matter.

  • lordgrenville 6 hours ago
    Maybe a dumb question, but isn't this trivially solved with this .gitconfig?

        [user]
             name = lordgrenville
             email = <some_kind_of_id>+lordgrenville@users.noreply.github.com
    • edelbitter 2 hours ago
      For commits you author.

      Kernel guidelines now have a more verbose section about tagging: https://www.kernel.org/doc/html/latest/process/submitting-pa...

    • darknavi 5 hours ago
      Sure, as long as you want to rewrite all of the history of all of your public repositories.
      • lordgrenville 5 hours ago
        Oh yeah, I have always had this as it was pretty clear to me that the info in the email field is public.
    • ktm5j 3 hours ago
      Perhaps, but it doesn't change the fact that this is bad behavior for the company sending the email. Since YCombinator funded this company it makes sense that YC would want to know about how they are conducting business.
    • haute_cuisine 4 hours ago
      Not all projects are hosted at github. You also might want to receve relevant mail from fellow developers.
      • lordgrenville 3 hours ago
        Fair point. Pretty sure there is a way to have a few .gitconfig files, with the active one based on the remote URL domain, but it is more work.
  • pscanf 8 hours ago
    I was also spammed (twice) by voice.ai.

    You mention GDPR, which also "applies" to me, though I wonder if what they're doing is actually illegal. I mean, after all, I'm putting my email on GitHub precisely to give people a way to contact me.

    Of course, I do that naïvely, assuming good faith, not expecting _companies_ to use it to spam me. So definitely what they're doing is, at the very least, in poor taste.

    • notpushkin 6 hours ago
      > I'm putting my email on GitHub precisely to give people a way to contact me.

      They’re not only looking at the public email in your profile, they’re also looking at your committer email (git config user.email). You could argue that you’re not putting that out for people to contact you.

      (I’ve used that trick a couple times to reach out to people, too, but never mass emailing.)

    • victorbjorklund 7 hours ago
      They spammed me as well.
    • zvqcMMV6Zcr 7 hours ago
      Is there any company that will take my money to solve GDPR issues? And by solve I mean sue the spammers? For last few years I saw they "try" to look legit, by claiming addresses are managed by some Hungarian/Spanish shell company, hoping no one will be able to afford pursuing infractions over borders.
      • miki123211 35 minutes ago
        This is hard, because private right of action in Europe is often very limited, and the damages are low.

        THe US basically has a "private police force" for certain laws, notably the ADA. Many people are against this, I personally think it's a great idea and something countries should be doing a lot more of of.

      • RobotToaster 7 hours ago
        There's probably a law against it, but I've always thought a legal company could make decent money taking cases like this in bulk for free, on the condition that they get to keep all the compensation, while the "client" still gets the satisfaction of punishing the offending party.
        • rationalist 5 hours ago
          On the U.S., only Attorneys General can go after violators of the CAN-SPAM Act.

          It needs to be modified like how individuals can go after telemarketers.

        • notpushkin 6 hours ago
          That’s pretty much class action lawsuits!
      • KomoD 7 hours ago
        > Is there any company that will take my money to solve GDPR issues? And by solve I mean sue the spammers?

        A lawyer

  • ting0 2 hours ago
    Change your email to something like: myemail+gh@mail.com (the "+gh" tag). You can put any tag/word there, and if you get spam from a company you'll be able to identify that it came from them scraping your GH. Then you can report it with certainty.
    • buellerbueller 2 hours ago
      you can also autofilter that tag to route to spam
  • bakugo 7 hours ago
    This sounded familiar, so I checked my inbox and I did indeed receive a similar email from sanchitmonga@runanywheresdk.com earlier this month:

    > I came across your GitHub profile and thought you might be interested in what my team and I are building. We're developing an open source SDK that runs LLMs directly on-device.

    What's even more interesting is that both buildrunanywhere.org and runanywheresdk.com show a stock hostinger parking page when accessed in a browser. Something tells me they're intentionally registering these "alternate" domains specifically for spam, to avoid tanking the email reputation of their main runanywhere.ai domain.

    I guess I shouldn't be surprised given YC is going all in on AI and most AI companies are no better than the crypto scammers of yesteryear, but still.

    • Imustaskforhelp 5 hours ago
      I observed the same thing and it was only when you told me the main domain that I found their website.

      > Something tells me they're intentionally registering these "alternate" domains specifically for spam, to avoid tanking the email reputation of their main runanywhere.ai domain

      This is a really bad look on them.

      https://www.whatsmydns.net/domain-age?q=buildrunanywhere.org and https://www.whatsmydns.net/domain-age?q=runanywheresdk.com

      Both these domain were registered only 36 days ago

      Their main domain had been around for 6 month (216 days) tho:- https://www.whatsmydns.net/domain-age?q=runanywhere.ai

      (I also couldn't see any post created by them on YC checking algolia from their website fwiw)

      Seeing their star history on their product, I see some few interesting observations[0] Their star history was almost horizontal between december and february until it got vertical all of a sudden.

      [0]:https://www.star-history.com/#runanywhere.ai/runanywhere.ai&...

      I looked through their linkedin and found this website owned by them as well https://www.openclawpi.com/ and using the YC brand here as well. (registerered 26 days ago)

      This website looks fairly AI generated to me as well and there are some bugs within the original website as well which I am now incredibly more unsure of if generated by AI or not given the similarities between the two websites UI/UX as well.

  • j16sdiz 6 hours ago
    Over many years, I have got email from university for survey / research.

    This is not GitHub only, I have got a survey on how my experience interacting with folks on lkml

  • rlaabs 7 hours ago
    I've received the exact same email from the same company.
  • jacquesm 4 hours ago
    Sometimes they also scrape HN profiles, it is most irritating.
  • outloudvi 7 hours ago
    I usually check the "Received" header and report to the email service provider. Once in a while I receive a response saying the case is properly handled.

    These providers are the only ones that care about their reputation and thus may take some action. Investors? Nope.

    • john_strinlai 4 hours ago
      the problem is that the emails arent typically sent from the main domain.

      in this example, the email came from buildrunanywhere.org, which is just a parked domain. the real domain is runanywhere.ai, which they arent using for spam.

      so, once buildrunanywhere.org has their reputation burned from reports, they will simply register buildrunanywheres.org and start spamming again.

  • axegon_ 6 hours ago
    I've received several similar ones over the years. At this point, if I get an email from someone I don't know and it contains a link, chances are it's spam. I genuinely doubt github(or any other company for that matter) would do something about it. While I fully support GDPR, the truth is, few people are willing to take action knowing how much bureaucracy would be involved...
    • dagi3d 5 hours ago
      > how much bureaucracy would be involved... it varies from country to country, but filling a complaint on that matter is usually quite straightforward
  • rodrigodlu 6 hours ago
    I did receive these kinds of emails as well.

    And I use a different email fromy priority email for GitHub commits since 4 years ago.

    So just stop with marketing slop please.

    Yes, I work with AI, and I'm becoming pretty good at it.

    But this doesn't mean I'm comfortable pushing AI slop into potential users and customers.

    I (and they) want to use AI to facilitate their processes, not to ingest slop content.

  • hmokiguess 3 hours ago
    HN and YC walk a thin line between hacker culture and venture capitalist culture. I know it’s easy to think that because HN comes from YC them too are aligned with hacker culture, but no. YC is all cutthroat business.
  • idoxer 4 hours ago
    I also received this shitty email 3 days ago
  • nprateem 6 hours ago
    There's no reason to put your real email in git config unless you're signing, in which case repos should be private. I would have thought that was obvious.
  • koakuma-chan 7 hours ago
    I have been having the same experience. If you starred a GitHub repo, and they think that their product is similar, they will send you their spam. I condemn this! They should be ashamed!
    • lyu07282 5 hours ago
      After 25 years on the internet dealing with spam, it would never even occur to me to invest the energy to write a letter to the offending companies investor. But more power to them I'd say!
  • atfzl 8 hours ago
    [flagged]
    • speedgoose 8 hours ago
      Why would you promote spam?
    • bilekas 7 hours ago
      This is some next level spam posting. Not sure to be annoyed or impressed.
    • RobotToaster 7 hours ago
      I feel like spam is somewhat less offensive when it's for FOSS, assuming it isn't some faux FOSS freemium scam. It's about the only spam I wouldn't mind getting.
  • ValentineC 8 hours ago
    > These emails indicate that those companies scrape people's Github activity, and if they notice users contributing to repos in their field of business, send marketing emails to those users without receiving their consent. My guess is that they use commit metadata for this purpose.

    There are likely marketing email datasets floating around the internet that contain email addresses scraped from commit metadata.

    I use a catchall with a specific Git client (not GitHub) email address, and found spam and phishing emails being sent there quite a few times.

    • input_sh 7 hours ago
      May not necessarily be from commit messages, there's at least one way simpler way: simply adding .gpg to the end of any user URL will return that user's public GPG key.