Hacker Newsnew | past | comments | ask | show | jobs | submit | jcrawfordor's commentslogin

Very odd. Besides the Turbotax-esque over length loading process, every point I could find (in the Southwestern US) was a seemingly random location with no other details. Some of them just, like, the side of a road. I think the whole database was generated by LLM and it's just useless junk.

I've checked a few locations I know, and it looks like this project imported points from various sources, (locations check out) but they completely left out the metadata.

This project looks useless. No curation (left as an exercise for the viewer I guess), crappy navigation (the OpenStreetMap tile server is not being called correctly leading to blocked tiles), and just… an aggregate of everything they could get their hands on.


Also they don't show the necessary OpenStreetMap attribution anywhere...

TSA does more than just the security checkpoints, even at airports with privatized screening TSA does all of the back-office work, including some on-site staff. The physical screening is the only thing that can be contracted out, not the whole rest of the process like maintenance of risk databases.

What might confuse things a bit is that this incident happened hours before ICE agents started reinforcing TSA at checkpoints and seems mostly unrelated, other than establishing the general principle that ICE will arrest people at airports based on tips from TSA's flight booking records.


USPS doesn't care; each ZIP code has a single preferred city name and a list of acceptable alternate city names to account for cases in which a ZIP code spans multiple cities. However, USPS's address validation will prefer to use the preferred city name for the ZIP regardless of whether the recipient actually lives within the boundaries of that city. That's because USPS has opted to organize addressing entirely around the ZIP codes, and other political boundaries are irrelevant except in cases of problems interpreting the address.

This does mean that you might autofill a city name that is "wrong" in the view of the person completing the address form, but much of the bulk mail they receive probably uses that city name anyway.

Technically speaking ZIP codes are not "supposed" to span states but, in exceptional cases, some do. In this case USPS handles it the same way: the state of the preferred city is the preferred state for the ZIP code.

The preferred city is almost always the location of the post office serving the ZIP, which makes this situation fairly intuitive. You can find some interesting edge cases where a post office in located in a suburb city, resulting in a ZIP that includes part of a major city having the suburb as its preferred city name.

You can look up the city name and alternates for a ZIP here: https://tools.usps.com/zip-code-lookup.htm?citybyzipcode and the Domestic Mailing Manual covers this, although it's scattered across several sections and mostly part of how the City State database (the database used for validating city and state names in addresses) works.


I agree using the preferred city name works just fine for USPS, though maybe not for UPS/Fedex.

What I want to know is: Why isn't this preferred city+state mapping dataset for zip codes publicly available from USPS? It would be like 40kb of data for the entire thing. Why is this not public domain from the US Government?

Edit: or is this what I'm looking for (the "Physical City", "Physical State" columns? https://postalpro.usps.com/ZIP_Locale_Detail

It's missing 00501 at least (which zippopotam has), and military zip codes (which zippopotam doesn't have). Military zip codes are included in this file: https://postalpro.usps.com/areadist_ZIP5

Also fun fact 88888 is for "Operation Santa" uspsoperationsanta.com, which zippopotam is missing, but appears in the areadist_ZIP5 file.


Unfortunately these USPS datasets are not public because USPS sells them. Or in some cases, the pattern tends to be that USPS has a contract with a provider (part of what I call the Postal Industrial Complex) that maintains the database and then sells it to both USPS and everyone else. Since these databases are used primarily by bulk mail services, they're fairly expensive and represent an important revenue source to USPS. Remember that USPS is semi-privatized, so they're looking for fees they can charge like everyone else... especially fees that can be changed more easily than postage rates.

That said, the ZIP DB is indeed not very large, so you can find copies of it. You won't generally find complete copies of the City State file but I wouldn't be surprised if there is one out there.


That's not nearly as good as Canada's H0H 0H0.


Which thankfully Zippopotamus supports: https://api.zippopotam.us/CA/H0H


> Technically speaking ZIP codes are not "supposed" to span states but, in exceptional cases, some do. In this case USPS handles it the same way: the state of the preferred city is the preferred state for the ZIP code.

I've heard this before but it raises a million questions for me and I don't understand how this doesn't cause massive systematic problems and headaches in practice. Are residents even usually well-aware what city they live in, versus what's on their postal address? I sure as heck have always assumed whatever my mailing address says is the city I live in; I can't imagine a ton of people questioning it.

Doesn't this mean a ton of citizens would be registering for the wrong state's elections? Do the election officials always catch these? What about businesses - don't they constantly pay taxes incorrectly if the address is written incorrectly? What about laws (say privacy, wiretapping/call recording, etc.) where people make assumptions based on the city and state - what if they're wrong because the written city isn't the actual city? Who's criminally liable then?? Does every business have to perform a jurisdiction lookup to make sure an address isn't misleading?


My dad had an address in Morgantown, Indiana, and the fact that he lived several miles south, over the county line and past antother small town, always made it pretty clear to me that he didn't live in Morgantown.

Likewise, if you live in another state, there's little confusion because state lines appear on maps and are well marked on all major roads.

Businesses and individuals are responsible for knowing which state they reside in and paying the appropriate taxes, regardless of where their mail is sorted.

As for elections, electoral districts don't generally align with city limits in the first place, so this has to be sorted out by the election registration system based on the full address in most cases anyway.

As for what city name appears in legal documents, the answer is that "preferred" doesn't mean "mandatory". A warrant to search the Indianapolis Motor Speedway, 4790 W 16th St, Speedway IN, 46222, would be perfectly valid, despite the fact that the USPS prefers mail to be addressed to "Indianapolis" rather than "Speedway". For there to be any possibility of confusion, you'd need to have two distinct locations, whose identical addresses share a ZIP code, and differ only by city name, which for obvious reasons the postal service will not allow.


Thanks for explaining! That's a great perspective/experience that helps me understand it better.

Re: legal documents: there are less severe cases than warrants that I would expect to trip a lot of people up, which is what I'm wondering more about. Example:

ZIP code 97635 is apparently in both Oregon and California. The post office for both sides of the border appears to be in New Pine Creek, OR 97635. So if someone provides you with that address, you would quite naturally assume they're in Oregon. And you might collect taxes, record calls, or sell their personal data based on that. But whoops! Turns out they're actually in California, so you just broke a bunch of privacy laws on top of missing tax payments to California, in some cases potentially risking criminal penalties.

Wouldn't this trip a lot of people up, especially for smaller entities? I can understand multi-billion-dollar businesses having this all handled correctly, but for individuals or smaller businesses, wouldn't it completely throw them off and potentially subject them to criminal penalties? (How) would people deal with this?


First, you're responsible for knowing where you live. Historically, people who lived in more challenging areas geographically often did not have regular postal addresses at all. You would just have a box number in the nearest town or a rural route stop number, and these obviously didn't reflect the legalities of where you lived. In our modern world, USPS has adopted a policy of 100% physical addressing, meaning that all properties now have a "real" address even if the number part is scaled from mileposts (as is the case in rural areas). Still, I think people who live in areas where any of this is less than obvious understand the nuance that how USPS handles addresses is not necessarily the same as how the voter registration clerk handles them.

Still, it is rarely a problem in practice, because anyone relying on addresses to establish these legal details will have to look at where the address is actually located---not just the city written in it. Keep in mind that quite a few people live in ZIPs where they could write multiple city names in their address.

When it comes to the unusual case of spanning states, it might help to note that the City State Database the postal service uses to validate addresses does not actually differentiate between city and state. "NEW YORK NY" is a single string. The state is really just part of the city name. The fact that USPS implemented it this way indicates the extent to which it does not matter in operational reality.

A sibling comment points this out, but it might also help to explain that in the US, it is very common for people to have mailing addresses in cities they do not actually live in. That's because of suburbs. City lines are often surprisingly arbitrary and reflect complex political histories. Many people consider themselves to live in [major city] but, in legal actuality, live in [unincorporated county that contains major city]. Many of the upsides of living in the city, sans some of the property taxes and voting in city elections! Yet another reason that people understand that mailing addresses are not definitive reflections of political boundaries.

If you ever work as an election clerk you will find this a LOT---people indignant that they cannot vote for the mayor, to whom you will have to explain, somehow for the first time, that they do not actually live in city limits. This tends to be more obvious if you get a property tax bill but a lot of people are renters and never really think about that aspect.


But addresses aren’t just for sending mail. Location also determines which municipal and state laws apply, so there are contexts where the distinction matters.


I would just like to point out that the city field doesn’t necessarily prove anything because many unincorporated areas have a listed a city but may not be subject to the laws or taxes of a municipality. So having the correct city isn’t as useful as one thinks it is.


I used to live in just such a place. Went to the city center to apply for a library card, thinking "of course I can get a card here, I live in Foo City and this is the Foo City Public Library." I was asked for my street address, and she pulled out a binder of street names to check (yes it was analog, in the year 2016 A.D.). I was not within the city limits and was denied a card.


same thing happened to me last year, except at a brush drop-off rather than a library; analog binder and all!


I live in a zip code that spans two cities and I live in the unincorporated area between them, but with one of the cities in my preferred address. So at least two of the exceptions listed in this thread apply.


This is accurate. For a scenario with a possibility of litigation you must ultimately geocode the address with google maps API or census geocoder, point in polygon against district boundaries (geopandas or shapely), then pass the result through a rules table keyed on jurisdiction + case type.


It’s more that the municipal “geofence” encompasses a certain area, and all addresses that fall within that space belong to that municipality. I.e. the address doesn’t determine the location, it just happens to be located somewhere.


These things shouldn’t be based on the zip code.


I've been peripherally involved in an early stages effort to build something similar for the entire nation (https://reportapayphone.com/), and became aware of this just recently. It's a really great example to aspire to, in terms of the level of polish. I do find the time limiting odd; our goal is to identify as many payphones as possible this way.

Unfortunately the state of payphone-related records is extremely poor, with many ostensibly-active PSPs having quietly gone out of business, other PSPs reorganized without reregistering, and states themselves keeping PSP records very poorly. Throw in small-scale COCOT operations and the result is that there really isn't any authoritative database of possible payphones, so this website's map is going to be missing some. It will also include many that are nonfunctional, as today's PSPs seem to do close to zero maintenance and out of service phones stay that way for years.

Some of the nation's largest PSPs have become ghosts, with the phones still operating and able to accept payment, but the PSP completely unresponsive to efforts to contact them. It's a very strange afterlife.


I don't know that there is much technology to sell, palm vein imaging is decades old in the access control industry. The reason you don't see it anywhere is because it was already a commercial failure in that application, by the end of the 1990s.

Amazon was even trying to sell the technology for access control applications, but their sales material were remarkably devoid of any reason to choose it over other biometrics.


Out of curiosity, why did it fail in that market?


Biometrics were a very crowded market during the 1980s and 1990s when it was a newer idea and electronics were starting to make things practical. Lots of ideas were tossed around before the industry pretty well consolidated on fingerprints with a side of iris imaging and hand geometry in some more security-sensitive niches. It mostly came down to cost: fingerprint scanners, even before the modern capacitative type, came down in price much faster than other types of imaging (visible rather than IR sensors, glass platen allowed for fixed focus, etc). The widespread use of fingerprint comparison in criminal forensics also mean that there's an older and stronger academic literature on fingerprint comparison, whereas other types of biometric sensors often involve proprietary match algorithms and you have to rely on the vendor's assertions about reliability.

Of course everything around cameras has come down in cost tremendously since then, so palm imaging is probably reasonably priced now, but it lacks a clear enough advantage over better-established methods for anyone to switch over. Besides, just the fact that you have to position your palm the way you do makes it difficult to install them in most practical door situations. Fingerprint sensors turn out to be very compact and fairly intuitive to use.

I scoured Amazon's sales materials around Amazon One very closely, because I found it fascinating that they were seemingly trying to revive the technique. I was surprised they were doing it as a payment device, but it made more sense when I found materials (I think old FCC filings) that suggested that it was originally designed as an access control product and perhaps "pivoted" to payments later. The strangest thing about it though was how unconvincing the sales materials were, it felt like they were really grasping at straws for a reason to select it over other options.

From what I could find it doesn't appear to have been an acquisition; the regulatory paperwork was all filed under some LLC but it seemed to just be a front company for Amazon which is fairly common for that kind of thing. So my best guess is that it was a pet project of someone influential enough to burn some R&D on it, and maybe pivoting to payments and putting them in Whole Foods was thought to maybe be the hail Mary that would turn it into a real business.

The actual integration with the PoS in the stores was clumsy too, they Velcro'd an NFC antenna to the side of the credit card terminal to use to make payments by proxy card. I originally got obsessed with it because I was trying to ID the suspicious device Velcro'd to the payment terminals at Whole Foods!


Good data is hard to come by, but from a brief survey electronic precinct tabulation (the most common system in the US) is also in at least partial use in Canada, Mexico, India, the Phillipines, and Russia, and a laundry list of smaller countries.

Now, you might contend that this is not a list of first-world countries exactly (but rather I sampled the largest countries). You must keep in mind that the use of electronic tabulation in the United States is mostly a response to the very limited budget on which elections are carried out; electronic tabulation is much less expensive than significantly increasing staffing. As a result, globally, electronic tabulation tends to be most common in poorer countries or countries with newer election systems, while hand tabulation is most common in wealthier countries with long-established election procedures.

For this reason, the countries you might go to for comparison (like France and Germany) have largely manual election processes that have often seen few changes since the Second World War.

The Help America Vote Act (2002) had a de facto effect of making the United States a country with much newer election processes, as HAVA requires strict accessibility measures that most European election systems do not meet (e.g. unassisted voting for blind and deaf people). Most US election systems didn't meet them either, in 2002, so almost the entire country had to design new election processes over a fairly short span of time and on a shoestring budget. Understandably, election administrators leaned on automation to make that possible.

It's also important to understand that because of the US tradition of special-purpose mill levies and elected independent boards (like school boards), the average US ballot has significantly more questions than the average European ballot. This further increases the cost and complexity of hand tabulation, even ruling out entirely the "optimized" hand tabulation methods used in France and Ireland.


This is the system used in the majority of the United States. Direct-recording electronic voting systems were never that common, briefly peaked after the Help America Vote Act as the least expensive option to meet accessibility requirements, and have become less common since then as many election administrators have switched to either prectinct tabulators or direct-recording with voter-verified paper audit trail.

In the 2026 election, only 1.3% of voters were registered in jurisdictions that use direct-recording electronic machines without a voter verifiable paper audit trail (https://verifiedvoting.org/verifier/#mode/navigate/map/voteE...). 67.8% of voters are registered in precincts that primarily use hand-marked ballots, and the balance mostly use BMDs to generate premarked ballots.


There have been diesel electric surface ships as well going back to WWII, although it hasn't proven a very popular design and they remain an oddity.


To be quite honest I've never seen a colo that didn't offer access at all. The cheapest locations may require a prearranged escort because they don't have any way to restrict access on the floors, but by the time you get to 1/4 rack scale you should expect 24/7 access as standard.


Same. We would colo and had racks behind chain link fencing that was locked behind cipher locks


It's very confusing in the text of the article, at times it sounds like the author is using heuristic methods (like timings) but at times it sounds like they somehow have access to network traffic from the provider's backend. I could 100% believe that a ton of these companies are making API calls to providers directly from an SPA, but the flow diagrams in the article seem to specifically rule that out as an explanation.

I might allow them more credit if the article wasn't in such an obviously LLM-written style. I've seen a few cases like this, now, where it seems like someone did some very modest technical investigation or even none at all and then prompted an LLM to write a whole article based on it. It comes out like this... a whole lot of bullet points and numbered lists, breathless language about the implications, but on repeated close readings you can't tell what they actually did.

It's unfortunate that, if this author really did collect this data, their choice to have an LLM write the article and in the process obscure the details has completely undermined their credibility.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: