veganism.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Veganism Social is a welcoming space on the internet for vegans to connect and engage with the broader decentralized social media community.

Administered by:

Server stats:

293
active users

#bigquery

1 post1 participant0 posts today
Neil Craig<p>TIL the Tranco domain ranking data is available as a public data source in BigQuery - so you can do e.g.:</p><p>```<br>select<br>domain,<br>rank<br>from `tranco.daily.daily`<br>where date = date_sub(current_date(), interval 1 day)<br>and domain in ("bbc.co.uk", "bbc.com")<br>order by rank asc<br>```<br><a href="https://tranco-list.eu" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">tranco-list.eu</span><span class="invisible"></span></a></p><p><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/Tranco" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Tranco</span></a> <a href="https://mastodon.social/tags/DomainRanking" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DomainRanking</span></a></p>
Clinton<p>Diving into <a href="https://mastodon.social/tags/Vermont" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Vermont</span></a> wildlife for the <a href="https://mastodon.social/tags/30DayChartChallenge" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>30DayChartChallenge</span></a> "circle" day! 🦌 Using <a href="https://mastodon.social/tags/Python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Python</span></a> &amp; <a href="https://mastodon.social/tags/plotly" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>plotly</span></a> to compare monthly <a href="https://mastodon.social/tags/Moose" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Moose</span></a> and <a href="https://mastodon.social/tags/BlackBear" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BlackBear</span></a> sightings 🐻 Data wrangled with <a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> and <a href="https://mastodon.social/tags/SQL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SQL</span></a>. Any guesses which animal is seen more consistently throughout the year? 😜 <a href="https://mastodon.social/tags/DataViz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataViz</span></a> <a href="https://mastodon.social/tags/Wildlife" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Wildlife</span></a> <a href="https://mastodon.social/tags/RadialChart" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>RadialChart</span></a></p>
Hacker News<p>BigQuery pricing model cost us $10k in 22 seconds</p><p><a href="https://www.linkedin.com/posts/yingjun-wu_%F0%9D%90%81%F0%9D%90%A2%F0%9D%90%A0%F0%9D%90%90%F0%9D%90%AE%F0%9D%90%9E%F0%9D%90%AB%F0%9D%90%B2%F0%9D%90%AC-%F0%9D%90%91%F0%9D%90%88%F0%9D%90%83%F0%9D%90%88%F0%9D%90%82%F0%9D%90%94%F0%9D%90%8B%F0%9D%90%8E%F0%9D%90%94-activity-7307736315079405569-F2Ng" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">linkedin.com/posts/yingjun-wu_</span><span class="invisible">%F0%9D%90%81%F0%9D%90%A2%F0%9D%90%A0%F0%9D%90%90%F0%9D%90%AE%F0%9D%90%9E%F0%9D%90%AB%F0%9D%90%B2%F0%9D%90%AC-%F0%9D%90%91%F0%9D%90%88%F0%9D%90%83%F0%9D%90%88%F0%9D%90%82%F0%9D%90%94%F0%9D%90%8B%F0%9D%90%8E%F0%9D%90%94-activity-7307736315079405569-F2Ng</span></a></p><p><a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/Cost" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Cost</span></a> #$10k <a href="https://mastodon.social/tags/DataAnalysis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataAnalysis</span></a> <a href="https://mastodon.social/tags/CloudComputing" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CloudComputing</span></a> <a href="https://mastodon.social/tags/PricingModel" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>PricingModel</span></a> <a href="https://mastodon.social/tags/TechNews" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TechNews</span></a></p>
N-gated Hacker News<p>🍕🎨 "Map of Python" is the digital dabbling of a cartographer, lost in the vast jungle of 500,000+ <a href="https://mastodon.social/tags/Python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Python</span></a> packages, desperately trying to create art from JSON blobs. 🧩🤷‍♂️ But hey, at least there's <a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> to save the day from the horror of actually downloading data! 🚀🧐<br><a href="https://fi-le.net/pypi/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">fi-le.net/pypi/</span><span class="invisible"></span></a> <a href="https://mastodon.social/tags/DataVisualization" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>DataVisualization</span></a> <a href="https://mastodon.social/tags/Cartography" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Cartography</span></a> <a href="https://mastodon.social/tags/ArtInTech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ArtInTech</span></a> <a href="https://mastodon.social/tags/HackerNews" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>HackerNews</span></a> <a href="https://mastodon.social/tags/ngated" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ngated</span></a></p>
Neil Craig<p>FFS. Turns out (after I built a feature) that you can't supply a schema for BigQuery Materialised Views.</p><p>&gt; Error: googleapi: Error 400: Schema field shouldn't be used as input with a materialized view, invalid</p><p>So it's impossible to have column descriptions for MVs? That sucks.</p><p><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a></p>
Neil Craig<p>Whilst migrating our log pipeline to use the BigQuery Storage API &amp; thus end-to-end streaming of data from Storage (GCS) via Eventarc &amp; Cloud Run (read, transform, enrich - NodeJS) to BigQuery, I tested some big files, many times the largest we've ever seen in the wild.</p><p>It runs at just over 3 log lines/rows per millisecond end-to-end (i.e. inc. writing to BigQuery) over 3.2M log lines.</p><p>Would be interested to know how that compares with similar systems.</p><p><a href="https://mastodon.social/tags/BBC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BBC</span></a> <a href="https://mastodon.social/tags/NodeJS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>NodeJS</span></a> <a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/GoogleCloud" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleCloud</span></a></p>
Neil Craig<p>After several iterations, I think I've finally got my log ingest pipeline working properly, at scale, using the <a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> Storage API.<br>Some complications with migrating from the "legacy" "streaming" (it's not in the sense of code) API have been really hard to deal with e.g.:<br>* A single row in a write fail means the entire write fails<br>* SQL column defaults don't apply unless you specifically configure them to<br>* 10MB/write limit<br>I rewrote the whole thing today &amp; finally things are looking good! 🤞</p>
Neil Craig<p>...and after even more debugging, it turns out that the reason for BigQuery `appendRows()` write failures was that large write data volumes (~5k rows or ~5MB of data) exceed the (undocumented) default `createStreamConnection()` timeout 🤦🏻‍♂️.</p><p> Even the units of the config option (once you find it) are not documented 🤦🏻‍♂️🤦🏻‍♂️. It's in milliseconds as it turns out.</p><p>I upped the timeout to 120s and the failures go away. FFS.</p><p><a href="https://cloud.google.com/nodejs/docs/reference/bigquery-storage/latest/bigquery-storage/managedwriter.writerclient" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">cloud.google.com/nodejs/docs/r</span><span class="invisible">eference/bigquery-storage/latest/bigquery-storage/managedwriter.writerclient</span></a></p><p><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/GoogleCloud" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleCloud</span></a> <a href="https://mastodon.social/tags/NodeJS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>NodeJS</span></a> <a href="https://mastodon.social/tags/WebDev" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebDev</span></a></p>
Neil Craig<p>After a goodly amount of debugging, it turns out that the BigQuery Storage API (for NodeJS at least) function `appendRows()` (which is how you tell it to write data to BigQuery) fails every time if you give it "too much" data.</p><p>This is not documented (and the docs are somewhat minimal for such an important function). I have an open case with Google &amp; have asked for the docs to be improved.</p><p>Thought it might help others to note this.</p><p><a href="https://cloud.google.com/nodejs/docs/reference/bigquery-storage/latest/bigquery-storage/v1.bigquerywriteclient#_google_cloud_bigquery_storage_v1_BigQueryWriteClient_appendRows_member_1_" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">cloud.google.com/nodejs/docs/r</span><span class="invisible">eference/bigquery-storage/latest/bigquery-storage/v1.bigquerywriteclient#_google_cloud_bigquery_storage_v1_BigQueryWriteClient_appendRows_member_1_</span></a></p><p><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/NodeJS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>NodeJS</span></a> <a href="https://mastodon.social/tags/WebDev" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebDev</span></a></p>
Neil Craig<p>I deployed (then had to revert) an update to the log processing pipeline which ingests our CDN access logs in Google Cloud Run then writes to BigQuery.<br>The change migrated from the "legacy" BigQuery API to the Storage API. Thought it was worth sharing some write performance improvements seen in the Storage API:<br>* P99 0.5s (30%) lower<br>* p95 155ms (25%) lower<br>* p75 58ms (20%) lower<br>* p50 43ms (43%) *higher*<br>That's on 1/2 size Run containers so higher p50 is worth it (⬇️)<br><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/GoogleCloud" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleCloud</span></a> <a href="https://mastodon.social/tags/BBC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BBC</span></a></p>
Eph Zero :emacs: 💪🏻 🚲 🤘🏻<p>Welp, I was laid off today. Anyone need an SQL monkey? </p><p>[UPDATED to include location and link to résumé] </p><p>Physical location: <a href="https://masto.snoozetown.org/tags/madison" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>madison</span></a> <a href="https://masto.snoozetown.org/tags/wisconsin" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>wisconsin</span></a> <a href="https://masto.snoozetown.org/tags/usa" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>usa</span></a> </p><p>Link to résumé: <a href="https://drive.google.com/file/d/1RkZ_buZxopuJOAgOh0Mp0ck0onF0Scex/view?usp=sharing" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">drive.google.com/file/d/1RkZ_b</span><span class="invisible">uZxopuJOAgOh0Mp0ck0onF0Scex/view?usp=sharing</span></a></p><p><a href="https://masto.snoozetown.org/tags/SQL" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SQL</span></a> <a href="https://masto.snoozetown.org/tags/bigquery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>bigquery</span></a> <a href="https://masto.snoozetown.org/tags/lookerstudio" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>lookerstudio</span></a> <a href="https://masto.snoozetown.org/tags/gcp" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>gcp</span></a> <a href="https://masto.snoozetown.org/tags/downtowork" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>downtowork</span></a> <a href="https://masto.snoozetown.org/tags/laidoff" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>laidoff</span></a> <a href="https://masto.snoozetown.org/tags/opentowork" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>opentowork</span></a> <a href="https://masto.snoozetown.org/tags/remote" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>remote</span></a> <a href="https://masto.snoozetown.org/tags/getfedihired" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>getfedihired</span></a></p>
Neil Craig<p>The BigQuery Storage API has (AFAIK) undocumented behaviour as-goes write failures which differs *dramatically* from the "legacy API":</p><p>If you tell the BQ Storage API to `appendRows()` and *any* rows fail to be written (due to e.g. data type/range incompatibility), *all* rows will in fact not to be written to BQ - even though the (Node) lib will only tell you the rows which are incompatible.</p><p>It took me some time to find this out. Hoping it helps someone.</p><p><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/GoogleCloud" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleCloud</span></a> <a href="https://mastodon.social/tags/NodeJS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>NodeJS</span></a> <a href="https://mastodon.social/tags/Node" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Node</span></a></p>
Neil Craig<p>Long shot...<br>Is anyone using the Node JS SDK for the BigQuery Storage API with `appendRows()`?</p><p>It seems to fail *all* writes if &gt;= 1 row passed to `appendRows()`fails which makes it unusable for me.</p><p>The "legacy" API has a `skipInvalidRows`option but I can't find one with the Storage API. I've raised a ticket but hoping someone'll know.</p><p><a href="https://mastodon.social/tags/GoogleCloud" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>GoogleCloud</span></a> <a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a></p>
NblogLinks<p>Don’t Lose Your Google Universal Analytics Data Ever Again!</p><p><a href="https://social.outsourcedmath.com/display/9b1e8f93-1066-7fe7-5b11-0e5739686650" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">social.outsourcedmath.com/disp</span><span class="invisible">lay/9b1e8f93-1066-7fe7-5b11-0e5739686650</span></a></p>
Neil Craig<p>I've been migrating our log ingest pipeline from writing (JSON) to BigQuery via `table.insert()` to using the BigQuery Storage API (which converts to a protobuf representation of the JSON you feed it).</p><p>The example code in docs is god-awful but I have persevered &amp; made it work.</p><p>Storage API:<br>- Is &gt;2x faster thus far<br>- Is ~10x cheaper (IIRC)<br>- Needs timestamps with usec res, `table.insert()` is in seconds 🤷🏼‍♂️</p><p>Pleased with it despite the pain. I'll polish it next week!<br><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/NodeJS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>NodeJS</span></a> <a href="https://mastodon.social/tags/Data" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Data</span></a> <a href="https://mastodon.social/tags/BBC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BBC</span></a></p>
Neil Craig<p>During last night's England vs Denmark match, we hit just over 750k log lines/second (mean minutely, the real peak will've been higher) on my log pipeline. The minutely peak was 45.1M log lines/minute.<br>Bearing in mind this only processes our web edge logs &amp; a small subset of our media logs (plus supporting services) &amp; the match wasn't super busy, that's quite a lot. I'll see if I can find out what the overall peak was. I'm guessing way into the millions<br><a href="https://mastodon.social/tags/WebDev" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>WebDev</span></a> <a href="https://mastodon.social/tags/Data" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Data</span></a> <a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/BBC" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BBC</span></a> <a href="https://mastodon.social/tags/Euros" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Euros</span></a></p>
Neil Craig<p>I wish BigQuery had better IP address functions e.g. a function which detects whether a given IP address is in a CIDR block.<br><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a></p>
Neil Craig<p>Weird...I have been working to migrate our log processing pipeline to use streams all the way from the log file itself to BigQuery...got it all working and was very pleased until I deployed it to the dev env and writes all failed due to lack of `bigquery.tables.create` permisison on the svc account (despite the destination tables existing).<br>Added that and it works.<br>Seems `table.createWriteStream()` (node) uses an implicit `create if not exists` or similar.<br><a href="https://mastodon.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> <a href="https://mastodon.social/tags/Node" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Node</span></a></p>
Flüpke<p>"""Open Source""", CC BY SA 4.0</p><p>Data is only <a href="https://chaos.social/tags/Google" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Google</span></a> <a href="https://chaos.social/tags/BigQuery" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>BigQuery</span></a> 🤡</p><p><a href="https://rumarchive.com/datasets/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">rumarchive.com/datasets/</span><span class="invisible"></span></a></p>