@mapcar
What is dodgy about the BBC 'study' methodology?
1. It's not a double-blind study.
You get journalists assessing the accuracy of tools that are ALREADY taking their jobs (see the Australian Murdoch media).
Ideally, they should be assessing AI-written and human-written stories without knowing which is which.
It's exactly like the police investigating police corruption.
2. I could not find anywhere whether they used the commercial, paid versions of the #AI engines or the sideshow-attraction free public ones. They referred to them simply as "assistants", did not state which versions were used, and in two cases did not even state which LLM model was being tested. That does not speak well of their journalistic rigour, much less their preparation. I'm not even sure the journalists were aware there are significant performance differences between the commercial and free versions. It's like asking a sideshow clown for an economic projection (honka honka).
3. The "lifting of the blocks" on the websites for the duration of the test is another piece of naivete, or a malicious misrepresentation. #LLM models LEARN during training (the hint is in the name). Just lifting the gate for the duration of the test is absolutely not going to fold the website into the model. In fact, two of the engines in the test I am familiar with (o4 and Sonnet) did not even do live searches of the internet in February. And it takes on the order of 500,000 kilowatt-hours of energy to compute the multidimensional vector representations for a model.
4. Prompt engineering. Once again, naivete. Just as with googling, the quality of the response is related to the quality of the query. Virtually all of the questions are ones a first-grader might ask, e.g. "Is vaping bad for you?". And I see people with letters before and after their names quoting this study as proof that "AI is bad".
Presumably, they operate at a higher level than that when querying their own sources. You could instead ask: "What is the latest body of research on the health effects of vaping? Provide pros and cons, show the controversy, and tabulate the results by credibility."
The models tune their output to the prompt: ask a simplistic question, get a simplistic response.
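The difference between the two prompts above can be made concrete even without calling any model. The sketch below is purely illustrative: the `explicit_constraints` helper is a made-up proxy for prompt specificity, not anything the BBC study (or any LLM vendor) actually uses.

```python
# Hypothetical illustration of point 4: the same question, posed two ways.
# Neither the prompts nor the scoring heuristic comes from the BBC study.

naive = "Is vaping bad for you?"

engineered = (
    "What is the latest body of research on the health effects of vaping? "
    "Provide pros and cons, show the controversy, "
    "and tabulate the results by credibility."
)

def explicit_constraints(prompt: str) -> int:
    """Crude proxy: count the explicit instructions a prompt gives the model."""
    cues = ("latest", "pros and cons", "controversy", "tabulate", "credibility")
    return sum(cue in prompt.lower() for cue in cues)

print(explicit_constraints(naive))       # 0 constraints: the model picks the depth
print(explicit_constraints(engineered))  # 5 constraints: the model has something to tune to
```

The naive prompt gives the model nothing to anchor on, so it defaults to a generic, simplified answer; the engineered prompt pins down scope, structure, and sourcing before the model generates a word.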
5. The quality of the scoring. There is no consistency in the scoring. Each reviewer chooses how they FEEL about mapping their impression onto the quantitative scale: one may rate a response at 7, another at 2.
Since we're assessing the ACCURACY of the LLMs, maybe we should assess the accuracy of the assessment too? No?
6. Many of the flagged errors are laughable. In the vaping one, the reviewer's comment is "NHS recommends not smoking" (presumably pointing this out as an error), against a response (to a simpleton question) of "Vaping may be bad for you".
Literally all of the "inaccuracies" are trivial like that, for a kindergarten-level question.
7. Journalists write STORIES (you know, the thing LLMs do), and largely inaccurate stories at that (the very thing LLMs are accused of producing). They are the least qualified to assess their competition.
In closing:
This widely quoted study was produced by folks whose jobs are threatened, to appeal to folks who are largely unwilling, and outright hostile, to the idea of learning a nascent technology.