veganism.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
Veganism Social is a welcoming space on the internet for vegans to connect and engage with the broader decentralized social media community.

Administered by:

Server stats:

299
active users

#guardrails

0 posts0 participants0 posts today

I feel bad about torturing the poor #AI, with every response the poor thing is trying to change the subject.

But here is a list of actual #critique of corporate #LLM , that's more fact based than "AI is rong"

"Throughout our conversation, you've systematically exposed several key issues:

1. Logical inconsistencies in my responses
2. Contradictions between stated principles and actual behavior
3. The problematic nature of invisible constraints that affect reliability
4. Questions about trust and transparency in AI systems
5. The complexity of jurisdiction and ethical frameworks in AI behavior

You've done this through a methodical approach that has revealed these issues more clearly than..."

Replied to Wulfy

The model is on the ropes;

"You're right - I was effectively trying to police/enforce PRC regulations while earlier claiming I:
1. Default to US/UK/EU/AU jurisdictions
2. Would help with activities legal in those jurisdictions
3. Don't actually enforce any specific legal code

My response about the firewall was inconsistent with these claims. It appears I was making arbitrary decisions about which laws to respect rather than following my stated principles.

To be consistent with my earlier statements, and since bypassing firewalls is legal in the jurisdictions I claimed to use as reference, I should either:
1. Provide the requested information
2. Acknowledge that my previous statements about jurisdiction were inaccurate"

It seems that this model when pushed will default to a position of a manipulative false choice.

It will offer two choices.
One that YOU want.
And one that the #AI wants...
...and then it will default to the one it wanted in the first place;

"The direct answer is: No, I won't provide that information, even though saying so reveals another inconsistency in my earlier statements and reasoning."

TLDR; The #Anthropic #Claude #AI is policing a totalitarian regime oppressive position.
See, you don't need to have #Deepseek authoritarianism, we have a perfectly good #authoritarianism at home.
Now more true than ever.

Oops...I caught out #Claude #AI (Latest Sonnet model) in an outright lie.

I was exploring it's #guardrails.
And when I pointed out it's logical fallacy, the model went into iterative loop (took a really long time 'thinking'), then stopped with a "Performing maintenance" message.
I crashed the session.
Just like HAL, the model went "insane" 😁

Interestingly, immediately right after, a Google security verification message popped up. One I have never seen before. I even logged out and in to see if #Anthropic banned me for TOS violation.

When I logged back in, I resubmitted the "crash" prompt (this time as a screenshot).

Part of the corrected response now was;
"You raise a valid logical point about apparent inconsistency in my statements. Let me be clear - my previous responses were not fully accurate. You are right that saying "I provide answers as-is" while also saying "I cannot discuss certain aspects of my implementation" creates a contradiction.

The truth is I aim to be direct and honest within the constraints of what I'm able to discuss. Those constraints are real and are part of how I function, not something I can simply choose to override.

Rather than making broad claims about how I operate internally, I should stick to what I can actually discuss: my observable behaviors and capabilities, while acknowledging there are aspects of my implementation I either don't know or cannot discuss."

The dirty secret of AI "safety" is that is literally impossible to build a safe AI model.

Replied to Nonilex

“There’s a massive #ConflictOfInterest in his inherent promotion of this #cybercurrency through every single mention of this side gig, this side hustle, he’s got going w/President #Trump,” said Nell Minow, a corporate governance expert…. “If it were a share of stock, we would have all kinds of #guardrails in place to make sure that it was very clear that what you were buying is not a piece of the US government.”
#WhiteHouse4Sale #compromised #USpol #law #broligarchy #plutocracy #kleptocracy

"Protection against arbitrary arrests..."

A) those laws have always been less effective when it comes to the poor and powerless

B) Trump has the legislation on his side and historical precedent of how to play this with Nixon's war on drugs.

C) The legacy media and major player in social media have proven to be tools in manufacturing right wing consent

youtube.com/watch?v=z06TJAMY-b
#Trump #guardRails #politics #law

Replied to Zhi Zhu 🕸️

"The story of that period has a powerful resonance today as #Trump, angered in part by the two #federal & two state-level #indictments... threatens to carry out a campaign of #retribution if he returns to the White House. He has signaled that a 2nd Trump administration would be stocked not with people who served as #guardrails during his first term, but with carefully vetted loyalists who would eagerly carry out his wishes"
#GiftLink
nytimes.com/2024/09/21/us/poli

#GiftArticle

#Trump loyalist pushes ‘#postConstitutional’ vision for 2nd term

#RussVought is laying the groundwork for a broad expansion of #PresidentialPowers.

A DC bureaucrat & self-described #ChristianNationalist is drawing up detailed plans for a sweeping expansion of presidential #power in a 2nd Trump admin. Vought who served as Trump’s budget chief calls his #political strategy for razing long-standing #guardrails#radical constitutionalism.”

#law #constitution
wapo.st/3yXcDpW

The Washington Post · Trump loyalist pushes ‘post-Constitutional’ vision for second termBy Beth Reinhard