37633. | singularity | Accounting for consistent performance across different LiveBench tasks shows Claude is the clear winner 2025-02-24 21:26:42 EST Source |
---|---|---|
37634. | singularity | Claude 3.7 Sonnet ranks 1st on SimpleBench. 2025-02-24 21:02:57 EST Source |
37635. | singularity | Sonnet 3.7-thinking wins against o1 and o3 on LiveBench 2025-02-24 20:39:50 EST Source |
37636. | singularity | How ‘Unfiltered’ is Grok? I Pushed It to Find Out 2025-02-24 20:32:39 EST Source |
37637. | singularity | Claude 3.7 Sonnet Thinking loses to o1 and o3-mini on LiveBench 2025-02-24 19:51:23 EST Source |
37638. | singularity | Let's discuss jailbreaking Grok! 2025-02-24 19:49:42 EST Source |
37639. | singularity | Claude 3.7 Sonnet can generate manim code. Here's a visualization of spacetime curvature. There was one error that it fixed by itself, so took two prompts. 2025-02-24 19:41:58 EST Source |
37640. | singularity | Claude 3.7 thinking livebench results 2025-02-24 19:41:20 EST Source |
37641. | singularity | Not what I wanted to hear from ChatGPT. Would future AGI act to counter non-democratic threats or act to accentuate them? What if a State AGI decides democracy is not the best form of governance? 2025-02-24 19:34:55 EST Source |
37642. | singularity | Claude 3.7 decoding task 2025-02-24 19:33:02 EST Source |
37643. | singularity | Guys, I broke Grok 2025-02-24 19:26:21 EST Source |
37644. | singularity | 3.7 Sonnet Thinking Ranks 3rd On Livebench 2025-02-24 19:15:07 EST Source |
37645. | singularity | Played with 3.7 Sonnet and I'm impressed 2025-02-24 19:11:26 EST Source |
37646. | singularity | Are we too hard on Google lmao 2025-02-24 19:03:56 EST Source |
37647. | singularity | Highest Quality PubMed/Academic Paper Scraper LLM 2025-02-24 18:31:56 EST Source |
37648. | singularity | Sonnet 3.7 sets SOTA on the aider leaderboard with a 65% score, using 32k thinking tokens 2025-02-24 18:31:45 EST Source |
37649. | singularity | Is Elon Musk right or is he just hyping up Grok? 2025-02-24 18:26:00 EST Source |
37650. | singularity | I guess livebench is terrible as a real-world analogue? 2025-02-24 18:13:02 EST Source |
37651. | singularity | Fish Tank Boids Simulation One-Shot Claude 3.7 2025-02-24 18:09:51 EST Source |
37652. | singularity | Claude 3.7 Sucks 2025-02-24 18:09:11 EST Source |
37653. | singularity | Ngl, Grok is pretty funny. Maybe the first AI that genuinely made me laugh 2025-02-24 17:34:19 EST Source |
37654. | singularity | Claude 3.7 Sonnet base is the new best non reasoning model in the world on LiveBench (reasoning scores coming soon) 2025-02-24 17:26:00 EST Source |
37655. | singularity | Dr. Ghaharian emphasized the development of AI-driven chatbots to educate consumers about responsible gambling 2025-02-24 17:22:58 EST Source |
37656. | singularity | I played with 3.7 Sonnet and I'm not impressed. Even after repeated explanations and prompts from me, this web game prototype was still broken in movement and the arrows depicting movement direction. (Caveat: I wasn't using extended thinking) 2025-02-24 17:04:59 EST Source |
37657. | singularity | 3.7 sonnet LiveBench results are in 2025-02-24 17:04:28 EST Source |
37658. | singularity | IonQ Announces Innovations in Compact, Room-Temperature Quantum Computing through Novel Extreme High Vacuum (XHV) Technology 2025-02-24 16:54:34 EST Source |
37659. | singularity | I tested various models' ability to generate SVG unicorns. 2025-02-24 16:54:00 EST Source |
37660. | singularity | Claude 3.7 reasoning response for my request to make something impressive for r/singularity 2025-02-24 16:47:52 EST Source |
37661. | singularity | AI is shaping the future of affiliate marketing 2025-02-24 16:47:37 EST Source |
37662. | singularity | Grok 3 appreciation post 2025-02-24 16:45:54 EST Source |
37663. | singularity | shots being fired between openai and anthropic 2025-02-24 16:44:11 EST Source |
37664. | singularity | Claude models playing Pokemon 2025-02-24 16:42:11 EST Source |
37665. | singularity | an Ai that shows creates an image based on parameters, is it possible to make? 2025-02-24 16:36:48 EST Source |
37666. | singularity | Claude 3.7 = AGI ? 2025-02-24 16:35:32 EST Source |
37667. | singularity | Claude-3.7 Sonnet - #test-case-1 2025-02-24 16:11:40 EST Source |
37668. | singularity | QwQ Max Preview just released. Will be open-sourced along with Qwen2.5 Max 2025-02-24 16:10:28 EST Source |
37669. | singularity | Anthropic’s Claude Code Is Accelerating Software Development Like Never Before 2025-02-24 16:09:17 EST Source |
37670. | singularity | I've just created an "Asteroid" interactive game with Claude 3.7 in a matter of seconds... this is something incredible. 2025-02-24 15:52:55 EST Source |
37671. | singularity | Anthropic's Chief Product Officer 2025-02-24 15:47:45 EST Source |
37672. | singularity | Claude 3.7 Sonnet scored 60% on the aider polyglot benchmark w/o thinking. Tied in 3rd with o3-mini-high. Sonnet 3.7 has the highest non-thinking score (formerly Sonnet 3.5). 2025-02-24 15:44:17 EST Source |
37673. | singularity | 3D unicorn by Claude 3.7 (Single blender python script) 2025-02-24 15:38:40 EST Source |
37674. | singularity | Great, so we're going back to dial-up... could this get any more stupider? 2025-02-24 15:28:17 EST Source |
37675. | singularity | A individual human only has the potential to exhibit "Specialized Intelligence", only certain groups of humans could be considered truly capable of "General Intelligence" 2025-02-24 15:20:30 EST Source |
37676. | singularity | Is it possible that LLMs end up being able to do every task better than narrow models? 2025-02-24 15:13:58 EST Source |
37677. | singularity | Claude 'pioneers' in 2027 2025-02-24 14:55:20 EST Source |
37678. | singularity | According to anthropic grok 3 beats Claude sonnet 3.7 at graduate level reasoning, visual reasoning and high school math competition 2025-02-24 14:54:06 EST Source |
37679. | singularity | ChatGPT still struggling with "r"s in strawberry. Any day now, I'm sure...! 2025-02-24 14:45:40 EST Source |
37680. | singularity | The most interesting strawberry solution so far lmao (sonnet 3.7) 2025-02-24 14:42:22 EST Source |
37681. | singularity | Here is claude sonnet 3.7 full system prompt 2025-02-24 14:32:08 EST Source |
37682. | singularity | The argument against the simulation hypothesis 2025-02-24 14:31:21 EST Source |
37683. | singularity | Anthropic just trolled the strawberry boy (system prompt) 2025-02-24 14:30:33 EST Source |
37684. | singularity | Holy SH*T they cooked. Claude 3.7 coded this game one-shot, 3200 lines of code 2025-02-24 14:27:48 EST Source |
37685. | singularity | Flappy Bird One-Shot Claude 3.7 vs o3 Mini-High.. 2025-02-24 14:20:03 EST Source |
37686. | singularity | Shocked at sonnet 3.7 test 2025-02-24 14:16:23 EST Source |
37687. | singularity | Not sure if posted but Claude 3.7 Sonnet and Claude Code just launched! 2025-02-24 14:15:33 EST Source |
37688. | singularity | Claude 3.7 thinking fails bouncing ball challenge 2025-02-24 14:14:34 EST Source |
37689. | singularity | Claude 3.7 Sonnet progress playing Pokémon 2025-02-24 14:04:35 EST Source |
37690. | singularity | Shots Fired! Direct sting against OpenAi from Claude 3,7 realease announcement 2025-02-24 13:57:08 EST Source |
37691. | singularity | Obligatory strawberries from Sonnet 3.7 2025-02-24 13:53:25 EST Source |
37692. | singularity | Claude 3.7 is here, Anthropic came back swinging 2025-02-24 13:50:42 EST Source |
37693. | singularity | Is there a way to test this across large no of cases #Claude 3.7 2025-02-24 13:43:09 EST Source |
37694. | singularity | New SOTA for real world coding tasks 2025-02-24 13:41:47 EST Source |
37695. | singularity | Sonnet 3.7 benchmarks 2025-02-24 13:39:19 EST Source |
37696. | singularity | Claude 3.7 benchmarks 2025-02-24 13:38:25 EST Source |
37697. | singularity | Claude 3.7 (and coding tool) officially released 2025-02-24 13:37:35 EST Source |
37698. | singularity | 3.7 Sonnet and new coding tool are out 2025-02-24 13:34:57 EST Source |
37699. | singularity | Claude 3.7 Sonnet and Claude Code 2025-02-24 13:33:32 EST Source |
37700. | singularity | Claude 3.7 is now live in the Anthropic API 2025-02-24 13:32:15 EST Source |
37701. | singularity | 3.7 Sonnet is officially out 2025-02-24 13:32:14 EST Source |
37702. | singularity | xAI Co-Founder blames OpenAI for Groks 'woke' behavior after training it on ChatGPT prompts in unearthed tweet 2025-02-24 13:31:39 EST Source |
37703. | singularity | Introducing Claude 3.7 Sonnet: our most intelligent model to date 2025-02-24 13:31:04 EST Source |
37704. | singularity | All their need is a little bit of incentive to totally bypass the "user" 2025-02-24 13:26:45 EST Source |
37705. | singularity | Claude 3.7 sonnet has officially released 2025-02-24 13:25:54 EST Source |
37706. | singularity | unpopular opinion: People are severely underestimating the level of complexity in distribution of AI in society. AGI/ASI will have no problem knowing exactly how to do it, but that doesnt mean that it will be achieved because of the human factor. Brute forcing through this and we have a tyranny. 2025-02-24 13:17:15 EST Source |
37707. | singularity | Perplexity building Comet, a browser for Agentic Search 2025-02-24 12:18:30 EST Source |
37708. | singularity | Allison Duettmann says AIs may use hidden steganographic communication to collude against humans, and we'll see a "crazy Cambrian explosion" of deceptive AIs vs AIs working for humans 2025-02-24 11:56:52 EST Source |
37709. | singularity | Alef Aeronautics' Model Zero: First City Flying Car Test 2025-02-24 11:53:22 EST Source |
37710. | singularity | MIT's Max Tegmark: "If you have robots that can do everything better than us, including building smarter robots, it's pretty obvious that AGI is not just a new technology, like the internet or steam engine, but a new species ... It's the default outcome that the smarter species takes control." 2025-02-24 11:49:08 EST Source |
37711. | singularity | Dr Alan’s AGI countdown jumped 2% to 90% after the 1x Helix announcement 2025-02-24 11:37:40 EST Source |
37712. | singularity | "Branch from Here" feature in AI Studio conversations 2025-02-24 11:36:42 EST Source |
37713. | singularity | It seems like Grok 3 loves thinking...but it's been thinking the same thing, it doesn't wanna stop 2025-02-24 11:30:51 EST Source |
37714. | singularity | Inside Unitree Robotics: China's hottest humanoid robot company 2025-02-24 11:27:18 EST Source |
37715. | singularity | Looking at humans like how we look at other Animals: ASI 2025-02-24 11:24:12 EST Source |
37716. | singularity | 🔥 Fire is a DANGEROUS fad and we’re not ready!!! 🔥 2025-02-24 11:06:24 EST Source |
37717. | singularity | Conversation branching is now live in Google AI Studio 2025-02-24 11:02:15 EST Source |
37718. | singularity | What will happening with old people after we when will we get rid of old age and also pensions? 2025-02-24 10:16:02 EST Source |
37719. | singularity | Apple is investing $500 billion in US-based AI data centers and AI server manufacturing facilities over the next 4 years 2025-02-24 10:05:45 EST Source |
37720. | singularity | Microsoft Cancels Leases for AI Data Centers 2025-02-24 09:35:42 EST Source |
37721. | singularity | Claude 3.7 Sonnet announced on February 26? 2025-02-24 09:34:58 EST Source |
37722. | singularity | What’s the model that you have fond memories with? 2025-02-24 09:23:58 EST Source |
37723. | singularity | AGI IS DELAYED 2025-02-24 09:22:13 EST Source |
37724. | singularity | Is Grok really not a Nazi? 2025-02-24 09:00:04 EST Source |
37725. | singularity | Race claude 4 vs. gpt 5 2025-02-24 08:32:39 EST Source |
37726. | singularity | Day 1 of the Singularity starts today 2025-02-24 07:53:23 EST Source |
37727. | singularity | Alibaba plans to invest $52b in AI and cloud infrastructure 2025-02-24 07:41:39 EST Source |
37728. | singularity | Microsoft pulls out of Project Stargate 2025-02-24 06:21:44 EST Source |
© Copyright hackingai.app 2022-2023 Release Version 1.01202023
Return to top ►