- AIaaS (OpenAI):
- GPT-4o: $2.50/1M input tokens, $10/1M output tokens ([22][41]).
- GPT-4 Turbo: $10/1M input tokens, $30/1M output tokens ([3]).
- Cloud Hosting (AWS/GCP/Azure):
- GPU instance (e.g., NVIDIA A100): $3.06–$6.12/hour ([23][27]).
- Estimated cost: ~$0.03–$0.05 per 1K tokens (based on 158K tokens/hour on H100) ([1]).
- Self-Hosting:
- Hardware: NVIDIA H100 server ($2/hour) + maintenance.
- Cost: ~$0.013/1K tokens (44.1 tokens/sec) ([1]).
Thresholds:
- Individuals: API viable for <1M tokens/month.
- Organizations: Self-hosting economical at >10M tokens/month.
- AIaaS (OpenAI):
- $0.006/minute ([102][106]).
- Cloud Hosting (AWS Transcribe):
- $0.024–$0.040/minute ([63][84]).
- Self-Hosting:
- GPU: RTX 3090 ($10,000 upfront) + $50–$150/month power.
- Cost: ~$0.002–$0.004/minute ([84][63]).
Thresholds:
- Individuals: API viable for <1K hours/year.
- Organizations: Self-hosting saves costs at >5K hours/year.
- AIaaS (GPT-4o Vision):
- $2.50–$5.00/1M tokens ([22][54]).
- Cloud Hosting (Azure AI Vision):
- $1–$5/1K images ([25][101]).
- Self-Hosting (e.g., LLaVA):
- GPU: NVIDIA A100 ($45/hour).
- Cost: ~$0.10–$0.50/1K images ([4][27]).
Thresholds:
- Individuals: AIaaS preferred for <100K images/month.
- Organizations: Self-hosting viable at >1M images/month.
- AIaaS (Anthropic):
- Claude 3.5: $1/1M input tokens, $5/1M output tokens ([103]).
- Cloud Hosting (AWS SageMaker):
- GPU instance: $3.06–$6.12/hour ([23]).
- Cost: ~$0.04–$0.08/1K tokens.
- Self-Hosting:
- Similar to GPT-4 (~$0.015/1K tokens) ([1][27]).
Thresholds:
- Organizations: Self-hosting breaks even at >5M tokens/month.
- AIaaS (Google):
- $1.25–$5.00/1M tokens ([28][41]).
- Cloud Hosting (GCP Vertex AI):
- $0.03–$0.09/1K tokens ([28][39]).
- Self-Hosting:
- Requires high-end GPUs (A100/H100), ~$0.02–$0.03/1K tokens ([27][85]).
| Model Type | Switch API → Cloud | Switch Cloud → Self-Host |
|---|---|---|
| Text (GPT-4) | >1M tokens/month | >10M tokens/month |
| ASR (Whisper) | >500 hours/month | >5K hours/month |
| Coding (Claude-3.5) | >2M tokens/month | >20M tokens/month |
| Multimodal | >50K images/month | >500K images/month |
- AIaaS Prices Dropped:
- GPT-4 API: 50% reduction since 2023 ([3][46]).
- Whisper API: Fell from $0.010 to $0.006/minute ([102][106]).
- Cloud Hosting Stabilized:
- GPU instances: ~5% annual price cuts ([23][85]).
- Self-Hosting Hardware Improved:
- H100 efficiency: 2x tokens/sec vs. 2023 GPUs ([1][4]).
- Individuals/SMBs: Use AIaaS for flexibility; switch to cloud at ~1M tokens/month.
- Enterprises: Self-host for workflows >10M tokens/month or specialized needs (data privacy, latency).
- Multimodal Workloads: Cloud hosting balances cost and scalability for mid-volume use.
Note: Graphs illustrating these trends are unavailable in provided sources, but the above analysis reflects aggregated pricing trajectories.
Sources Cited:
[1][3][4][22][23][25][27][28][39][41][46][54][63][84][85][101][102][103][106]
Citations: [1] https://www.e2enetworks.com/blog/why-self-hosting-small-llms-are-cheaper-than-gpt-4-a-breakdown [2] https://docs.gitlab.com/ee/administration/self_hosted_models/supported_models_and_hardware_requirements.html [3] https://gptforwork.com/help/billing/self-service-packs/how-it-works [4] https://www.pugetsystems.com/labs/articles/tech-primer-what-hardware-do-you-need-to-run-a-local-llm/ [5] https://www.reddit.com/r/selfhosted/comments/18ok7u8/selfhosted_large_language_model/ [6] https://blog.briancmoses.com/2024/09/self-hosting-ai-with-spare-parts.html [7] https://blog.n8n.io/local-llm/ [8] https://blog.n8n.io/self-hosted-ai/ [9] https://learn.microsoft.com/en-us/azure/ai-services/openai/whats-new?eventId=Orapin-AzureOpenAIService_jEEVcD_QBdG2 [10] https://github.com/langgenius/dify/blob/main/README.md [11] https://botpress.com/blog/best-large-language-models [12] https://research.aimultiple.com/gpt4/ [13] https://openai.com/index/gpt-4o-system-card/ [14] https://www.datacamp.com/blog/12-gpt4-open-source-alternatives [15] https://pollthepeople.app/chatgpt-alternative-self-hosted/ [16] https://github.com/meetkool/FREE-GPT-4 [17] https://www.zdnet.com/article/openai-tailored-chatgpt-gov-for-government-use-heres-what-that-means/ [18] https://slashdot.org/software/p/GPT-4/integrations/?page=4 [19] https://www.iguazio.com/blog/commercial-vs-self-hosted-llms/ [20] https://www.plural.sh/blog/self-hosting-large-language-models/ [21] https://thedeveloperspace.com/comparison-of-generative-ai-solutions-aws-vs-azure-vs-google-cloud-2024-guide/ [22] https://openai.com/api/pricing [23] https://www.effectivesoft.com/blog/cloud-pricing-comparison.html [24] https://dev.to/keploy/understanding-gpt-4-costs-a-comprehensive-guide-540a [25] https://azure.microsoft.com/en-us/pricing/details/cognitive-services/openai-service/?msockid=08f65cd7a8af616908664953a9da609b [26] https://research.aimultiple.com/gpt4/ [27] https://www.gleecus.com/blogs/cloud-native-ai-stack-aws-azure-gcp/ [28] https://cloud.google.com/vertex-ai/pricing?hl=en [29] https://www.linkedin.com/pulse/ai-cloud-2025-brian-stephenson-fmjhe [30] https://zapier.com/blog/claude-vs-chatgpt/ [31] https://blog.g-gen.co.jp/entry/comparing-rag-architecture-across-cloud-vendors [32] https://www.datacamp.com/blog/what-is-gpt-4o [33] https://www.opensourceforu.com/2024/09/cloud-based-ai-services-from-azure-aws-and-gcp-an-overview/ [34] https://www.devoteam.com/expert-view/generative-ai-pricing-openai-vs-google-cloud/ [35] https://dasarpai.com/dsblog/genai-capabilities-from-aws+gcp+azure [36] https://aws.amazon.com/marketplace/pp/prodview-fk4xbj3ch2e4o [37] https://www.cloudoptimo.com/blog/aws-bedrock-a-complete-guide-to-ai-models-pricing-and-integration-with-aws-services/ [38] https://ddi-dev.com/blog/programming/how-much-does-ai-cost/ [39] https://cloud.google.com/docs/get-started/aws-azure-gcp-service-comparison [40] https://keploy.io/blog/community/gpt-4-cost-everything-you-need-to-know-before-getting-started [41] https://docsbot.ai/tools/gpt-openai-api-pricing-calculator [42] https://design-code.tips/blog/2025-01-20-mastering-openai-api-in-2025-use-cases-pricing-and-practical-node-js-examples/ [43] https://research.aimultiple.com/gpt4/ [44] https://openai.com/api/pricing [45] https://www.rezolve.ai/blog/claude-vs-gpt-4 [46] https://apidog.com/blog/openai-updates-gpt-3-5-turbo-and-gpt-4/ [47] https://community.openai.com/t/batch-api-significant-price-drop-in-january-2025/1107229 [48] https://dev.to/keploy/understanding-gpt-4-costs-a-comprehensive-guide-540a [49] https://www.mckinsey.com/capabilities/mckinsey-digital/our-insights/superagency-in-the-workplace-empowering-people-to-unlock-ais-full-potential-at-work [50] https://www.notta.ai/en/blog/deepseek-r1-vs-openai-gpt-o1 [51] https://www.techtarget.com/searchenterpriseai/tip/The-best-AI-chatbots-Compare-features-and-costs [52] https://www.datacamp.com/blog/what-is-gpt-4o [53] https://zenn.dev/muit_techblog/articles/70d8be851b2f14 [54] https://apidog.com/blog/gpt-4o-mini-api/ [55] https://dirox.com/post/openai-operator [56] https://yourgpt.ai/tools/openai-and-other-llm-api-pricing-calculator [57] https://www.cnet.com/tech/services-and-software/what-is-claude-everything-to-know-about-anthropics-ai-tool/ [58] https://synthflow.ai/blog/retell-ai-pricing [59] https://zapier.com/blog/claude-vs-chatgpt/ [60] https://azure.microsoft.com/en-us/pricing/details/cognitive-services/openai-service/?msockid=08f65cd7a8af616908664953a9da609b [61] https://incora.software/insights/whisper-vs-google-speech-to-text [62] https://www.reddit.com/r/homeassistant/comments/1cpabyf/selfhosting_whisper_on_dedicated_hardware_how/ [63] https://www.gladia.io/blog/how-much-does-it-really-cost-to-host-open-ai-whisper-ai-transcription [64] https://auphonic.com/blog/2022/11/08/auphonic-whisper-asr-beta/ [65] https://vatis.tech/blog/openai-whisper-in-house-transcription-is-it-worth-the-cost [66] https://pypi.org/project/openai-whisper/?oVg1S6RN=qeuCvGGB [67] https://community.openai.com/t/is-whisper-api-really-10x-more-expensive-than-self-hosted/576427 [68] openai/whisper#5 [69] https://www.reddit.com/r/selfhosted/comments/178twi1/whisper_self_hosted_whats_the_most_costefficient/ [70] https://github.com/openai/whisper/activity [71] https://blog.gdeltproject.org/whisper-vs-chirp-the-hidden-gpu-cost-of-free-ai-models-why-commercial-hosted-models-can-be-far-cheaper/ [72] https://www.edenai.co/post/top-free-speech-to-text-tools-apis-and-open-source-models [73] openai/whisper#608 [74] https://deepgram.com/learn/everything-about-voice-ai-agents [75] https://vatis.tech/blog/open-source-whisper-vs-api-selecting-the-best-speech-to-text [76] https://www.notta.ai/en/blog/speech-to-text-open-source [77] https://www.assemblyai.com/blog/the-state-of-python-speech-recognition/ [78] https://www.baseten.co/blog/driving-model-performance-optimization-2024-highlights/ [79] https://www.g2.com/products/whisper/reviews [80] https://community.n8n.io/t/self-host-hardware-requirements/12843 [81] https://www.opensourceforu.com/2024/09/cloud-based-ai-services-from-azure-aws-and-gcp-an-overview/ [82] https://incora.software/insights/whisper-vs-google-speech-to-text [83] https://www.indium.tech/blog/whisper-ai-model-training-on-custom-data/ [84] https://vatis.tech/blog/openai-whisper-in-house-transcription-is-it-worth-the-cost [85] https://www.astuto.ai/blogs/top-cloud-service-providers [86] https://www.willowtreeapps.com/craft/10-speech-to-text-models-tested [87] https://cloud.google.com/docs/get-started/aws-azure-gcp-service-comparison [88] https://www.g2.com/products/whisper/reviews [89] https://console.voicegain.ai [90] https://www.edenai.co/post/best-speech-to-text-apis [91] https://opea-project.github.io/latest/getting-started/README.html [92] https://sourceforge.net/software/product/Whisper/ [93] https://deepgram.com/learn/must-know-building-and-applying-conversational-ai [94] https://apidog.com/blog/whisper-api/ [95] https://docs.datasaur.ai/compatibility-and-updates/release-notes/version-6 [96] https://github.com/openai/whisper/activity [97] https://www.ml6.eu/event/ai-governance-from-theory-to-practice [98] https://blog.gdeltproject.org/whisper-vs-chirp-the-hidden-gpu-cost-of-free-ai-models-why-commercial-hosted-models-can-be-far-cheaper/ [99] https://substack.com/home/post/p-154143650 [100] https://www.transcribetube.com/blog/openai-whisper-api-limits [101] https://azure.microsoft.com/en-us/pricing/details/cognitive-services/openai-service/?msockid=08f65cd7a8af616908664953a9da609b [102] https://apidog.com/blog/whisper-api/ [103] https://www.pymnts.com/artificial-intelligence-2/2024/anthropics-ai-upgrade-raises-price-performance-questions/ [104] https://opentools.ai/tools/whisper-api [105] https://community.openai.com/t/batch-api-significant-price-drop-in-january-2025/1107229 [106] https://blog.gdeltproject.org/experiments-with-speech-transcription-comparing-the-cost-of-running-openais-whisper-vs-googles-chirp-in-production/ [107] https://gigazine.net/gsc_news/en/20240416-anthropic-ceo-ai-training-cost/ [108] https://vatis.tech/blog/explore-the-best-free-speech-to-text-apis [109] https://community.openai.com/t/api-model-whisper-real-cost/469816 [110] https://auphonic.com/blog/2022/11/08/auphonic-whisper-asr-beta/ [111] https://opentools.ai/news/openais-gpt-4-unmasking-the-mystery-behind-the-silence [112] https://vatis.tech/blog/openai-whisper-in-house-transcription-is-it-worth-the-cost [113] https://www.linkedin.com/in/jalinden [114] https://www.g2.com/products/whisper/reviews [115] https://opentools.ai/tools/conformer2 [116] https://www.edenai.co/post/best-speech-to-text-apis [117] https://azure.microsoft.com/ja-jp/pricing/details/cognitive-services/openai-service/?msockid=3892456a0a11671d2fca50ef0bd466f5 [118] https://aiola.com/blog/best-speech-to-text-apis/ [119] https://github.com/openai/whisper/activity [120] https://github.com/ahmetoner/whisper-asr-webservice/ [121] https://www.mindee.com/blog/ocr-api-pricing-free-vs-paid [122] https://openreview.net/forum?id=fMaEbeJGpp [123] https://arxiv.org/html/2501.17887v1 [124] https://arxiv.org/html/2410.21169v1 [125] https://blog.roboflow.com/paligemma-multimodal-vision/ [126] https://github.com/infiniflow/ragflow/tree/main [127] https://www.bentoml.com/blog/multimodal-ai-a-guide-to-open-source-vision-language-models [128] https://ai.gopubby.com/the-open-source-ai-revolution-2025-how-deepseek-v3-is-making-100m-ai-systems-available-to-c23bcd853f8c?gi=1921e4948af1 [129] https://arxiv.org/pdf/2501.17887.pdf [130] https://news.microsoft.com/ignite-2024-book-of-news/ [131] https://github.com/OpenBMB/MiniCPM-o/activity [132] https://github.com/roboflow/inference/actions/workflows/docker.gpu.yml [133] https://github.com/myshell-ai/AIlice/activity [134] https://blog.n8n.io/ai-workflow-automation/ [135] https://pubs.rsna.org/doi/10.1148/radiol.241073 [136] https://developer.nvidia.com/tao-toolkit [137] https://learn.microsoft.com/en-us/azure/ai-services/openai/whats-new?eventId=Orapin-AzureOpenAIService_jEEVcD_QBdG2 [138] https://blog.roboflow.com/gpt-4-vision/ [139] https://www.ibm.com/new/announcements/ibm-granite-3-1-powerful-performance-long-context-and-more [140] https://encord.com/blog/vision-language-models-guide/ [141] https://www.infoworld.com/article/2271149/review-document-parsing-in-aws-azure-and-google-cloud.html [142] https://cloud.google.com/vision/on-prem/pricing [143] https://www.cloudthat.com/resources/blog/a-comparative-analysis-of-amazon-rekognition-vs-google-cloud-vision-ai-vs-azure-custom-vision [144] https://blog.roboflow.com/best-ocr-models-text-recognition/ [145] https://cloud.google.com/bigquery/docs/release-notes [146] https://www.e2enetworks.com/blog/a-guide-to-building-ocr-systems-using-pixtral-12b [147] https://cloud.google.com/terms/services [148] https://www.docsumo.com/blogs/ocr/api [149] https://www.linkedin.com/pulse/comprehensive-analysis-building-multi-modal-research-31-thomas-lv98c [150] https://cloud.google.com/vertex-ai/generative-ai/docs/embeddings/get-multimodal-embeddings [151] https://news.microsoft.com/ignite-2024-book-of-news/ [152] https://www.bentoml.com/blog/multimodal-ai-a-guide-to-open-source-vision-language-models [153] https://www.upwork.com/hire/text-recognition-specialists/ [154] https://arxiv.org/pdf/2501.17887.pdf [155] https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/whats-new [156] https://github.com/QwenLM/Qwen2.5-VL/actions [157] https://www.scaleway.com/en/generative-apis/ [158] https://www.ibm.com/new/announcements/ibm-granite-3-1-powerful-performance-long-context-and-more [159] https://mistral.ai/news/pixtral-large/ [160] https://azure.microsoft.com/en-us/products/ai-services/ai-content-understanding?msockid=20eafc2b572c63222dfbe96d5625629f [161] https://gigazine.net/gsc_news/en/20240416-anthropic-ceo-ai-training-cost/ [162] https://www.mindee.com/blog/ocr-api-pricing-free-vs-paid [163] https://ai.gopubby.com/the-open-source-ai-revolution-2025-how-deepseek-v3-is-making-100m-ai-systems-available-to-c23bcd853f8c?gi=1921e4948af1 [164] https://cloud.google.com/vision/on-prem/pricing [165] https://www.superteams.ai/blog/a-guide-to-incorporating-multimodal-ai-into-your-business-workflow [166] https://www.mindee.com/blog/leading-ocr-api-solutions [167] https://kanerika.com/blogs/chatgpt-vs-gemini-vs-claude/ [168] https://www.bentoml.com/blog/multimodal-ai-a-guide-to-open-source-vision-language-models [169] https://openai.com/api/pricing [170] https://cloud.google.com/vertex-ai/generative-ai/docs/embeddings/get-multimodal-embeddings [171] https://encord.com/blog/ [172] https://www.edenai.co/post/best-multimodal-embeddings-apis [173] https://cloud.google.com/vertex-ai/generative-ai/docs/deprecations/partner-models [174] https://blog.roboflow.com/best-ocr-models-text-recognition/ [175] https://blog.roboflow.com/how-to-read-receipts-with-ai/ [176] https://learn.microsoft.com/en-us/azure/ai-services/computer-vision/whats-new [177] https://www.linkedin.com/posts/harveysingh01_artificialintelligence-aiagents-activity-7262825928207003648-oddf [178] https://mistral.ai/news/pixtral-large/ [179] https://github.com/enricoros/big-agi/activity [180] https://github.com/LLaVA-VL/LLaVA-NeXT/activity [181] https://fireworks.ai/blog/deepseek-r1-deepdive [182] https://www.reddit.com/r/selfhosted/comments/1do0rfo/what_hardware_do_you_use_for_large_ai_models/ [183] https://towardsdatascience.com/economics-of-hosting-open-source-llms-17b4ec4e7691 [184] https://llmsforsocialscience.net/assets/presentations/oxford_llm_lecture_fowler.pdf [185] https://www.leanware.co/insights/deepseek-r1-vs-gpt-o1 [186] https://dev.to/pavanbelagatti/run-deepseek-r1-locally-for-free-in-just-3-minutes-1e82 [187] https://langfuse.com/docs/model-usage-and-cost [188] https://stratechery.com/2025/deepseek-faq/ [189] https://www.vellum.ai/blog/the-training-of-deepseek-r1-and-ways-to-use-it [190] https://discuss.huggingface.co/t/llama-7b-gpu-memory-requirement/34323 [191] https://www.theregister.com/2025/01/26/deepseek_r1_ai_cot/ [192] https://botpress.com/blog/best-large-language-models [193] https://arstechnica.com/ai/2025/01/how-does-deepseek-r1-really-fare-against-openais-best-reasoning-models/ [194] https://nodeshift.com/blog/a-step-by-step-guide-to-install-deepseek-r1-locally-with-ollama-vllm-or-transformers-2 [195] https://www.reddit.com/r/selfhosted/comments/1c7ff6q/anyone_selfhosting_chatgpt_like_llms/ [196] https://openai.com/index/learning-to-reason-with-llms/ [197] https://hackaday.com/2025/01/27/new-open-source-deepseek-v3-language-model-making-waves/ [198] https://news.ycombinator.com/item?id=42259184 [199] https://www.qodo.ai/blog/qodo-gen-adds-self-hosted-support-for-deepseek-r1/ [200] https://www.reuters.com/technology/artificial-intelligence/what-is-deepseek-why-is-it-disrupting-ai-sector-2025-01-27/ [201] https://www.effectivesoft.com/blog/cloud-pricing-comparison.html [202] https://www.scmp.com/tech/big-tech/article/3292916/alibaba-cuts-ai-visual-model-cost-85-last-day-year-price-war-rages [203] https://thedeveloperspace.com/comparison-of-generative-ai-solutions-aws-vs-azure-vs-google-cloud-2024-guide/ [204] https://azure.microsoft.com/en-us/pricing/details/cognitive-services/openai-service/?msockid=08f65cd7a8af616908664953a9da609b [205] https://www.silvertouch.ca/blog/aws-vs-azure-vs-google-cloud/ [206] https://cloud.google.com/vertex-ai/pricing?hl=en [207] https://www.aalpha.net/blog/difference-between-azure-google-cloud-aws/ [208] https://novasky-ai.github.io/posts/reduce-overthinking/ [209] https://www.linkedin.com/pulse/cloud-computing-predictions-2025-opportunities-challenges-will-kelly-hfb9e [210] https://www.linkedin.com/pulse/almost-timely-news-introduction-reasoning-ai-models-2025-01-26-penn-ogkre [211] https://www.simplilearn.com/tutorials/cloud-computing-tutorial/aws-vs-azure [212] https://www.technologyreview.com/2025/01/24/1110526/china-deepseek-top-ai-despite-sanctions/ [213] https://dasarpai.com/dsblog/genai-capabilities-from-aws+gcp+azure [214] https://writesonic.com/blog/deepseek-launches-ai-reasoning-model [215] https://www.linkedin.com/pulse/aws-vs-azure-google-cloud-comparing-major-3-platforms-yetze [216] https://techcrunch.com/2025/01/11/researchers-open-source-sky-t1-a-reasoning-ai-model-that-can-be-trained-for-less-than-450/ [217] https://www.appsierra.com/blog/cloud-pricing-comparison [218] https://www.infoq.com/news/2025/01/google-deepmind-gemini/ [219] https://www.cloud4c.com/blogs/azure-vs-aws-gcp-vs-oci-the-right-fit-for-your-business [220] https://www.ibm.com/think/news/deepseek-r1-ai [221] https://gigazine.net/gsc_news/en/20240416-anthropic-ceo-ai-training-cost/ [222] https://openai.com/api/pricing [223] https://docsbot.ai/tools/gpt-openai-api-pricing-calculator [224] https://openai.com/ja-JP/api/pricing/ [225] https://www.cnbc.com/2025/01/30/chinas-deepseek-has-some-big-ai-claims-not-all-experts-are-convinced-.html [226] https://api-docs.deepseek.com/quick_start/pricing [227] https://fortune.com/2025/01/28/venture-capital-thrive-deepseek-openai-anthropic-lightspeed/ [228] https://docs.perplexity.ai/guides/pricing [229] https://www.economist.com/business/2025/01/20/openais-latest-model-will-change-the-economics-of-software [230] https://www.tricentis.com/blog/deepseek-and-the-rise-of-ai-reasoning [231] https://www.cnbc.com/2025/01/31/deepseek-next-generation-ai-agents-may-erode-value-of-large-models.html [232] https://techcrunch.com/2025/01/27/deepseek-claims-its-reasoning-model-beats-openais-o1-on-certain-benchmarks/ [233] https://asiatimes.com/2025/01/how-deepseek-revolutionized-ais-cost-calculus/ [234] https://dev.to/visdom_04_88f1c6e8a47fe74/deepseek-r1-vs-openai-o1-which-ai-reasoning-model-dominates-in-2025-576l [235] https://www.vox.com/technology/397330/deepseek-openai-chatgpt-gemini-nvidia-china [236] https://www.datacamp.com/blog/deepseek-r1 [237] https://fortune.com/2025/01/27/deepseek-just-flipped-the-ai-script-in-favor-of-open-source-and-the-irony-for-openai-and-anthropic-is-brutal/ [238] https://openrouter.ai/announcements/reasoning-tokens-for-thinking-models [239] https://community.openai.com/t/what-is-the-impact-of-deepseek-on-the-ai-sector/1097716/11 [240] https://www.techtarget.com/whatis/feature/DeepSeek-explained-Everything-you-need-to-know [241] https://www.anthropic.com/news/claude-3-5-sonnet [242] https://www.anthropic.com/news/3-5-models-and-computer-use [243] https://www.reddit.com/r/OpenAI/comments/1h82pl3/i_spent_8_hours_testing_o1_pro_200_vs_claude/ [244] https://ai.meta.com/blog/meta-llama-3/ [245] https://deepinfra.com [246] https://www.digitalocean.com/resources/articles/best-ai-coding-assistant [247] https://composio.dev/blog/notes-on-new-deepseek-v3/ [248] https://simonwillison.net/2024/Nov/12/qwen25-coder/ [249] https://simonwillison.net/2024/Oct/22/computer-use/ [250] https://tech.co/news/how-much-does-claude-ai-cost [251] https://www.anthropic.com/claude [252] https://www.reddit.com/r/ClaudeAI/comments/1e9nmkl/software_devs_how_are_you_preparingupskilling_for/ [253] https://github.com/Doriandarko/claude-engineer/activity [254] https://www.youtube.com/watch?v=-D_YbDJ1Zb8 [255] https://forum.cursor.com/t/cursor-deepseek/43261 [256] https://towardsdatascience.com/your-company-needs-small-language-models-d0a223e0b6d9 [257] https://www.deeplearning.ai/the-batch/issue-283/ [258] https://openrouter.ai/models [259] https://github.com/enricoros/big-agi/activity [260] https://docs.snowflake.com/en/user-guide/snowflake-cortex/llm-functions