{"id":85007,"date":"2023-09-26T13:04:30","date_gmt":"2023-09-26T10:04:30","guid":{"rendered":"https:\/\/forklog.com\/en\/?p=85007"},"modified":"2025-09-12T20:45:22","modified_gmt":"2025-09-12T17:45:22","slug":"almost-human-a-major-update-to-chatgpt-goes-live","status":"publish","type":"post","link":"https:\/\/u1f987.com\/en\/almost-human-a-major-update-to-chatgpt-goes-live\/","title":{"rendered":"Almost human: a major update to ChatGPT goes live"},"content":{"rendered":"<p>OpenAI <a href=\\\"https:\/\/decrypt.co\/198611\/openai-upgrades-chatgpt-the-ai-chatbot-can-now-see-hear-and-speak\\\">released<\/a> a global update for the ChatGPT chatbot, which has learned to \u201csee, hear and speak.\u201d The update marks an important step in the development of artificial intelligence that can perceive and process information in multiple formats, not just text.<\/p>\n<blockquote class=\\\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\\\">\n<p>\u00abWe are starting to implement voice and graphical capabilities in ChatGPT. They offer a new, more intuitive type of interface, allowing you to carry on a conversation with the neural network or show it the object of the discussion\u00bb, \u2014 OpenAI explained.<\/p>\n<\/blockquote>\n<h2 class=\\\"wp-block-heading\\\"><strong>Conversations with AI<\/strong><\/h2>\n<p>The updated chatbot can hear and recognise users&#8217; speech. Any request can now be made by voice, much as with virtual assistants like Apple\u2019s Siri.<\/p>\n<p>To use the voice features, you need to turn them on in the app settings. ChatGPT offers a choice of five voices: \u201cJuniper\u201d, \u201cCove\u201d, \u201cSky\u201d, \u201cBreeze\u201d and \u201cEmber\u201d. The voices were recorded with professional actors.<\/p>\n<figure class=\\\"wp-block-audio\\\"><audio controls src=\\\"https:\/\/u1f987.com\/wp-content\/uploads\/poem-ember.mp3\\\"><\/audio><figcaption class=\\\"wp-element-caption\\\">ChatGPT poem. 
Data: OpenAI.<\/figcaption><\/figure>\n<p>For speech recognition, the neural network uses the open-source Whisper system.<\/p>\n<h2 class=\\\"wp-block-heading\\\"><strong>Show and Tell<\/strong><\/h2>\n<p>Users can also send ChatGPT images alongside ordinary prompts. The Vision feature, also known as GPT-4V, helps the neural network provide more accurate answers.<\/p>\n<div class=\\\"wp-block-media-text alignwide is-stacked-on-mobile\\\" style=\\\"grid-template-columns:32% auto\\\">\n<figure class=\\\"wp-block-media-text__media\\\"><video controls src=\\\"https:\/\/u1f987.com\/wp-content\/uploads\/ChatGPT-can-now-see-hear-and-speak-2-1.mp4\\\"><\/video><\/figure>\n<div class=\\\"wp-block-media-text__content\\\">\\n<\/div>\n<\/div>\n<p>As an example, the developers cited a scenario where something needs fixing. The user can outline the faulty area with drawing tools to make the chatbot&#8217;s task easier.<\/p>\n<p>Image analysis is provided by multimodal GPT-3.5 and GPT-4. These models apply their language reasoning skills to a broad range of attachments, from screenshots and diagrams to ordinary photographs.<\/p>\n<blockquote class=\\\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\\\">\n<p>\u00abVision is intended to assist you in everyday life. The network performs best when it sees the same things as you. 
The approach is based directly on our work with Be My Eyes, a free mobile app for blind and visually impaired people, to understand the boundaries of use and limitations\u00bb, \u2014 OpenAI representatives explained.<\/p>\n<\/blockquote>\n<h2 class=\\\"wp-block-heading\\\"><strong>New capabilities \u2014 new risks<\/strong><\/h2>\n<p>OpenAI\u2019s overarching aim is to create a safe and beneficial artificial general intelligence (AGI). However, concerns about user protection have grown more pressing with the advent of these new features.<\/p>\n<p>The company warns that voice synthesis opens new avenues for fraud: criminals could create deepfakes impersonating famous people.<\/p>\n<p>Visual models also pose problems, ranging from misinterpreting images to making offensive comments about people in photos. Before launch, OpenAI had a \u201cred team\u201d test the tool for risks such as extremist content and inaccurate scientific statements.<\/p>\n<blockquote class=\\\"wp-block-quote is-layout-flow wp-block-quote-is-layout-flow\\\">\n<p>\u00abWe have also taken technical measures to significantly limit the neural network&#8217;s ability to analyze and make direct statements about people, since ChatGPT is not always accurate, and these systems must respect privacy\u00bb, \u2014 OpenAI stressed.<\/p>\n<\/blockquote>\n<p>In July, developers <a href=\"https:\/\/u1f987.com\/en\/news\/chatgpt-debunks-flat-earth-theory\">released a new plugin<\/a> for the chatbot, which can analyze data, generate Python code, build charts and solve mathematical problems. 
The neural networks managed to debunk the &#8216;Flat Earth&#8217; theory.<\/p>\n<p>In August, OpenAI <a href=\"https:\/\/u1f987.com\/en\/news\/openai-unveils-an-enhanced-enterprise-version-of-chatgpt\">launched ChatGPT Enterprise<\/a> \u2014 a faster, more secure and powerful version of the chatbot for enterprise clients.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI developers released a global update for the ChatGPT chatbot that can &#8220;see, hear and speak&#8221;.<\/p>\n","protected":false},"author":1,"featured_media":85008,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"select":"1","news_style_id":"1","cryptorium_level":"","_short_excerpt_text":"","creation_source":"","_metatest_mainpost_news_update":false,"footnotes":""},"categories":[3],"tags":[438,1201,1150,1190],"class_list":["post-85007","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-news-and-analysis","tag-artificial-intelligence","tag-chatbots","tag-news-plus","tag-openai"],"aioseo_notices":[],"amp_enabled":true,"views":"18","promo_type":"1","layout_type":"1","short_excerpt":"","is_update":"","_links":{"self":[{"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/posts\/85007","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/comments?post=85007"}],"version-history":[{"count":1,"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/posts\/85007\/revisions"}],"predecessor-version":[{"id":85009,"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/posts\/85007\/revisions\/85009"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/media\/85008"}],"wp:attachme
nt":[{"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/media?parent=85007"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/categories?post=85007"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/u1f987.com\/en\/wp-json\/wp\/v2\/tags?post=85007"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}