AIn't What It Used to Be: March 2024

Friday, March 29, 2024

4/5/24: Masters proposal presentations

Machine Learning Study Group

Welcome! We meet from 4:00-4:40 p.m. Central Time. Anyone can join. Feel free to attend any or all sessions, or ask to be removed from the invite list as we have no wish to send unneeded emails of which we all certainly get too many.

Contacts: jdberleant@ualr.edu and mgmilanova@ualr.edu

Agenda & Minutes

108th meeting, Apr. 5, 2024

Three CS masters student project proposal presentations today! Back to our regular program next week (or maybe the week after).

Amazon product ratings analysis. Question: Why do you think linear regression will work? Answer: Simplicity. It was suggested that simplicity might not be enough of a reason. Question: What criteria for evaluating the results? Answer: (this was not really addressed). Question: what metric will you use, or how will you choose features to use? Answer: (this was not really addressed).

Topic: Hybrid feature fusion, model selection, and coffee leaf disease classification.
Question: Do we need to decide the model before research? Sounds backwards, research should dictate the model approach.

Answer: (this was not addressed).

Question: 1) Genealogy of coffee leaf data set, normalization, basis for integration and comparison.
2) diagnosis implies a decision tree. What is the space of possible outcomes and what are the range of interventions that take place? How many treatments are there for how many diseases? At what scale are the leaf features recorderd? Is this scale similar among disparate datasets?
3) CNNs are a good choice for this problem. One can train with the same disease under different lighting and sampling conditions, and conversely different diseases under similar lighting and sampling conditions.
4) Why bother with the Nano? Why not just upload the training set to the cloud, and then acquire the test set, upload it to the many-GPU cloud and then receive the results. This is not an application which requires real-time or near real-time decision making, so you don’t really need to worry about edge computing either because the deadlines are soft.
Who labels the test set? Are there culture methods that can give you high accuracy in the test set?

Answer: Feature fusion will be tested using multiple methods. There will be three disease categories.

Question: 6) Hardware is a don’t care provided the environment you are using is portable. Jupiter is a good choice. Flask is unnecessary. You can build out a compact client environment without creating too grandiose a framework.
1A) Consider choosing one methodology and doing it very well and ensure that its coverage is adequate for the spectrum of disease states you are concerned with.

Answer:

Question: What if you reduce the scope of the project? How would you do that? Why would you not want to do that. For example, why bother requiring to do it on a nano processor?

Answer:

Question: How are you going to label your training set reliably? Since you are not an expert in coffee plant diseases.

Answer:

Topic: detection, diagnosis and treatment of mental health disorders. Use machine learning.
Question: There is a cycle: diagnosis, then treatment, then evaluation of whether the treatment helped. How much of DSM-V would be covered by this project? In more detail: 1) This is a fantastic topic and of major importance to the world.
2) Diagnosis —> Intervention —> Evaluation —> Go to 1.
3) How much of the DSM-5 would be covered by this methodology? Bipolar, OCD, Tourette’s Spectrum Disorders, Schizophrenia and subtypes (catatonic, paranoid, etc), Anxiety, Depression, Mania.
4) Would detection of emotions be the same as detection of mental illness? Everyone shows all the listed emotions, but when any emotion predominates we may be concerned about a behavior pathology. How do you get form emotion detection to mental illness diagnosis? Maybe you are doing emotion classification, not illness diagnosis.
5) Decision trees, ensembles of weak learners, the success of XGboost could be used.
Answer:
Consider working with the VA, as they are very concerned with veterans who return from military service with mental health problems.
What about suicide prevention? If you can somehow prevent even a few, it is very worthwhile. You don't have to prevent all of them.
Please contact Grace to discuss data and etc.

Wednesday, March 27, 2024

3/29/24: Video! (On the "Sparks of AGI" paper)

Machine Learning Study Group

Contacts: jdberleant@ualr.edu and mgmilanova@ualr.edu

Agenda & Minutes

107th meeting, Mar. 29, 2024

Updates, announcements, questions, etc.?

We may get some masters student presenting their work or defending. They are certainly welcome and invited!
Anything else?

Readings: We are reading

As decided last time, today we viewed this Youtube video about the article: https://www.youtube.com/watch?v=qbIk7-JPB2c (found by IE)

We got up to minute 28 in the video and will start there the next time we view this video.

Sparks of Artificial General Intelligence: Early experiments with GPT-4 (https://arxiv.org/abs/2303.12712).
We are up to "Next, we explore how GPT-4 can generate" and will start with that next time we read from the article.
Want to read another article or view another video? Let me know and I'll put it on the list and/or we can time share across more than one article per meeting!

We have read chapter 1 part 4: https://huggingface.co/learn/nlp-course/chapter1/4 up to "Note that the first attention layer in a decoder block pays attention to" and we can start from there at some point.

Friday, March 22, 2024

3/22/24: Continue reading "Sparks of Artificial General Intelligence..."

Machine Learning Study Group

Contacts: jdberleant@ualr.edu and mgmilanova@ualr.edu

Agenda & Minutes

106th meeting, Mar. 22, 2024

Updates, announcements, questions, etc.?
Readings: We are reading

Sparks of Artificial General Intelligence: Early experiments with GPT-4 (https://arxiv.org/abs/2303.12712).
Next time we will do this Youtube video about the article: https://www.youtube.com/watch?v=qbIk7-JPB2c (found by IE)
We got up to "Next, we explore how GPT-4 can generate" and will start with that next time we do it.

We have read chapter 1 part 4: https://huggingface.co/learn/nlp-course/chapter1/4 up to "Note that the first attention layer in a decoder block pays attention to" and we can start from there.

Sunday, March 10, 2024

3/15/24: GPT 4.0, healthcare advice, and language translation

Machine Learning Study Group

Contacts: jdberleant@ualr.edu and mgmilanova@ualr.edu

Agenda & Minutes

105th meeting, Mar. 15, 2024

DU will lead discussion on "Efficacy of ChatGPT 4.0 in Translations between English and Spanish for Healthcare Patient Requests for Medical Advice"

Discussion, suggestions, and questions:

Why translate with ChatGPT and not Google Translate (for example)?
Will you measure improvement in outcomes by collecting before-and-after data?
What is your hypothesis? (This was given in the Central Questions subsection.)

For the actual proposal, the claims here will need to be established through citations to existing background work.

In full proposal, explain with citations that Google Translate does poorly in this domain. This helps establish the need to try ChatGPT or other AIs.
People were asking about BLEU scores so in the proposal explain the issues with it.
It's good that you have found something novel because research should have a novelty component. Another one is keep in mind is significance to the field.
Clarify how these translated messages and originals are being treated confidentially per HIPAA and so on.
Some comments after the meeting suggested that the prompts in the document were pretty rudimentary and maybe you could get better results with modest improvements in the prompts.

The meeting ended here

For next week:

Updates, announcements, questions, etc.?
Readings: We are reading

Sparks of Artificial General Intelligence: Early experiments with GPT-4 (https://arxiv.org/abs/2303.12712).
We will start with "Throughout the paper we emphasize limitations"
Here is a Youtube video about the article: https://www.youtube.com/watch?v=qbIk7-JPB2c (found by IE)

We have read chapter 1 part 4: https://huggingface.co/learn/nlp-course/chapter1/4 up to "Note that the first attention layer in a decoder block pays attention to" and weGPT

Friday, March 8, 2024

3/8/24: Discuss newly released Google Gemini and related topics

Machine Learning Study Group

Contacts: jdberleant@ualr.edu and mgmilanova@ualr.edu

Agenda & Minutes

104th meeting, Mar. 8, 2024

Updates, announcements, questions, etc.?

I will try to get another candidacy examinee to tell us about his plans in lieu of having an individual examination meeting with me.

We discussed the newly released Google Gemini and related topics

The meeting ended here.

Readings: We are reading

Sparks of Artificial General Intelligence: Early experiments with GPT-4 (https://arxiv.org/abs/2303.12712).

We will start with "Throughout the paper we emphasize limitations"
Here is a Youtube video about the article: https://www.youtube.com/watch?v=qbIk7-JPB2c (found by IE)

We have read chapter 1 part 4: https://huggingface.co/learn/nlp-course/chapter1/4 up to "Note that the first attention layer in a decoder block pays attention to" and we can start from there.

Suggested viewings/readings on LAMs (Large Action Models) we could evaluate:

https://spectrum.ieee.org/prompt-engineering-is-dead (suggested 3/8/24)
https://www.youtube.com/watch?v=22wlLy7hKP4&t=91s&ab_channel=rabbit
https://www.youtube.com/watch?v=UOZqFMxRpWE&ab_channel=ExitsMedia
https://www.youtube.com/watch?v=Rqh6fhcAqpw&ab_channel=ColdFusion
https://www.youtube.com/watch?v=uJnhh7YSr5Q&ab_channel=TheAIGRID
https://medium.com/version-1/the-rise-of-large-action-models-lams-how-ai-can-understand-and-execute-hum

Nvidia training materials

We could go through some of these. MM has a license for free course/resource access. See https://www.nvidia.com/en-us/training/online/.