[ home / rules / faq / search ] [ overboard / sfw / alt ] [ leftypol / edu / labor / siberia / lgbt / latam / hobby / tech / games / anime / music / draw / AKM ] [ meta ] [ wiki / shop / tv / tiktok / twitter / patreon ] [ GET / ref / marx / booru ]

/tech/ - Technology

"Technology reveals the active relation of man to nature" - Karl Marx

Posting mode: Reply [Return]
Name
Options
Subjectlw#8rX♹`⛿x.\%]3&1Cst?*⚘aPOY>BJQjn=;0DZ-G,4⛾z/fMLR e⛡:dqpA52k!7~^{)v⚢WHb☸6 @K☔TUF9h+g'}IE(_c<	Spoiler Image
Comment
Verification
Flag
File
Embed
Password	(For file deletion.)

Not reporting is bourgeois

[ Return / Go to bottom ]

File: 1726459786963.png (365.18 KB, 709x538, nuimageboard.png)

nu-imageboard megathread #2 Anonymous 16-09-24 04:09:47 No.26419[View All]

The neverending quest to rewrite vichan -

Archived threads:
https://archive.is/xiA7y

130 posts and 42 image replies omitted.

Anonymous 07-11-24 19:29:58 No.27034

File: 1731007798385.png (188.26 KB, 2920x1794, Screenshot 2024-11-07 at 2….png)

>>27030
Styled the error pages, cleaned up the error handling in source, and add proper handling for 500 errors in the mean time. The incremental refresh is turning out to be quite tricky. Was thinking could take a hash of the page to check for dropped messages, and just refresh the whole thing like is currently always done in this case, but it turns out the preview script modifies the DOM and this is no longer a reliable measure. Am open to suggestions on how of if its necessary to do this. Is the gevent loop just fast enough? The LLM has to wait for me to get a new computer to avoid swapping.

Anonymous 07-11-24 23:56:04 No.27043

>>27030
The only addition to the program after this morning was the simpler system to create admin accounts. Realized to setup incremental autorefresh would require reparsing posts to generate backlinks and at that point gave up for now. The view for reports of posts, users, and addresses requires me to make a query that don't know how to express in SQLAlchemy think it would be an outer join, and coalesce. The LLM requires me to have a better computer to avoid swapping while developing. So that's it from me for now. Releasing the source is the only thing which seems possible for now, and still haven't decided on a platform for this.

Open to any suggestions.

Glownonymous 08-11-24 05:41:35 No.27044

>>27018
I suggest you add the ability to run messages through an arbitrary server-side spam detection program. See https://bogofilter.sourceforge.io for something like spamassassin, that also works for plaintext.

Anonymous 08-11-24 07:19:54 No.27045

Site doesn't even work yet but already has a link at top to beg for money.

Anonymous 08-11-24 17:33:27 No.27046

File: 1731087206416.png (183.55 KB, 2920x1794, Screenshot 2024-11-08 at 1….png)

>>27045
>ability to run messages through an arbitrary server-side spam detection program.
Added, though this is a feature for advanced users given the implementation. Will use this for my LLM moderation system and have it as a separate program.

>>27045
>beg for money.
In the lupen sense and not the grifter, went ahead and removed it from the temporary nav. Not a clue what boards to actually include either if it ever gets hosted.

Also added stickies and locked stickies, styled the success pages, and cleaned up the error reporting some more.

Anonymous 09-11-24 16:18:37 No.27048

>>27046
Added support for the remaining MIME types today, so video, audio, and documents. This included making thumbnails for PDFs, and having a fallback thumbnail for documents which don't have their own (the document formats other than PDF).

Anonymous 11-11-24 02:41:47 No.27059

Found a federated imageboard using ActivityPub, and a library to interact with ActivityPub, so am considering giving this a go once my new machine is setup.

Glowing smallanon 11-11-24 15:55:25 No.27062

>>27043
Why do you need the LLM? Why not just a basic firewall for now? Also this site sounds pretty sick good luck in your endeavours.

Anonymous 11-11-24 21:38:36 No.27063

>>27062
Thank you! Avoiding the LLM is probably more practical because of the 2Gb of ram it eats up raising costs 50%. The idea was just to make moderation a little easier for those of us without thick skin.

>>27059
Haven't yet found a way to write an ActivityPub server that would be reasonable. The FChannel implementation is many thousands of lines. The way the've implemented actors is also somewhat strange. Rather than having actors be a salt hashed address or a user account they've made them boards: https://github.com/FChannel0/FChannel-Server/issues/9#issuecomment-822675142 There is a good JSON-LD library from the looks of it.

Anonymous 14-11-24 19:33:55 No.27089

Have been working on another project (a torrent client which just now started to be able to download files!), but have been growing my todo list for this project. There are a number of related enhancements, CIDR bans which perhaps won't be included (anyone know if these are actually used in practice?), RSS feeds, and one major desired feature addition: ActivityPub.

* Devops
- git repository
- host source
- public instance

* Missing
- caching. (observer pattern?)
- CIDR bans.

these don't make sense with the current interface, and it's not
clear to me whether or not they are required.

- RSS feeds.
- move thread.
- post search.
- ActivityPub.
- OOP restructuring!
- views for ban reports.

requires adding func.now() to these models for ordering.

- incremental autorefresh. (observer pattern?)

this requires not only add and remove but update messages in order
that backlinks be displayed correctly.

* Separate Projects
- booru site.
- LLM moderation.

Anonymous 14-11-24 20:17:27 No.27090

>>27089
Actually the hold up at the moment is picking a name so can make a new git repo and get to work on the OOP restructure. Was thinking something related to technomancy and like the original "channel" designation. Thought maybe Ars Via, but it's a little much isn't it. Anyone got any ideas?

Glownonymous 14-11-24 21:29:38 No.27091

>>27090
leanboard, webchannel, industrialchan, blackboard, anonpress

Anonymous 15-11-24 17:18:06 No.27099

>>27091
Ended up rejecting your idea and making the repository just with my previous attempt: Ars Via. Have completed most all of the OOP restructuring. Also used Flask-SQLAlchemy to properly handle the SQL Session. Currently trying to figure out how to go about the incremental autorefresh and caching. Recently learned that for websocket based applications like this one you're really supposed to be using tornado, which is a bit of a bummer.

Anonymous 15-11-24 23:23:29 No.27104

>>27099
Added caching but am not presently satisfied by my project. To do it right at this point seems to necessitate switching to some sort of asynchronous framework (like Tornado, FastAPI, or Starlette), and an asynchronous ORM to match. Starlettte in particular would probably let me write my socket endpoints using the observer pattern, which is desirable to me. Might not be too difficult to rewrite in this way.

Anonymous 17-11-24 01:23:13 No.27106

>>27104
>To do it right at this point seems to necessitate switching to some sort of asynchronous framework, and an asynchronous ORM to match.
This has been a little frustrating but the Japanese fusion jazz has helped. It's going to take a little while .. which is good! Spent all day and haven't even managed to get the GET routes up and running.

Anonymous 17-11-24 14:08:26 No.27128

File: 1731852506166.png (733.52 KB, 2920x1794, Screenshot 2024-11-17 at 1….png)

>>27106
Managed to get a few routes, views, and the render working; migrated to postgres from sqlite, and had to fix up the models a bit to suite. Am apparently going to need to rework the routes, and the asyncio SQLAlchemy extension lacks support for events so there's a bit to do there, and a ton of the views haven't been moved over yet, nor the websockets. Temporarily set posts to work with UUIDs, though something like this would be needed for federation.

Anonymous 17-11-24 20:55:52 No.27130

>>27128
Was able to add the error pages pretty easily, but session management is giving me some real trouble even with the starlette-login module. Feel the need to take a break from this program; may or may not.

Anonymous 17-11-24 21:25:35 No.27134

>>27130
Seems like the key is going to be to not use the starlette-login module but rather use the authentication module that comes pre-included.

Anonymous 18-11-24 16:42:58 No.27150

File: 1731948178071.png (30.34 KB, 1024x150, Screenshot 2024-11-18 at 1….png)

>>27134
Don't suppose anyone here knows how to make SQLAlchemy return only the first image for each thread? So far as can tell am doing everything correctly here, but still getting an error that "row_number" doesn't exist. If it were just SQL could pull it off but for some reason the "label" even as it emits an AS does not cooperate with the following use of the labeled item.

Unrelated am also having an issue with Starlette-Login trying to serialize user to JSON for some reason and failing at this. Should be able to take a deeper look into this myself at some point.

Anonymous 18-11-24 16:43:37 No.27151

>>27150
Well one error is that it should be "row_number = 1" rather than "==", but this isn't the problem.

Anonymous 19-11-24 16:41:56 No.27164

>>27150
>>27151
This issue was solved with the help of another forum.

Anonymous 19-11-24 19:52:52 No.27167

LOTS of bugs to fix but the rough in of all the functionality sans-websockets for the Starlette/Uvicorn migration is now complete. All the GET routes and views are working, thread posting is working, and so is login. The main question at present is how to setup the GUIDs for the posts. F-Channel uses a random base-32 number of length eight (a namespace of roughly one trillion, if it were base64url it would be two hundred fifty trillion). Another idea is to just use UUID4 (which is 2^128 or 10^38). None of these are really all that appealing to me, but surely will decide on something eventually.

Anonymous 20-11-24 00:32:11 No.27168

File: 1732062730579.png (648.5 KB, 3326x1798, Screenshot 2024-11-19 at 6….png)

>>27167
Decided on the UUID4 solution. Managed to get posting within a thread, post parsing, bump-ordering, backlinks, and the javascript up and running with the new system. To my knowledge automatic locking of full posts, initialization of admin accounts, GET pages for address and user history, and POST handling for admin, ban, and report still need to be addressed. Hopefully will be able to get through some of these tomorrow.

Anonymous 20-11-24 01:00:01 No.27169

>>27168
Is there someway to have two hashes for the same object and then confirm that they are for the same object without knowing what that object is? Was thinking for moderation purposes it would be interesting to have hashes of addresses communicated between federated servers along with their respective messages in such a way that each could check whether or not its an already banned user. Found this: https://crypto.stackexchange.com/a/102963 but it seems a little theoretical, so probably not.

Anonymous 20-11-24 07:01:41 No.27170

>>27169
> automatic locking of full posts, initialization of [default] admin accounts, GET pages for address and user history, and POST handling for admin, ban, and report still need to be addressed.
Believe have solved these now. Hopefully sometime tomorrow can finish off the migrations with its comprehensive testing, and begin to implement new features again.

Anonymous 20-11-24 18:43:56 No.27171

>>27170
>Hopefully sometime tomorrow can finish off the migrations with its comprehensive testing,
Wouldn't say the testing was quite comprehensive, but did a quick read through, and some sporadic testing which turned out well. Went ahead and merged the Starlette branch with main.

>>27170
>begin to implement new features again.
Am considering getting rid of the (you)s to enable caching to work more effectively. Theoretically post identifiers could be stored in the session, and this way the (you)s added back at a later date. Am also considering replacing the server side rendering of the [logout] button with client side rendering. The downside is slightly less functionality for noscript users, the advantage is that caching is much easier.

Anonymous 20-11-24 19:30:04 No.27172

>>27171
>Am also considering replacing the server side rendering of the [logout] button with client side rendering.
Some may be happy that this part didn't work out. So will just make template rendering a little more intricate.

Anonymous 21-11-24 00:21:25 No.27174

>>27171
>>27172
>So will just make template rendering a little more intricate.
This didn't work out either so ended up with a nice and light-weight solution to the problem. Now have caching of all the render functions that query the database. Managed to re-implement the old nonincremental socket-based autorefresh more elegantly in Starlette using the observer pattern that was trying to do with Flask-Sock. Also added a count for how many times the page has been refreshed by the socket. Am pretty happy with all this, though it's not even close to the complexity required to have incremental socket updates. It might be between compression, client-side caching, and server-side caching it wouldn't really be worth the complexity cost for now.

Anonymous 21-11-24 21:41:28 No.27176

>>27174
Made the image and reply count appear on the catalog page. Also added the user and address reports to the reported page. Been working through bugs otherwise. Here are my remaining TODOs:

Devops
- host source
- public instance

Missing
- RSS feeds.
- ActivityPub.
- post search?

Most of what the program needs at the moment is testing, to uncover bugs and then subsequently fix them. Have been doing some of this already today, but it may be about time to finally get the source code hosted somewhere and start receiving feedback. Biggest hold up here is deciding on the platform; seems like it's GitHub or Codeberg.

Anonymous 22-11-24 00:58:27 No.27178

>>27174
The likelihood of me actually implementing any of these but hosting the source is quite low come to think of it.

Anonymous 23-11-24 22:13:01 No.27187

File: 1732399980997.png (648.5 KB, 3326x1798, Screenshot 2024-11-19 at 6….png)

>>27178
Don't have it in me to write documentation at the moment but here's the repository: https://codeberg.org/jung/arsvia Plan to add install instructions, and some pictures. The gist is:
1. install postgres, imagemagick (libmagickwand-dev), python3, and pip.
2. run pip install -e .
3. setup a user and database in postgress and add to config.py
4. python3 arsvia
5. navigate to 127.0.0.1:5000
6. create an account which will automatically become an admin.

Anonymous 25-01-25 07:06:20 No.28245

use nntpchan

https://github.com/nesshy9/nntpchan/

Anonymous 26-01-25 22:54:55 No.28343

>>27176
activitypub is a bad match for imageboards unless you're going to treat every poster as one big account. Activitypub and other protocols seem to assume named accounts which imageboards don't have.

Glownonymous 10-02-25 05:29:33 No.28476

File: 1739165372796.jpeg (7.87 KB, 200x200, xhsdoge.jpeg)

>>28343
Make everyone register an account

(for real tho this could be not bad. It would help against spam and illegal content by raising the barrier to entry some [and new account creation could be temporarily halted, which is better than harming vpn or tor users or etc], and on the front everyone could still be anonymous. Or a tripfag at their choosing. whatever. Would it be so bad?)

Glownonymous 10-02-25 05:32:12 No.28477

>>28343
wait nevermind, the names would still be exposed in the federation process, i'm dumb. I guess every topic/board could be considered a user? I agree though. Also activitypub is just kind of bad in general. It's really nonspecific and everyone implements it so differently that the dream of cross-platform connection is very rarely reality.

Glownonymous 07-07-25 19:46:24 No.30503

>>27187
Am considering rewriting arsvia, my imageboard, in typescript (using Next.js and TypeORM) to expand my techstack.
Arsvia has user pages containing all their posts, and thought to run with that idea a little.
Namely adding a feature for sharing usernames either with specific people or making them public.
And more interestingly, to allow users to assign arbitrary nicknames to other users, and to post to user pages without a board.
Guess the user pages could reuse the moderation functionality so that users could manage their own pages.
Maybe with the Oath2 JWT authentication and try to figure out email verification too.

In short it would be something in between an imageboard and a microblogging platform.

Glownonymous 08-07-25 01:51:40 No.30504

File: 1751939499942.png (657.99 KB, 1920x1080, 2025-07-07-184949_1920x108….png)

>>30503
It has begun!

Glownonymous 08-07-25 15:57:46 No.30505

>>30504
Am thinking for this version instead of using the UUIDs directly I'm going to rewrite in the render such that it generates a sequential ordering by creation time followed by internal UUID. For threads there could then be an array perhaps with ranges or an epoch to avoid the ordering reusing numbers for the render (pretty ugly imo).
Further I'm going to use threads as namespaces like in textboard.org so that each thread starts from zero.
I'm also considering using the UUID primary key of the address log table to emulate accounts for anonymous users. This would mean that they could be federated if that was something of interest.
Could consider also using readable UUIDs like https://github.com/Debdut/uuid-readable though this is a little chaotic, but at least they wouldn't be like Ganymedean's from PKD's "The World Jones Made".

Glownonymous 09-07-25 05:27:07 No.30510

>>30505
Interesting. While working on the successor to >>>/siberia/679700 i settled on using a tai64n timestamp with 2 extra bytes of entropy (prepended to the start, so posts and threads can be distributed into buckets). This also means my thread files, which consist of lines of these IDs, have fixed-size records and can be atomatically added to using O_APPEND.

Glownonymous 10-07-25 16:53:18 No.30514

>>30510
Downloaded, and read the source for the three main SSG programs, and skimmed io.c.
It's obviously a very interesting approach.
Just from reading what you wrote here for some reason thought had made the program to use fixed disk space.
That it's a textboard has reminded me that I've no idea how to federate files.

Glownonymous 10-07-25 17:05:01 No.30515

>>30514
Thank you. If i had to summarize this approach, it would be reducing complexity by finding the right interfaces.
>That it's a textboard has reminded me that I've no idea how to federate files.
Elaborate.

Glownonymous 10-07-25 19:51:42 No.30516

>>30515
>right interfaces
Wouldn't have guessed that this is what you would have called this.
Don't worry about the files thing was just being foolish.
Should probably spend some time learning UNIX a little better.
If for no other reason than to get better at using other programming languages' standard libraries.

>>30514
In other news seem to have gotten something like a nearly complete schema setup:

auth
├── PrivateAddress.ts
├── PrivateUser.ts
├── PublicAddress.ts
├── PublicUser.ts
└── UserRole.ts
reports
├── PostAction.ts
├── PostReport.ts
├── Report.ts
├── UserAction.ts
└── UserReport.ts
threads
├── File.ts
├── Nicknames.ts
├── PageInfo.ts
├── Post.ts
├── Reference.ts
├── Tags.ts
└── UserProfile.ts

3 directories, 17 files

The most interesting features are the ability to assign nicknames to anonymized (by default) IP addresses or users.
And the replacement of boards with moderated user created tags; along with the posting to multiple tags.
Am thinking of federating anonymous posts using throwaway https://github.com/Debdut/uuid-readable UUID usernames.
This would be as an alternative to the high dox potential of having anonymous histories based on IP.

Glownonymous 10-07-25 20:40:48 No.30517

>>30516
>to get better at using other programming languages' standard libraries.
C doesn't really have the best standard library. You usually want to avoid using anything that isn't a thin wrapper around a syscall.
>Wouldn't have guessed that this is what you would have called this.
I got this from https://skarnet.org/software/skalibs/djblegacy.html
<One of the "DJB philosophy" key points is to question the interfaces. You have a task to do; you have existing interfaces. What do you do?
<Interfaces should be questioned right down to the libc. You cannot build strong software on flakey foundations. And from a system and network programmer's point of view, one thing is clear: most standard libc interfaces suck. There is no buffered asynchronous I/O. There is no timed I/O. There is no heap management helper. Even simple system calls are not always guaranteed to succeed!
UNIX doesn't have proper records, which most developers compensate for with databases, essentially a second filesystem layer. Files and directories are really the only level where database operations are simple, reliable and fast, so if you keep your state simple enough, you can map all of it onto simple file creation and access.

The next version will allow composing posts by processing arbitrary blobs, but i'm proud of how the static site generation approach worked out for this one. The locks are the only non-essential state of the system and usually short-lived, everything else is either persistent data, static configuration or entirely ephemeral.

Glownonymous 11-07-25 18:25:45 No.30519

>>30516
Am probably going to restart.
I've realized this should be a ActivityPub server first.
And only secondly a anonymous imageboard (front-end).

Glownonymous 17-07-25 18:25:43 No.30602

>>30517
Wonder if a torrent client might be candidate for a good djb application.
You could parse the bencode into the filesystem operating as a key value.
Lists would then just be dictionaries with enumerated keys.
Unpacking the data is probably more elegant in C than in other languages.
Clients are largely well segmented with clear modules based on a request/response pattern.
These could easily be turned into program boundaries, each implementing the previous response struct/unpacking.

Anonymous 27-07-25 15:02:48 No.30696

>>30505
That's going to make moving posts across threads a pain

Glownonymous 27-07-25 16:12:36 No.30697

>>30602
My gripes with other clients have made me look at the spec before, but it's a lot and i'd rather finish the ftp server i have lying around somewhere first.

The idea seems good and would work well with a lot of simple multiprocessing patterns. Personally i think the connectionless nature of the protocol would lead to a lot of process creation overhead with a forking model, so i would instead have a fixed number of worker processes servicing these. Maybe these workers could also spawn a process for each torrent and forward requests to them, which could then be treated as a heartbeat to approximate active "connections", if that makes sense.

Glownonymous 27-07-25 17:37:12 No.30699

>>30696
You're correct.

>>30697
This is what was thinking:
There's a daemon to check the tracker, establishing connections via pipeline, a program per request each subsequent program unpacking the response.
Ultimately spawning a message handler per torrent client for some subset of the clients in the tracker list.
A list of active clients should be persisted, along with a table of pieces as a lock, and information concerning global resource utilization.
Handlers commit process seppuku, or choke, if they are too slow, judging by global resource utilization, it's similar with upload.
These allow the "central" daemon to be exited and spawned periodically for example in a cronjob if that's desirable.

Glownonymous 27-07-25 17:38:55 No.30700

>>30697
>>30699
Not sure why felt the need to share what was thinking though…

Glownonymous 27-07-25 21:07:55 No.30702

>>30699
I didn't think about the fact ip addresses can basically act as session identifiers for clients, but that negates some advantages of UDP. While socket-level broadcasting isn't implemented on linux, i don't feel like this way to track sessions makes for good program architecture.

There are two cases where performance matters, which are many clients leeching few files and many clients leeching many files. The forking model is bad for both, while something to optimize specifically for the first would handling requests in a tight event loop. It shouldn't even matter if the server spawns a process connecting to it for each request or directly exchanges requests with it.
>These allow the "central" daemon to be exited and spawned periodically for example in a cronjob if that's desirable.
This isn't as important as it might seem. Polling periodically is inefficient, but if you have a fixed number of processes listening on a specific event on a file for example, you're not going to consume any computing resources or a lot of memory while they aren't running (UNIX is a timesharing system after all). This wouldn't be a bad choice as a way to ensure your daemon is reentrant though.

Unique IPs: 7

[Return][Go to top] [Catalog] | [Home][Post a Reply]

Delete Post [ File] Password

Reason

[ home / rules / faq / search ] [ overboard / sfw / alt ] [ leftypol / edu / labor / siberia / lgbt / latam / hobby / tech / games / anime / music / draw / AKM ] [ meta ] [ wiki / shop / tv / tiktok / twitter / patreon ] [ GET / ref / marx / booru ]

[ Return / Go to top /